Tags:
tag this topic
create new tag
view all tags
<!-- keep this as a security measure: * Set ALLOWTOPICCHANGE = Main.TWikiAdminGroup,Main.LCGAdminGroup * Set ALLOWTOPICRENAME = Main.TWikiAdminGroup,Main.LCGAdminGroup #uncomment this if you want the page only be viewable by the internal people # * Set ALLOWTOPICVIEW = Main.TWikiAdminGroup,Main.LCGAdminGroup --> ---+ How to check the CSCS Tier-2 status for CMS site contacts / site managers *This is a small routine which should be performed once a day by the responsible CMS site contact*. Some of these things can and should be automatized at some point, but the manual check does not take much time and will increase your understanding of the system. All the basic information and links can be found on our main monitoring page: PhoenixMonOverview. The following list basically tells you at what you should look on this page. 1. *Look at the three pie charts* for the worker nodes, service nodes, and the file servers. <br>The service and fileserver pie charts must show no black parts (i.e. nodes down). A few worker nodes that are down are not so critical, but you still may want to contact the site admins. 1. *Check all SAM tests* using the links towards the top of the page, the CMS SAM tests being the most important ones for us. 1. *Check the graphs for running and queued jobs.* <br> You should only see a number of queued CMS jobs, if the cluster is filled with running jobs. If jobs stay in the queue despite free slots on the cluster, something with the scheduling is wrong. 1. *Check the free storage space graph* for CMS, and take note of the trend shown over the last week. <br> You can check how much space is taken up by users and datasets by using the Links below the _Storage Element_ section. 1. *Take a look at the graphs for the dcache movers.* If you see a large number of queued movers (especially if it is still growing) you may want to notify the CSCS admins. In case of problems you may also want to look at the Pool Transfer Queues, Active Transfers, and Detailed Tape Transfer Queue (don't be misguided by this name - it applies to disk transfer problems, too) in the _dCache GUI_. 1. *Check !Phedex* by looking at the log analyzer output on the !PhEDEx download and export pages (links are located below _Networking and File Transfers_) * I there is zero activity, make sure that the !Phedex processes are up * If there are lots of transfer errors, try to analyze them based on what you see in the log analyzer and [[https://savannah.cern.ch/support/?func=additem&group=cmscompinfrasup][post a support request on savannah]] (assign to cmscompinfrasup-datatransfer group or contact the responsible site admins directly. 1. *Check whether there are any pending data set requests* (There is a link to the correct page below the _Storage Element_ section). <br> The decision whether to allow the request must be based on the available space and policy -- Main.DerekFeichtinger - 27 Nov 2008
E
dit
|
A
ttach
|
Watch
|
P
rint version
|
H
istory
: r4
<
r3
<
r2
<
r1
|
B
acklinks
|
V
iew topic
|
Ra
w
edit
|
M
ore topic actions
Topic revision: r4 - 2009-02-05
-
DerekFeichtinger
LCGTier2
Log In
(Topic)
LCGTier2 Web
Create New Topic
Index
Search
Changes
Notifications
Statistics
Preferences
Users
Entry point / Contact
RoadMap
ATLAS Pages
CMS Pages
CMS User Howto
CHIPP CB
Outreach
Technical
Cluster details
Services
Hardware and OS
Tools & Tips
Monitoring
Logs
Maintenances
Meetings
Tests
Issues
Blog
Home
Site map
CmsTier3 web
LCGTier2 web
PhaseC web
Main web
Sandbox web
TWiki web
LCGTier2 Web
Users
Groups
Index
Search
Changes
Notifications
RSS Feed
Statistics
Preferences
P
View
Raw View
Print version
Find backlinks
History
More topic actions
Edit
Raw edit
Attach file or image
Edit topic preference settings
Set new parent
More topic actions
Warning: Can't find topic "".""
Account
Log In
E
dit
A
ttach
Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback