create new tag
view all tags

Phoenix Monitoring Overview


Status plots from 2018-12-17 - 03:07 (page is refreshed every 5 minutes). Historical view here.

Warning, important Note: The now values printed at the bottom of some graphs are not correct.

Phoenix is a cluster comprised of Phoenix nodes and Phoenix4 nodes. Phoenix4 is simply a new way of calling the cluster, with a newer, dedicated Slurm configuration. Over time, all nodes of Phoenix will be moved to Phoenix4 Slurm and charts.

Site Tests: Region DE ( Nagios, history) / MIDMON Nagios APEL status / Site-View / MyWLCG site availability / WLCG Transfers dashboard

Batch jobs (Phoenix and Daint)

Usage by VO (Phoenix & Phoenix4)

Usage by VO (Phoenix)

Show... Hide

Usage by CE (Phoenix)

Show... Hide

Older charts

Show... Hide

Usage by VO (static charts)

Usage by CE

Worker nodes Phoenix + Phoenix4

<img src="http://ganglia.lcg.cscs.ch/ganglia3/graph.php?g=load_report&z=medium&c=PHOENIX-workers&m=&r=day&s=descending&hc=4&st=now" /> <img src="http://ganglia.lcg.cscs.ch/ganglia3/graph.php?g=cpu_report&z=medium&c=PHOENIX-workers&m=&r=day&s=descending&hc=4&st=now" /> <img src="http://ganglia.lcg.cscs.ch/ganglia3/graph.php?g=mem_report&z=medium&c=PHOENIX-workers&m=&r=day&s=descending&hc=4&st=now" />

Storage Element


Free storage space:

The below plot is from the dcache web ui representing active movers.

Networking and File Transfers


Plotting interval:

<img src="http://ganglia.lcg.cscs.ch/ganglia3/graph.php?g=network_report&z=medium&c=PHOENIX-workers&m=&r=day&s=descending&hc=4&st=now" /> <img src="http://ganglia.lcg.cscs.ch/ganglia3/graph.php?g=network_report&z=medium&c=PHOENIX-fileservers&m=&r=day&s=descending&hc=4&st=now" /> <img src="http://ganglia.lcg.cscs.ch/ganglia3/graph.php?g=network_report&z=medium&c=PHOENIX-gpfs2&m=&r=day&s=descending&hc=4&st=now" /> <img src="http://ganglia.lcg.cscs.ch/ganglia3/graph.php?g=network_report&z=medium&c=PHOENIX-services&m=&r=day&s=descending&hc=4&st=now" />

Number of active dCache movers (transfers) on the WAN (gridftp, srm) / LAN ("regular",dcap):

Number of queued dCache movers (waiting transfers) on the WAN (gridftp, srm) / LAN ("regular",dcap):

Number of pending dCache requests:

External monitoring

  • CSCS external network to CERN (shared with other projects, only visible from SWITCH network, view from CERN's perspective):

  • EGI Nagios from FZK, last 5 days (hover mouse to see hostname)

87" align="center" style="border-right: 1px solid; margin:0px; padding:0px;"> 13 14 15 16 12" align="center" style="border-right: none; margin:0px; padding:0px; ">17

  • Power consumption (Island 1)

  • Gridview current status graphs

Other monitoring websites

CMS Monitoring Page


Edit | Attach | Watch | Print version | History: r253 < r252 < r251 < r250 < r249 | Backlinks | Raw View | Raw edit | More topic actions
Topic revision: r253 - 2018-12-11 - GianniRicciardi
This site is powered by the TWiki collaboration platform Powered by Perl This site is powered by the TWiki collaboration platformCopyright © 2008-2018 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback