%INCLUDE{%SYSTEMWEB%.WebChangesRightBox}% <!-- keep this as a security measure: * Set ALLOWTOPICCHANGE = Main.TWikiAdminGroup,Main.LCGAdminGroup * Set ALLOWTOPICRENAME = Main.TWikiAdminGroup,Main.LCGAdminGroup * Set GANGLIA3BASE = http://ganglia.lcg.cscs.ch/ganglia3/ #uncomment this if you want the page only be viewable by the internal people # * Set ALLOWTOPICVIEW = Main.TWikiAdminGroup,Main.LCGAdminGroup --> %CALC{"$SET(jobint,$IF($EXACT(%URLPARAM{"jobint"}%,),day,%URLPARAM{"jobint"}%))"}% %CALC{"$SET(jobintp,$IF($EXACT(%URLPARAM{"jobintp"}%,),day,%URLPARAM{"jobintp"}%))"}% %CALC{"$SET(jobintd,$IF($EXACT(%URLPARAM{"jobintd"}%,),day,%URLPARAM{"jobintd"}%))"}% %CALC{"$SET(freestorageint,$IF($EXACT(%URLPARAM{"freestorageint"}%,),week,%URLPARAM{"freestorageint"}%))"}% %CALC{"$SET(netwint,$IF($EXACT(%URLPARAM{"netwint"}%,),day,%URLPARAM{"netwint"}%))"}% %CALC{"$SET(coreint,$IF($EXACT(%URLPARAM{"coreint"}%,),day,%URLPARAM{"coreint"}%))"}% ---+!! Phoenix Monitoring Overview %TOC% ---+++ VoMonitoringDashboard Status plots from %SERVERTIME% (page is refreshed every 5 minutes). Phoenix is a cluster comprised of =Phoenix= nodes and =Daint(HPC)= nodes. The following URLs are related to external monitoring pages and VO-specific monitoring pages: CERN credentials or a personal X509 certificate installed in your browser may be required: *Monitoring Pages*: * *Site mon:* * SLURM top: [[https://lhcpublic.cscs.ch/sltop_wlcg.html][SLTOP Daint]] */* [[https://lhcpublic.cscs.ch/sltop_phoenix4.html][SLTOP Phoenix4]] * SLURM errors: [[https://lhcpublic.cscs.ch/slurmerrors_phoenix4.html][slurmerrors]] * *CMS:* [[https://etf-cms-prod.cern.ch/etf/check_mk/index.py?start_url=%2Fetf%2Fcheck_mk%2Fview.py%3Fview_name%3Dsearchhost%26host_regex%3DCSCS%26filled_in%3Dfilter][ETF]] */* [[http://dashb-ssb.cern.ch/dashboard/request.py/siteview?site=T2_CH_CSCS][Overview]] */* [[http://hammercloud.cern.ch/hc/app/cms/][CMS hammercloud]] */* [[https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=src&src_filter=&dest_filter=CSCS&no_mss=true&period=l7d&upto=][PhEDEx transfers WORLD -> CSCS]] */* [[http://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html][CMS Site Readiness reports]] * *ATLAS:* [[http://dashb-atlas-sum.cern.ch/dashboard/request.py/historicalsmryview-sum#view=siteavl&time=last12&granularity=default&profile=ATLAS_CRITICAL&group=All+sites&site=CSCS-LCG2&type=quality][Site-view]] */* [[http://dashb-atlas-ssb.cern.ch/dashboard/request.py/siteview?site=CSCS-LCG2][Overview]] */* [[http://dashb-atlas-sam.cern.ch/dashboard/request.py/latestresultssmry?siteSelect3=400&serviceTypeSelect3=0&sites=CSCS-LCG2&services=CE&services=CREAMCE&services=FTS&services=LFC_C&services=LFC_L&services=SRMv2&services=VOBOX&services=gRB&tests=74541&tests=74543&tests=37878&tests=74732&tests=74734&tests=37947&tests=74569&tests=74571&tests=74567&exitStatus=all&table=true][Tiny Panel]] */* [[http://happyface-goegrid.gwdg.de/cloudmon/CloudMon.html][Cloud Monitor]] */* Panglia ( [[http://gridinfo.triumf.ca/panglia/sites/site_detail.php?SITE=ANALY_CSCS][Analysis]] , [[http://gridinfo.triumf.ca/panglia/sites/site_detail.php?SITE=CSCS-LCG2][Production]]) */* [[http://dashb-atlas-data.cern.ch/dashboard/request.py/site?name=DE&statsInterval=4][Dashboard]] */* () */* Panda ( [[http://panda.cern.ch/server/pandamon/query?job=*&type=&hours=12&jobsetID=any&jobStatus=&site=ANALY_CSCS&cplot=yes&plot=yes&processingType=gangarobot&cplot=yes][Analysis]] , [[http://panda.cern.ch/server/pandamon/query?job=*&type=&hours=12&jobsetID=any&jobStatus=&site=&cplot=yes&plot=yes&processingType=gangarobot-pft&computingSite=CSCS-LCG2][Production]]) */* ARC ( [[http://www.nordugrid.org/monitor/loadmon.php][Cloud monitor]], [[http://panda.cern.ch/server/pandamon/query?jobsummary=site&site=ARC-T2][Panda]]) */* [[https://sam-atlas-prod.cern.ch/nagios/cgi-bin/status.cgi?hostgroup=site-CSCS-LCG2&style=detail][ATLAS Nagios for CSCS]] * *LHCb:* [[http://dashb-lhcb-ssb.cern.ch/dashboard/request.py/siteview?site=LCG.CSCS.ch][Overview]] */* [[http://dashb-lhcb-sam.cern.ch/dashboard/request.py/latestresultssmry?siteSelect3=500&serviceTypeSelect3=0&sites=LCG.CSCS.ch&services=CE&services=CREAMCE&services=FTS&services=LFC_C&services=LFC_L&services=RB&services=SRMv2&services=VOBOX&services=gRB&tests=398&tests=404&tests=405&tests=406&tests=403&tests=407&tests=37624&tests=399&tests=2&tests=5&tests=7&tests=14&tests=25&tests=37732&tests=37700&tests=37703&tests=37710&tests=37715&tests=37760&tests=51&tests=50&tests=37638&tests=37553&tests=37554&tests=37555&tests=37636&tests=37637&tests=37556&tests=37557&tests=37643&tests=37399&exitStatus=all&table=true][Tiny Panel]] */* [[http://lhcbweb.pic.es/DIRAC/LHCb-Production/visitor/jobs/SiteSummary/display][Dirac Site status]] */* [[http://dashb-lhcb-ssb.cern.ch/dashboard/request.py/sitehistory#currentView=By+tier&time=168&start_date=&end_date=&values=false&spline=false&white=false?site=LCG.CSCS.ch][Dashboard]] */* [[http://lhcbproject.web.cern.ch/lhcbproject/Operations/queues.html][Queues length]] * * Accounting*: [[http://goc-accounting.grid-support.ac.uk/rss/CSCS-LCG2_Pub.html][APEL status]] ---++ Batch jobs (Phoenix and Daint) <form name="formJobintP" action="%TOPICURL%?#Batch_jobs_on_Phoenix" method=GET> <select name="jobintp" onchange="formJobintP.submit()"> <option %CALC{"$IF($EXACT($GET(jobintp),hour),selected,)"}%>hour</option> <option %CALC{"$IF($EXACT($GET(jobintp),day),selected,)"}%>day</option> <option %CALC{"$IF($EXACT($GET(jobintp),week),selected,)"}%>week</option> <option %CALC{"$IF($EXACT($GET(jobintp),month),selected,)"}%>month</option> <option %CALC{"$IF($EXACT($GET(jobintp),year),selected,)"}%>year</option> </select> <input type="hidden" name="jobintp" value=%CALC{"$GET(jobintp)"}%> <input type="hidden" name="netwint" value=%CALC{"$GET(netwint)"}%> </form> %CALC{"$IF($EXACT($GET(jobintp),hour),$SET(jobintp2, 1h),)"}% %CALC{"$IF($EXACT($GET(jobintp),day),$SET(jobintp2, 1d),)"}% %CALC{"$IF($EXACT($GET(jobintp),week),$SET(jobintp2, 1w),)"}% %CALC{"$IF($EXACT($GET(jobintp),month),$SET(jobintp2, 1M),)"}% %CALC{"$IF($EXACT($GET(jobintp),year),$SET(jobintp2, 1y),)"}% <img alt="" src="https://lhcpublic.cscs.ch/slurm_phoenix_%CALC{"$GET(jobintp2)"}%.png" /> <img alt="" src="https://lhcpublic.cscs.ch/slurm_daint_%CALC{"$GET(jobintp2)"}%.png" /> ---+++ Cores Usage by VO (Phoenix and Daint) <form name="formCoreint" action="%TOPICURL%?#Core_Usage_on_Phoenix" method=GET> <select name="coreint" onchange="formCoreint.submit()"> <option %CALC{"$IF($EXACT($GET(coreint),hour),selected,)"}%>hour</option> <option %CALC{"$IF($EXACT($GET(coreint),day),selected,)"}%>day</option> <option %CALC{"$IF($EXACT($GET(coreint),week),selected,)"}%>week</option> <option %CALC{"$IF($EXACT($GET(coreint),month),selected,)"}%>month</option> <option %CALC{"$IF($EXACT($GET(coreint),year),selected,)"}%>year</option> </select> <input type="hidden" name="coreint" value=%CALC{"$GET(coreint)"}%> </form> %CALC{"$IF($EXACT($GET(coreint),hour),$SET(coreint2, 1h),)"}% %CALC{"$IF($EXACT($GET(coreint),day),$SET(coreint2, 1d),)"}% %CALC{"$IF($EXACT($GET(coreint),week),$SET(coreint2, 1w),)"}% %CALC{"$IF($EXACT($GET(coreint),month),$SET(coreint2, 1M),)"}% %CALC{"$IF($EXACT($GET(coreint),year),$SET(coreint2, 1y),)"}% <img alt="" src="https://lhcpublic.cscs.ch/slurm_cores_phoenix_%CALC{"$GET(coreint2)"}%.png" /> <img alt="" src="https://lhcpublic.cscs.ch/slurm_cores_daint_%CALC{"$GET(coreint2)"}%.png" /> ---++ Storage Element Links: * [[http://%SEHOST%:2288/][dCache GUI]] */* [[http://dashb-atlas-ssb.cern.ch/dashboard/request.py/sitehistory#currentView=FAX+endpoints?site=CSCS-LCG2][FAX endpoint status]] * *ATLAS:* [[http://bourricot.cern.ch/dq2/accounting/allspacetokens_view/CSCS-LCG2/30/][DQ2 accounting on srm tokens]] * *CMS:* [[http://dashb-ssb.cern.ch/dashboard/request.py/siteview?view=storage][dashboard summary view]] */* List all [[https://cmsweb.cern.ch/phedex/prod/Data::Replicas?view=global&rcolumn=Name&nvalue=Node+bytes&rows=interesting&dbs=7&dbs=11&dbs=14&dbs=15&dbs=10&dbs=16&dbs=12&dbs=2&dbs=1&dbs=3&dbs=9&dbs=13&dbs=6&dbs=4&dbs=5&dbs=22&dbs=8&node=27&filter=.*][hosted datasets]]/ [[https://cmsweb.cern.ch/phedex/prod/Request::View?type=xfer&nodes=T2_CH_CSCS&state=pend&.submit=Submit][requests]] */* [[https://cmsweb.cern.ch/phedex/prod/Data::Subscriptions?filter=.%2A;node=27][subscriptions]] */* [[https://cmsweb.cern.ch/phedex/prod/Reports::SiteUsage?node=T2_CH_CSCS][accounting per phys. group]] (also see [[https://cmsweb.cern.ch/victor/association_view/T2_CH_CSCS/AnalysisOps][victor tool]]) Free storage space: <form name="formFreestorageint" action="%TOPICURL%?#Storage_Element" method=GET> <select name="freestorageint" onchange="formFreestorageint.submit()"> <option %CALC{"$IF($EXACT($GET(freestorageint),hour),selected,)"}%>hour</option> <option %CALC{"$IF($EXACT($GET(freestorageint),day),selected,)"}%>day</option> <option %CALC{"$IF($EXACT($GET(freestorageint),week),selected,)"}%>week</option> <option %CALC{"$IF($EXACT($GET(freestorageint),month),selected,)"}%>month</option> <option %CALC{"$IF($EXACT($GET(freestorageint),year),selected,)"}%>year</option> </select> <input type="hidden" name="jobint" value=%CALC{"$GET(jobint)"}%> <input type="hidden" name="netwint" value=%CALC{"$GET(netwint)"}%> </form> %CALC{"$IF($EXACT($GET(freestorageint),hour),$SET(freestorageint2, 1h),)"}% %CALC{"$IF($EXACT($GET(freestorageint),day),$SET(freestorageint2, 1d),)"}% %CALC{"$IF($EXACT($GET(freestorageint),week),$SET(freestorageint2, 1w),)"}% %CALC{"$IF($EXACT($GET(freestorageint),month),$SET(freestorageint2, 1M),)"}% %CALC{"$IF($EXACT($GET(freestorageint),year),$SET(freestorageint2, 1y),)"}% <img alt="" src="https://lhcpublic.cscs.ch/dcache_%CALC{"$GET(freestorageint2)"}%.png" /> ---++ Networking and File Transfers Links: * *CMS* Status [[http://cmsweb.cern.ch/phedex/prod/Components::Status][Prod]] / [[http://cmsweb.cern.ch/phedex/Debug/Components::Status][Debug]] */* [[http://cmsweb.cern.ch/phedex/prod/Components::Links#?from_filter=T2_CH_CSCS&andor=or&to_filter=T2_CH_CSCS&Update=Update][PhEDEx enabled data transfer links at CSCS]] * *ATLAS* [[http://dashb-atlas-data.cern.ch/dashboard/request.py/site?statsInterval=168&name=FZK][ATLAS FZK Cloud Data activity last 7 days]] * *FTS Channels* [[http://ftm-kit.gridka.de/ftsmonitor/ftschannel.php?channel=STAR-CSCS&vo=all][all-CSCS]] */* [[http://ftm-kit.gridka.de/ftsmonitor/ftschannel.php?channel=CSCS-FZK&vo=all][CSCS-FZK]] ---++ External monitoring * CSCS external network to CERN (shared with other projects, only visible from SWITCH network, view from CERN's perspective): <img alt="" src="https://traffic.lan.switch.ch/pub/swiss-map/cms-mini-graph.cgi?type=png;target=/weathermap/ce-lug;inst=0;dslist=ifInOctets,ifOutOctets;range=ÊLC{" /> ---++ Other monitoring websites * [[http://grono.cscs.ch/switchmap/vlans/vlan64.html][CSCS Internal VLAN64]] ---++ CMS Monitoring Page [[CMSMonitoring]]
This topic: LCGTier2
>
WebHome
>
PhoenixMonOverview
Topic revision: r257 - 2019-01-17 - GianniRicciardi
Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback