<span data-mce-mark="1"><!-- keep this as a security measure:<br />* Set ALLOWTOPICCHANGE = Main.TWikiAdminGroup,Main.LCGAdminGroup,Main.EgiGroup<br />* Set ALLOWTOPICRENAME = Main.TWikiAdminGroup,Main.LCGAdminGroup<br />#uncomment this if you want the page only be viewable by the internal people<br />#* Set ALLOWTOPICVIEW = Main.TWikiAdminGroup,Main.LCGAdminGroup,Main.ChippComputingBoardGroup<br />--></span> ---+ Swiss Grid Operations Meeting on 2018-11-08 at 14:00 * *Place*: Vidyo (room: Swiss_Grid_Operations_Meeting, extension: 10537598) * *External link*: <span data-mce-mark="1">https://vidyoportal.cern.ch/flex.html?roomdirect.html&key=FAEn4zjAba7BqoQ11TGZu66VSDE</span> * *Phone gate*: From Switzerland: 0227671400 (portal) + 10537598 (extension) + # (pound sign) * *IRC chat*: <span data-mce-mark="1">irc:gridchat.cscs.ch:994#lcg</span> (ask pw via email) * *Switch Vidyo SIP IP*: 137.138.248.204 <span data-mce-mark="1"> <span data-mce-mark="1">%TOC%</span></span> ---++ Site status ---+++ CSCS * ---+++ PSI * Storage: decommissioning of old SGI and !NetApp * Infrastructure: new network patches deployment * [[http://t3mon.psi.ch/PSIT3-custom/accounting.txt][Accounting numbers (from scheduler) ]] ---+++ UNIBE-LHEP * * A bit less stable (lack of manpower), lower delivery for a few months, still fulfilling the pledge. 
* Ubelix dropped out silently on 10th October * Running on average fewer than 1900 slots (typical 2500), Ubelix contribution 12% (typical 23%) * Large t2k.org run in September, 1 cluster reserved for a local user for almost the entire month<br /><br /> * <b>Accounting numbers (from scheduler) from last month (October), LHEP only</b><br /><br /><span data-mce-mark="1"><span data-mce-mark="1">%EDITTABLE{}%</span></span> <table border="1" cellpadding="0" cellspacing="1"> <tbody> <tr><th>VO</th><th>Job Type</th><th>Produced WC core-hours</th> <td> </td> <td> </td> </tr> <tr> <td>ATLAS</td> <td>Any</td> <td> <p>1157991</p> </td> <td> </td> <td> </td> </tr> <tr> <td>ops</td> <td>Any</td> <td>44</td> <td> </td> <td> </td> </tr> <tr> <td>t2k.org</td> <td>Any</td> <td> <p>0</p> </td> <td> </td> <td> </td> </tr> <tr> <td>uboone</td> <td>Any</td> <td>0</td> <td> </td> <td><br /><br /></td> </tr> </tbody> </table> <br /><img alt="" height="283" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=Daily&generic=0&sortBy=16&series=All&type=ewa" width="377" /><br /><br /> * <b>Five month history Unibe (pledge: 18 kHS06)</b><br /><img alt="" height="319" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-06-01&end=2018-10-31&timeRange=daily&granularity=Monthly&generic=0&sortBy=16&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=wchs" width="425" /> * <b>Swiss ATLAS statistics<br /></b> * * *HC availability [1]:* * CSCS-LCG2: 95% Prod, 97% Analy * CSCS-LCG2-HPC: 75% Prod, 76% Analy * UNIBE-LHEP: 99% Prod, 96% Analy * UNIBE-LHEP-UBELIX: 100% ($) Prod, 27% Analy <p>($) effectively up ~30% only</p> <p> </p> * <b>CSCS running 3300 slots on 
average, UNIBE running 1850<br /></b> * *Accounting numbers (from dashboard) from last month for CSCS and UNIBE* <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <span data-mce-mark="1"> <span data-mce-mark="1">%EDITTABLE{}%</span></span> | *Cluster* | *Job Type* | *Produced WC core-hours* | *Good vs Bad WC %* | *CPU eff good jobs %* | | CSCS | Any | 2901550 (69%) | 0.71 | 0.89 | | Unibe | Any | 1266896 (31%) | 0.85 | 0.85 | <br /><img alt="" height="230" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=wchs" width="307" /><img alt="" height="225" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=a" width="301" /> <img alt="" height="233" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&series=All&type=ewa" width="310" /><img alt="" height="237" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&series=All&type=ewg" width="316" /><br /><br /><img alt="" height="186" 
src="http://dashb-atlas-job.cern.ch/dashboard/request.py/terminatedjobsstatus_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&sortBy=0&granularity=Daily&generic=0&series=All&type=ebwc" width="298" /><img alt="" height="205" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/efficiency_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=Daily&generic=0&sortBy=0&series=All&type=egl" width="273" /><br /><br /><br />[1] <span data-mce-mark="1">http://dashb-atlas-ssb.cern.ch/dashboard/request.py/siteviewhistorywithstatistics?columnid=562#time=custom&start_date=2018-10-01&end_date=2018-10-31&use_downtimes=false&merge_colors=false&sites=multiple&clouds=all&site=ANALY_CSCS,ANALY_CSCS-HPC,ANALY_UNIBE-LHEP,ANALY_UNIBE-LHEP-UBELIX,CSCS-LCG2-HPC_MCORE,CSCS-LCG2_MCORE,UNIBE-LHEP-UBELIX_MCORE,UNIBE-LHEP_MCORE</span> ---+++ UNIBE-ID * Enabled EGI ARGO notification e-mails in GOCDB to respond to the CE stalling silently * Opportunistic usage on Ubelix to be added as soon as the sl6 legacy partition is discontinued * slurm pre-emptable partition * ATLAS can use idle slots * ATLAS jobs are killed (not checkpointed) when slots are needed by other users ---+++ UNIGE * Re-commissioning of ARC CE delayed * Distributed DPM storage working well <p> </p> ---+++ NGI_CH * Our deal with EGI for certificates expires in March 2019 * Science IT support Bern is looking into what the alternative will be<br /><br /> <p> </p> * NGI-CH Open Tickets review | *Ticket-ID* | *Type* | *VO* | *Site* | *Priority* | *Resp. 
Unit* | *Status* | *Last Update* | *Subject* | *Scope* | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=138314" target="_blank">138314 </a> | <img alt="" border="0" height="10" src="https://ggus.eu/index.php?mode=download&img=team_ticket.gif" width="38" /> | atlas | CSCS-LCG2 | less urgent | NGI_CH | assigned | 2018-11-15 | DE CSCS-LCG2 : transfer failures with ... | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=138296" target="_blank">138296 </a> | | cms | CSCS-LCG2 | urgent | NGI_CH | assigned | 2018-11-14 | Transfers failing from T2_CH_CSCS | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=133695" target="_blank">133695 </a> | <img alt="" border="0" height="10" src="https://ggus.eu/index.php?mode=download&img=team_ticket.gif" width="38" /> | lhcb | CSCS-LCG2 | urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | in progress | 2018-10-19 | Data access problem at CSCS-LCG2 | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=132927" target="_blank">132927 </a> | | cms | CSCS-LCG2 | urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> involved | in progress | 2018-11-12 | Problem with APEL Accounting for all of ... 
| EGI | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=131965" target="_blank">131965 </a> | | none | UNIBE-LHEP | less urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | on hold | 2018-10-04 | IPv6 deployment at WLCG Tier-2 sites | EGI | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=131948" target="_blank">131948 </a> | | none | CSCS-LCG2 | less urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | in progress | 2018-11-13 | IPv6 deployment at WLCG Tier-2 sites | EGI | ---++ Other topics * <b>Follow up to fair-share meeting<br /><br /></b> * Two questions, one for the slurm experts, one for the VO reps: * Is slurm charging the _reserved_ time or the _elapsed_ time to the user fair-share? * Possible mitigation: pack single-core jobs on nodes, as opposed to distributing them across all nodes. How does this sound? * This should reduce the node fragmentation and give the MC jobs more opportunities to run in a timely manner<br /><br /> * <span style="background-color: transparent;">Other possible mitigations, to be discussed internally between VOs, need input from CSCS:<br /></span> * Distribution of job queue waiting time, last 2 quarters, split by: Daint vs Phoenix, VO and 8-core vs 1-core (we should not count the T0 jobs) * Anything else?<br /><br /> * <span style="background-color: transparent;">Can we agree that the Daint and Phoenix shares (30 or 60 day historical view) will be monitored monthly at this meeting?<br /></span> * <b>Topic2</b><br />... Next meeting date: ---++ A.O.B. ---++ Attendants * CSCS: * CMS: * ATLAS: * LHCb: * EGI: <p> </p> ---++ Action items * Item1
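Illustration for the fair-share follow-up under "Other topics": a minimal sketch of why packing single-core jobs may leave more room for 8-core jobs than spreading them. This is a toy model and not slurm's actual placement logic; the function names, the node count (4), the cores per node (12) and the number of single-core jobs (20) are all assumptions chosen for illustration.

```python
def place_single_core_jobs(num_nodes, cores_per_node, num_jobs, strategy):
    """Place single-core jobs on an empty toy cluster.

    Returns the list of free cores per node after placement.
    strategy is "spread" (least-loaded node first) or "pack" (fill nodes in order).
    """
    free = [cores_per_node] * num_nodes
    for _ in range(num_jobs):
        # Nodes that still have at least one free core
        candidates = [n for n in range(num_nodes) if free[n] > 0]
        if not candidates:
            raise RuntimeError("no free cores left")
        if strategy == "spread":
            # Node with the most free cores; ties go to the lowest index
            node = max(candidates, key=lambda n: free[n])
        elif strategy == "pack":
            # First node that still has room
            node = candidates[0]
        else:
            raise ValueError("unknown strategy: " + strategy)
        free[node] -= 1
    return free


def nodes_with_room(free, width=8):
    """Count nodes that could still start a job needing `width` cores."""
    return sum(1 for f in free if f >= width)


if __name__ == "__main__":
    for strategy in ("spread", "pack"):
        free = place_single_core_jobs(4, 12, 20, strategy)
        print(strategy, free, "nodes fitting an 8-core job:", nodes_with_room(free))
```

With these assumed numbers, spreading leaves 7 free cores on every node, so no node can start an 8-core job, while packing leaves two nodes completely empty. Real clusters have running jobs of mixed lengths, so the actual effect would need to be checked against the waiting-time distributions requested from CSCS.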
Topic revision: r7 - 2018-11-15
-
GianfrancoSciacca
Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.