<span data-mce-mark="1"><!-- keep this as a security measure:<br />* Set ALLOWTOPICCHANGE = Main.TWikiAdminGroup,Main.LCGAdminGroup,Main.EgiGroup<br />* Set ALLOWTOPICRENAME = Main.TWikiAdminGroup,Main.LCGAdminGroup<br />#uncomment this if you want the page only be viewable by the internal people<br />#* Set ALLOWTOPICVIEW = Main.TWikiAdminGroup,Main.LCGAdminGroup,Main.ChippComputingBoardGroup<br />--></span> ---+ Swiss Grid Operations Meeting on 2018-11-08 at 14:00 * *Place*: Vidyo (room: Swiss_Grid_Operations_Meeting, extension: 10537598) * *External link*: <span data-mce-mark="1">https://vidyoportal.cern.ch/flex.html?roomdirect.html&key=FAEn4zjAba7BqoQ11TGZu66VSDE</span> * *Phone gate*: From Switzerland: 0227671400 (portal) + 10537598 (extension) + # (pound sign) * *IRC chat*: <span data-mce-mark="1">irc:gridchat.cscs.ch:994#lcg</span> (ask pw via email) * *Switch Vidyo SIP IP*: 137.138.248.204 <span data-mce-mark="1"> <span data-mce-mark="1">%TOC%</span></span> ---++ Site status ---+++ CSCS * ---+++ PSI * Storage: decommissioning of old SGI and !NetApp * Infrastructure: new network patches deployment * [[http://t3mon.psi.ch/PSIT3-custom/accounting.txt][Accounting numbers (from scheduler) ]] ---+++ UNIBE-LHEP * * A bit less stable (lack of manpower), lower delivery for a few months, still fulfilling the pledge. * Ubelixed dropped out silently on 10th October * Running an average <1900 slots (typical 2500), Ubelix contribution 12% (typical 23%) * Large t2k.org run in September, 1 cluster reserved for a local user for almost the entire month<br /><br /> * <b>Accounting numbers (from scheduler) from last month (October), LHEP only</b><br /><br /><span data-mce-mark="1"><span data-mce-mark="1">%EDITTABLE{}%</span></span> <table border="1" cellpadding="0" cellspacing="1"> <tbody> <tr><th>VO</th><th>Job Type</th><th>Produced WC core-hours</th> <td> </td> <td> </td> </tr> <tr> <td>ATLAS</td> <td>Any</td> <td> <p>1157991</p> </td> <td> </td> <td> </td> </tr> <tr> <td>ops</td> <td>Any</td> <td>44</td> <td> </td> <td> </td> </tr> <tr> <td>t2k.org</td> <td>Any</td> <td> <p>0</p> </td> <td> </td> <td> </td> </tr> <tr> <td>uboone</td> <td>Any</td> <td>0</td> <td> </td> <td><br /><br /></td> </tr> </tbody> </table> <br /><img alt="" height="283" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=Daily&generic=0&sortBy=16&series=All&type=ewa" width="377" /><br /><br /> * <b>Five month history Unibe (pledge: 18 kHS06)</b><br /><img alt="" height="319" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-06-01&end=2018-10-31&timeRange=daily&granularity=Monthly&generic=0&sortBy=16&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=wchs" width="425" /> * <b>Swiss ATLAS statistics<br /></b> * * *HC availability [1]:* * CSCS-LCG2: 95% Prod, 97% Analy * CSCS-LCG2-HPC: 75% Prod, 76% Analy * UNIBE-LHEP: 99% Prod, 96% Analy * UNIBE-LHEP-UBELIX: 100% ($), Prod, 27% Analy <p>($) effectively up ~30% only</p> <p> </p> * <b>CSCS running 3300 slots on average, UNIBE running 1850<br /></b> * *Accounting numbers (from dashboard) from last month for CSCS and UNIBE* <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <span data-mce-mark="1"> <span data-mce-mark="1">%EDITTABLE{}%</span></span> | *Cluster* | *Job Type* | *Produced WC core-hours* | *Good vs Bad WC %* | *CPU eff good jobs %* | | CSCS | Any | 2901550 (69%) | 0.71 | 0.89 | | Unibe | Any | 1266896 (31%) | 0.85 | 0.85 | <br /><img alt="" height="230" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=wchs" width="307" /><img alt="" height="225" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=a" width="301" /> <img alt="" height="233" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&series=All&type=ewa" width="310" /><img alt="" height="237" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&series=All&type=ewg" width="316" /><br /><br /><img alt="" height="186" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/terminatedjobsstatus_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&sortBy=0&granularity=Daily&generic=0&series=All&type=ebwc" width="298" /><img alt="" height="205" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/efficiency_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=Daily&generic=0&sortBy=0&series=All&type=egl" width="273" /><br /><br /><br />[1] <span data-mce-mark="1">http://dashb-atlas-ssb.cern.ch/dashboard/request.py/siteviewhistorywithstatistics?columnid=562#time=custom&start_date=2018-10-01&end_date=2018-10-31&use_downtimes=false&merge_colors=false&sites=multiple&clouds=all&site=ANALY_CSCS,ANALY_CSCS-HPC,ANALY_UNIBE-LHEP,ANALY_UNIBE-LHEP-UBELIX,CSCS-LCG2-HPC_MCORE,CSCS-LCG2_MCORE,UNIBE-LHEP-UBELIX_MCORE,UNIBE-LHEP_MCORE</span> ---+++ UNIBE-ID * Enabled EGI ARGUS notification e-mails in GOCDB to respond to CE stalling silently * Opportunistic usage on Ubelix to be added as soon as the sl6 legacy partition will be discontinued * slurm pre-emptable partition * ATLAS can use idle slots * ATLAS jobs killed (not checkpointed) when slots needed by other users ---+++ UNIGE * Re-commissioning of ARC CE delayed * Distrtibuted DPM storage working well <p> </p> ---+++ NGI_CH * Our deal with EGI for certificates expires in March 2019 * Science IT support Bern is looking into what the alternative will be<br /><br /> * NGI-CH Open Tickets review | *Ticket-ID* | *Type* | *VO* | *Site* | *Priority* | *Resp. Unit* | *Status* | *Last Update* | *Subject* | *Scope* | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=138314" target="_blank">138314 </a> | <img alt="" border="0" height="10" src="https://ggus.eu/index.php?mode=download&img=team_ticket.gif" width="38" /> | atlas | CSCS-LCG2 | less urgent | NGI_CH | assigned | 2018-11-15 | DE CSCS-LCG2 : transfer failures with ... | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=138296" target="_blank">138296 </a> | | cms | CSCS-LCG2 | urgent | NGI_CH | assigned | 2018-11-14 | Transfers failing from T2_CH_CSCS | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=133695" target="_blank">133695 </a> | <img alt="" border="0" height="10" src="https://ggus.eu/index.php?mode=download&img=team_ticket.gif" width="38" /> | lhcb | CSCS-LCG2 | urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | in progress | 2018-10-19 | Data access problem at CSCS-LCG2 | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=132927" target="_blank">132927 </a> | | cms | CSCS-LCG2 | urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> involved | in progress | 2018-11-12 | Problem with APEL Accounting for all of ... | EGI | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=131965" target="_blank">131965 </a> | | none | UNIBE-LHEP | less urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | on hold | 2018-10-04 | IPv6 deployment at WLCG Tier-2 sites | EGI | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=131948" target="_blank">131948 </a> | | none | CSCS-LCG2 | less urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | in progress | 2018-11-13 | IPv6 deployment at WLCG Tier-2 sites | EGI | ---++ Other topics * <b>Follow up to fair-share meeting<br /><br /></b> * Two questions, one for the slurm experts, one for the VO reps: * is slurm charging the _reserved_ time or the _elapsed_ time to the user fair-share? * possible mitigation: pack single core jobs on nodes, as opposed to distribute them across all nodes. How does this sound? * this should reduce the node fragmentatiopn and give the MC jobs more opportunities to run timely<br /><br /> * <span style="background-color: transparent;">Other possible mitigations to be discussed internally between VOs need input from CSCS:<br /></span> * Distribution of job queue waiting time, last 2 Quarters, split by: Daint vs Phoenix, VO and 8-core vs 1-core (we should not count the T0 jobs) * Anything else?<br /><br /> * <span style="background-color: transparent;">Can we agree that the Daint and Phoenix shares (30 or 60 day historical view) will be monitored monthly at this meeting?<br /></span> * <b>Topic2</b><br />... <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> Next meeting date: ---++ A.O.B. ---++ Attendants * CSCS: * CMS: * ATLAS: * LHCb: * EGI: <p> </p> ---++ Action items * Item1
This topic: LCGTier2
>
WebHome
>
MeetingsBoard
>
MeetingSwissGridOperations20181108
Topic revision: r6 - 2018-11-15 - DinoConciatore
Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback