Tags:
tag this topic
create new tag
view all tags
<span data-mce-mark="1"><!-- keep this as a security measure:<br />* Set ALLOWTOPICCHANGE = Main.TWikiAdminGroup,Main.LCGAdminGroup,Main.EgiGroup<br />* Set ALLOWTOPICRENAME = Main.TWikiAdminGroup,Main.LCGAdminGroup<br />#uncomment this if you want the page only be viewable by the internal people<br />#* Set ALLOWTOPICVIEW = Main.TWikiAdminGroup,Main.LCGAdminGroup,Main.ChippComputingBoardGroup<br />--></span> ---+ Swiss Grid Operations Meeting on 2018-11-08 at 14:00 * *Place*: Vidyo (room: Swiss_Grid_Operations_Meeting, extension: 10537598) * *External link*: <span data-mce-mark="1">https://vidyoportal.cern.ch/flex.html?roomdirect.html&key=FAEn4zjAba7BqoQ11TGZu66VSDE</span> * *Phone gate*: From Switzerland: 0227671400 (portal) + 10537598 (extension) + # (pound sign) * *IRC chat*: <span data-mce-mark="1">irc:gridchat.cscs.ch:994#lcg</span> (ask pw via email) * *Switch Vidyo SIP IP*: 137.138.248.204 <span data-mce-mark="1"> <span data-mce-mark="1">%TOC%</span></span> ---++ Site status ---+++ CSCS * ---+++ PSI * Storage: decommissioning of old SGI and !NetApp * Infrastructure: new network patches deployment * [[http://t3mon.psi.ch/PSIT3-custom/accounting.txt][Accounting numbers (from scheduler) ]] ---+++ UNIBE-LHEP * * A bit less stable (lack of manpower), lower delivery for a few months, still fulfilling the pledge. * Ubelixed dropped out silently on 10th October * Running an average <1900 slots (typical 2500), Ubelix contribution 12% (typical 23%) * Large t2k.org run in September, 1 cluster reserved for a local user for almost the entire month<br /><br /> * <b>Accounting numbers (from scheduler) from last month (October), LHEP only</b><br /><br /><span data-mce-mark="1"><span data-mce-mark="1">%EDITTABLE{}%</span></span> <table border="1" cellpadding="0" cellspacing="1"> <tbody> <tr><th>VO</th><th>Job Type</th><th>Produced WC core-hours</th> <td> </td> <td> </td> </tr> <tr> <td>ATLAS</td> <td>Any</td> <td> <p>1157991</p> </td> <td> </td> <td> </td> </tr> <tr> <td>ops</td> <td>Any</td> <td>44</td> <td> </td> <td> </td> </tr> <tr> <td>t2k.org</td> <td>Any</td> <td> <p>0</p> </td> <td> </td> <td> </td> </tr> <tr> <td>uboone</td> <td>Any</td> <td>0</td> <td> </td> <td><br /><br /></td> </tr> </tbody> </table> <br /><img alt="" height="283" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=Daily&generic=0&sortBy=16&series=All&type=ewa" width="377" /><br /><br /> * <b>Five month history Unibe (pledge: 18 kHS06)</b><br /><img alt="" height="319" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-06-01&end=2018-10-31&timeRange=daily&granularity=Monthly&generic=0&sortBy=16&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=wchs" width="425" /> * <b>Swiss ATLAS statistics<br /></b> * * *HC availability [1]:* * CSCS-LCG2: 95% Prod, 97% Analy * CSCS-LCG2-HPC: 75% Prod, 76% Analy * UNIBE-LHEP: 99% Prod, 96% Analy * UNIBE-LHEP-UBELIX: 100% ($), Prod, 27% Analy <p>($) effectively up ~30% only</p> <p> </p> * <b>CSCS running 3300 slots on average, UNIBE running 1850<br /></b> * *Accounting numbers (from dashboard) from last month for CSCS and UNIBE* <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <span data-mce-mark="1"> <span data-mce-mark="1">%EDITTABLE{}%</span></span> | *Cluster* | *Job Type* | *Produced WC core-hours* | *Good vs Bad WC %* | *CPU eff good jobs %* | | CSCS | Any | 2901550 (69%) | 0.71 | 0.89 | | Unibe | Any | 1266896 (31%) | 0.85 | 0.85 | <br /><img alt="" height="230" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=wchs" width="307" /><img alt="" height="225" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/resourceutilization_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&diag1=0&diag2=0&diag3=0&diag4=0&diag5=0&diag6=0&diag7=0&diag8=0&diagT=0&diag8pl=0&series=All&type=a" width="301" /> <img alt="" height="233" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&series=All&type=ewa" width="310" /><img alt="" height="237" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/consumptions_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=8 Hours&generic=0&sortBy=0&series=All&type=ewg" width="316" /><br /><br /><img alt="" height="186" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/terminatedjobsstatus_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&sortBy=0&granularity=Daily&generic=0&series=All&type=ebwc" width="298" /><img alt="" height="205" src="http://dashb-atlas-job.cern.ch/dashboard/request.py/efficiency_individual?sites=CSCS-LCG2&sites=UNIBE-LHEP&sitesCat=All Countries&resourcetype=All&sitesSort=2&sitesCatSort=0&start=2018-10-01&end=2018-10-31&timeRange=daily&granularity=Daily&generic=0&sortBy=0&series=All&type=egl" width="273" /><br /><br /><br />[1] <span data-mce-mark="1">http://dashb-atlas-ssb.cern.ch/dashboard/request.py/siteviewhistorywithstatistics?columnid=562#time=custom&start_date=2018-10-01&end_date=2018-10-31&use_downtimes=false&merge_colors=false&sites=multiple&clouds=all&site=ANALY_CSCS,ANALY_CSCS-HPC,ANALY_UNIBE-LHEP,ANALY_UNIBE-LHEP-UBELIX,CSCS-LCG2-HPC_MCORE,CSCS-LCG2_MCORE,UNIBE-LHEP-UBELIX_MCORE,UNIBE-LHEP_MCORE</span> ---+++ UNIBE-ID * Enabled EGI ARGO notification e-mails in GOCDB to respond to CE stalling silently * Opportunistic usage on Ubelix to be added as soon as the sl6 legacy partition will be discontinued * slurm pre-emptable partition * ATLAS can use idle slots * ATLAS jobs killed (not checkpointed) when slots needed by other users ---+++ UNIGE * Re-commissioning of ARC CE delayed * Distrtibuted DPM storage working well <p> </p> ---+++ NGI_CH * Our deal with EGI for certificates expires in March 2019 * Science IT support Bern is looking into what the alternative will be<br /><br /> <p> </p> <p> </p> * NGI-CH Open Tickets review | *Ticket-ID* | *Type* | *VO* | *Site* | *Priority* | *Resp. Unit* | *Status* | *Last Update* | *Subject* | *Scope* | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=138314" target="_blank">138314 </a> | <img alt="" border="0" height="10" src="https://ggus.eu/index.php?mode=download&img=team_ticket.gif" width="38" /> | atlas | CSCS-LCG2 | less urgent | NGI_CH | assigned | 2018-11-15 | DE CSCS-LCG2 : transfer failures with ... | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=138296" target="_blank">138296 </a> | | cms | CSCS-LCG2 | urgent | NGI_CH | assigned | 2018-11-14 | Transfers failing from T2_CH_CSCS | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=133695" target="_blank">133695 </a> | <img alt="" border="0" height="10" src="https://ggus.eu/index.php?mode=download&img=team_ticket.gif" width="38" /> | lhcb | CSCS-LCG2 | urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | in progress | 2018-10-19 | Data access problem at CSCS-LCG2 | WLCG | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=132927" target="_blank">132927 </a> | | cms | CSCS-LCG2 | urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> involved | in progress | 2018-11-12 | Problem with APEL Accounting for all of ... | EGI | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=131965" target="_blank">131965 </a> | | none | UNIBE-LHEP | less urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | on hold | 2018-10-04 | IPv6 deployment at WLCG Tier-2 sites | EGI | | <a href="https://ggus.eu/index.php?mode=ticket_info&ticket_id=131948" target="_blank">131948 </a> | | none | CSCS-LCG2 | less urgent | NGI_CH <img alt="" border="0" src="https://ggus.eu/index.php?mode=download&img=tri.gif" /> assigned | in progress | 2018-11-13 | IPv6 deployment at WLCG Tier-2 sites | EGI | ---++ Other topics * <b>Follow up to fair-share meeting<br /><br /></b> * Two questions, one for the slurm experts, one for the VO reps: * is slurm charging the _reserved_ time or the _elapsed*cores_ time to the user fair-share? * NICK: no, it is using (endtime-starttime)*cores<br /><br /> * possible mitigation: pack single core jobs on nodes, as opposed to distribute them across all nodes. How does this sound? * this should reduce the node fragmentation and give the MC jobs more opportunities to run timely * NICK: cannot comment at the moment, will look at it<br /><br /> * <span style="background-color: transparent;">Other possible mitigations to be discussed internally between VOs _need_ input from CSCS:<br /></span> * Distribution of job queue waiting time, last 2 Quarters, split by: Daint vs Phoenix, VO and 8-core vs 1-core (we should exclude from these plots the T0 jobs) * NICK: CSCS will investigate providing queue wait time reporting * Anything else? * NICK: Move forward with Stefano’s recommendation on Tuesday for a face-to-face meeting, preferably before the end of the year<br /><br /> * <span style="background-color: transparent;">Can we agree that the Daint and Phoenix shares (30 or 60 day historical view) will be monitored monthly at this meeting?<br /></span> * GIANFRANCO: not discussed<br /><br /> * <b>Topic2</b><br />... <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> <p> </p> Next meeting date: ---++ A.O.B. ---++ Attendants * CSCS: * CMS: * ATLAS: * LHCb: * EGI: <p> </p> ---++ Action items * Item1
E
dit
|
A
ttach
|
Watch
|
P
rint version
|
H
istory
: r8
<
r7
<
r6
<
r5
<
r4
|
B
acklinks
|
V
iew topic
|
Ra
w
edit
|
M
ore topic actions
Topic revision: r8 - 2018-11-16
-
GianfrancoSciacca
LCGTier2
Log In
(Topic)
LCGTier2 Web
Create New Topic
Index
Search
Changes
Notifications
Statistics
Preferences
Users
Entry point / Contact
RoadMap
ATLAS Pages
CMS Pages
CMS User Howto
CHIPP CB
Outreach
Technical
Cluster details
Services
Hardware and OS
Tools & Tips
Monitoring
Logs
Maintenances
Meetings
Tests
Issues
Blog
Home
Site map
CmsTier3 web
LCGTier2 web
PhaseC web
Main web
Sandbox web
TWiki web
LCGTier2 Web
Users
Groups
Index
Search
Changes
Notifications
RSS Feed
Statistics
Preferences
P
View
Raw View
Print version
Find backlinks
History
More topic actions
Edit
Raw edit
Attach file or image
Edit topic preference settings
Set new parent
More topic actions
Warning: Can't find topic "".""
Account
Log In
E
dit
A
ttach
Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback