<!-- keep this as a security measure:
* Set ALLOWTOPICCHANGE =
TWikiAdminGroup,Main.LCGAdminGroup,Main.EgiGroup
* Set ALLOWTOPICRENAME =
TWikiAdminGroup,Main.LCGAdminGroup
#uncomment this if you want the page only be viewable by the internal people
#* Set ALLOWTOPICVIEW =
TWikiAdminGroup,Main.LCGAdminGroup,Main.ChippComputingBoardGroup
-->
Swiss Grid Operations Meeting on 2018-12-06 at 14:00
Site status
CSCS
System
- Started planning services transition to Piz Daint. Question about the new ARC CE version to Gianfranco
- Issues with some nodes and slurm subnet, reconfiguration needed.
- Grafana dashboard is ready to be exposed (data sync in progress)
dCache
- Process to IPv6 / dual-stack completed. Required quite some effort
- We need to add dual-stack to CMS02 too
- Updated to 3.2.40
- Planning next year data migration (due to a complete storage renewal)
- Planning head nodes (re)installation
Scratch
- Stable operations
- Much improved IO between (Daint) DVS nodes and storage nodes since the last DVS upgrade
- Outperforming SSD cache nominal performance (see attachment "ssd-cache-perf.png)"
- We should have a further improvement with the new software
PSI
UNIBE-LHEP
-
- Stable operation, slightly lower delivery for LHEP (dying nodes). Pledged: 18k, delivered 20.6k
- Ubelix back on 8th November
- Running an average >2100 slots (<1900 last month, 2500 typical), Ubelix back to 23% (typical)
-
Accounting numbers (from scheduler) from last month (October), LHEP only
6-month history Unibe (pledge: 18 kHS06)

- Swiss ATLAS statistics
- HC availability
- could not retrieve data
- Runnins slots
- CSCS: 3000 (3300 October) ; UniBe: 2150 (1850 October)
- Accounting Numbers from ATLAs dashboard (November) CSCS+UniBe




- ARC6 upgrade heads-up
- At some point in 2019
- Mayor arc.conf rewrite
- At the recent NorduGrid developer retreat, I have produced a preliminary conversion to ARC6 of the arc.conf for arc04@lcg.cscs.ch
UNIBE-ID
Stable operations in November after the stuck a-rex issue in October
During the mid December's maitenance down:Deommissioning of nodes that comprise the el6legacy partition
Setup of a subordinate partition for preemptable jobs of the ATLAS experiment at the same tabme
No sysadmin from UNIBE-ID can join this afternoon
UNIGE
Discussing this week how to revive the ARC CE
@UniGe
NGI_CH
NGI-CH Open Tickets review
Ticket-ID |
Type |
VO |
Site |
Priority |
Resp. Unit |
Status |
Last Update |
Subject |
Scope |
138592 |
|
cms |
CSCS-LCG2 |
urgent |
NGI_CH |
waiting for reply |
2018-12-06 |
Transfers failing from T2_CH_CSCS to ... |
WLCG |
138296 |
|
cms |
CSCS-LCG2 |
urgent |
NGI_CH |
waiting for reply |
2018-12-05 |
Transfers failing from T2_CH_CSCS |
WLCG |
133695 |
|
lhcb |
CSCS-LCG2 |
urgent |
NGI_CH assigned |
waiting for reply |
2018-11-30 |
Data access problem at CSCS-LCG2 |
WLCG |
131965 |
|
none |
UNIBE-LHEP |
less urgent |
NGI_CH assigned |
on hold |
2018-11-15 |
IPv6 deployment at WLCG Tier-2 sites |
EGI |
131948 |
|
none |
CSCS-LCG2 |
less urgent |
NGI_CH assigned |
in progress |
2018-12-03 |
IPv6 deployment at WLCG Tier-2 sites |
EGI |
Other topics
Update on experiment share re-balance
- Discussed within CHIPP
- Internal meeting next week to finalise decision
- Current direction (not final):
- Reduce max WC for all VOs at the same level
- Pack single core jobs to nodes (as opposed to spread them)
- Trial period of 1-2 months
Attachment below: ATLAS pending jobs (last 90 days)
Topic2
Next meeting date:
A.O.B.
Attendants
CSCS:
CMS:
ATLAS:
LHCb:
EGI:
Action items
Item1
ssd-cache-perf.png: