<!-- keep this as a security measure:
* Set ALLOWTOPICCHANGE =
TWikiAdminGroup,Main.LCGAdminGroup,Main.EgiGroup
* Set ALLOWTOPICRENAME =
TWikiAdminGroup,Main.LCGAdminGroup
#uncomment this if you want the page only be viewable by the internal people
#* Set ALLOWTOPICVIEW =
TWikiAdminGroup,Main.LCGAdminGroup,Main.ChippComputingBoardGroup
-->
Swiss Grid Operations Meeting on 2017-06-08 at 14:00
Site status
CSCS
HEPiX short report:
TIMETABLE:
https://indico.cern.ch/event/595396/timetable/#20170424.detailed
- CSCS site report:
- Around 120 participants
- Many sites started using OpenStack
- CERN migration from AFS (impacted various sites). Replacement options: CERN Box, BNL Box, IHEP Box, EOS, CVMFS, etc.
- Many sites reported consolidation and new facilities
- Increasing adoption of CentOS/SL7
- Migration to Grafana for many sites
- Increasing adoption of IPv6 dual-stack at many sites
- Puppet migration from 3 to 4
- mostly smooth
- main issue: cron is now integrated in Puppet (the separate cron module is no longer needed)
- Very nice presentation on Elasticsearch at CERN
- Consolidation and unification of CERN IT monitoring
- Old school logging at Blabit and IN2P3
Systems:
- During the maintenance, updated and cleaned all VMs
- Some swapping issues; working to track the memory and CPU usage of all jobs live
- Working on a new node monitoring dashboard (CPU, memory, swap, etc. usage) with Grafana and Metricbeat
Storage:
dCache
- - update from 2.13.50 to 2.13.58 issue
- - migrated ~1.2PB and decommissioned the IBM DCS3700, deployed new ~1.6PB from NetApp E5600 systems
- - working with CMS support on the spacemon deployment
- - working on the pp dCache cluster (future upgrade)
GPFS
- - new GPFS servers ready, will be deployed online
- - GPFS new layout in place: Server-Cluster / WN-Cluster / Daint-Cluster
PSI
UNIBE-LHEP
- New hardware, partially installed: 480 cores (E5-2630 v4 @ 2.20GHz) and 150 TB of storage
- HammerCloud status
100% UNIBE-LHEP, UNIBE-ID, UNIGE-DPNC, ~70% for CSCS*
http://dashb-atlas-ssb.cern.ch/dashboard/request.py/siteviewhistorywithstatistics?columnid=562&view=Shifter%20view#time=720&start_date=&end_date=&use_downtimes=false&merge_colors=false&sites=multiple&clouds=all&site=ANALY_CSCS,ANALY_CSCS-HPC,ANALY_UNIBE-LHEP,ANALY_UNIBE-LHEP-UBELIX,CSCS-LCG2,CSCS-LCG2-HPC,CSCS-LCG2-HPC_MCORE,CSCS-LCG2_MCORE,UNIBE-LHEP,UNIBE-LHEP-UBELIX,UNIBE-LHEP-UBELIX_MCORE,UNIBE-LHEP_CLOUD,UNIBE-LHEP_CLOUD_MCORE,UNIBE-LHEP_MCORE,UNIGE-DPNC,UNIGE-DPNC_MCORE
- Accounting numbers (from scheduler) from May 2017
| VO | Job Type | Produced WC core-hours |
| ATLAS | Any | 1034439 |
| ops | Any | 14 |
| t2k.org | Any | 0 |
| uboone | Any | 6795 |
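The accounting numbers above can be turned into per-VO shares and a rough average core occupancy. The following is a minimal sketch, not part of the site's accounting tooling: the figures are copied from the table, and the assumption that they cover the full calendar month of May 2017 (31 days) is ours.

```python
# Core-hours per VO, copied from the May 2017 accounting table above.
core_hours = {
    "ATLAS": 1034439,
    "ops": 14,
    "t2k.org": 0,
    "uboone": 6795,
}

# Assumption: the table covers the whole of May 2017 (31 days).
HOURS_IN_MAY = 31 * 24

total = sum(core_hours.values())


def share(vo: str) -> float:
    """Fraction of the total wall-clock core-hours used by one VO."""
    return core_hours[vo] / total


def average_cores(vo: str) -> float:
    """Average number of cores kept busy by one VO over the month."""
    return core_hours[vo] / HOURS_IN_MAY


# Print the VOs from largest to smallest consumer.
for vo, hours in sorted(core_hours.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{vo:8s} {hours:>9d} core-h  {share(vo):7.2%}  "
          f"~{average_cores(vo):.1f} cores on average")
```

The average occupancy is only indicative, since it spreads the monthly total evenly and ignores downtimes and the partially installed new hardware.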
UNIBE-ID
- Deploying Singularity on Ubelix. Use case for ATLAS, since the cluster will transition to CentOS 7 by the end of the year and this OS is still a bit problematic for ATLAS. They will soon start testing with the homegrown image also used by UNIBE-LHEP for running on SwitchEngines.
UNIGE
- Hardware: Upgraded 6 machines (in May 2017)
- The machines were acquired in 2012 and upgraded in May 2017
- Added 6x32 (192) cores and 6x96 (576) GB of RAM: 1 User Interface and 5 batch nodes
- This corresponds to a ratio of 3 GB of RAM per core
- Operations: May 2017 (CPU efficiency):
- Accounting numbers (from scheduler) from last month:
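The UNIGE hardware figures quoted above (6 machines with 32 cores and 96 GB each) can be cross-checked with a few lines of arithmetic. This is a minimal sketch; the variable names are illustrative only.

```python
# Figures taken from the UNIGE hardware report above.
MACHINES = 6
CORES_PER_MACHINE = 32
RAM_GB_PER_MACHINE = 96

total_cores = MACHINES * CORES_PER_MACHINE      # cores added in total
total_ram_gb = MACHINES * RAM_GB_PER_MACHINE    # GB of RAM added in total
ram_per_core_gb = total_ram_gb / total_cores    # GB of RAM available per core

print(f"{total_cores} cores, {total_ram_gb} GB RAM, "
      f"{ram_per_core_gb:.0f} GB per core")
```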
NGI_CH
- Xxx
- NGI-CH Open Tickets review
Other topics
Next meeting date:
A.O.B.
Attendants
- CSCS:
- CMS:
- ATLAS:
- LHCb:
- EGI:
Action items