Tags: view all tags

Swiss Grid Operations Meeting on 2015-12-10

Date and time: 14:00
Place: Vidyo (room: Swiss_Grid_Operations_Meeting, extension: 109305236)
External link: http://vidyoportal.cern.ch/flex.html?roomdirect.html&key=gDf6l4RlIAGN
Phone gate: From Switzerland: 0227671400 (portal) + 109305236 (extension) + # (pound sign)
IRC chat: irc:gridchat.cscs.ch:994#lcg (ask pw via email)

Swiss Grid Operations Meeting on 2015-12-10
- Site status
  - CSCS
  - PSI
  - UNIBE-LHEP
  - UNIBE-ID
  - UNIGE
  - NGI_CH
- Other topics
- A.O.B.
- Attendants
- Action items

Site status

CSCS

Xxx

PSI

Xxx

UNIBE-LHEP

Operations
- ce01 cluster re-installation virtually completed (about 900 worker cores running, 120 still to be installed, 256 awaiting delivery)
- Started with a simple slurm setup (slurm-15.08.1) in order to cut down on commissioning time: one partition with
```
SelectType=select/cons_res
SelectTypeParameters=CR_CPU_Memory
MemLimitEnforce=no
```
- We don't over-subscribe memory anymore: nodes don't starve and crash
- Memory usage is properly accounted for in 15.08 (PSS): no jobs killed on (artificial) over-limit of "vmem" (now the full address space reserved by a process, no what's allocated or used)
- Comparing job fail rates between ce01 and ce02 (still on old SGE) has convinced me to rush the re-installation of ce02 (started earlier today)
ATLAS specific operations
- Stable worflows by ATLAS (very large improvement since beginning of run II)
- Stuck with the implementation of monthly dumps of the namespace on the DPM SE:
  - headnode on SLC5: the dump script does not work and also generating a valid proxy is problematic
  - decided to push the re-deployment of the head node on SLC6
  - legacy config tool (YAIM) no longer supported
  - puppet based configuration, got the right docs at the DPM workshop earlier this week in CERN
  - tests ongoing on a pps VM
  - also complicated by the fact my site-bdii is still co-located with the DPM head node
  - this will likely be the first task for 2016

UNIBE-ID

Xxx

UNIGE

Xxx

NGI_CH

Xxx

Other topics

Proposal to add to this meeting: T2 monthly pledge review (CSCS, UNIBE); GGUS open ticket review
Topic2

Next meeting date:

A.O.B.

Attendants

CSCS:
CMS:
ATLAS:Gianfranco
LHCb:
EGI:Gianfranco

Action items

Item1

~~Edit~~ | ~~Attach~~ | ~~Watch~~ | Print version | History: r6 | r4 < r3 < r2 < r1 | Backlinks | Raw View | ~~Raw edit~~ | ~~More topic actions...~~

Topic revision: r2 - 2015-12-09 - GianfrancoSciacca

LCGTier2

Log In

(Topic)

Home
LCGTier2 Web
- Users
- Groups
- Index
- Search
- Changes
- Notifications
- RSS Feed
- Statistics
- Preferences
P
View
Edit

Warning: Can't find topic "".""

Account
- Log In

~~Edit~~
~~Attach~~

Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback