Swiss Grid Operations Meeting on 2015-12-10

Site status

CSCS

  • Xxx

PSI

  • Xxx

UNIBE-LHEP

  • Operations
    • ce01 cluster re-installation virtually completed (about 900 worker cores running, 120 still to be installed, 256 awaiting delivery)
    • Started with a simple slurm setup (slurm-15.08.1) in order to cut down on commissioning time: one partition with
      SelectType=select/cons_res
      SelectTypeParameters=CR_CPU_Memory
      MemLimitEnforce=no
    • We don't over-subscribe memory anymore: nodes don't starve and crash
    • Memory usage is properly accounted for in 15.08 (PSS): no jobs killed on (artificial) over-limit of "vmem" (now the full address space reserved by a process, no what's allocated or used)
    • Comparing job fail rates between ce01 and ce02 (still on old SGE) has convinced me to rush the re-installation of ce02 (started earlier today)
  • ATLAS specific operations
    • Stable worflows by ATLAS (very large improvement since beginning of run II)
    • Stuck with the implementation of monthly dumps of the namespace on the DPM SE:
      • headnode on SLC5: the dump script does not work and also generating a valid proxy is problematic
      • decided to push the re-deployment of the head node on SLC6
      • legacy config tool (YAIM) no longer supported
      • puppet based configuration, got the right docs at the DPM workshop earlier this week in CERN
      • tests ongoing on a pps VM
      • also complicated by the fact my site-bdii is still co-located with the DPM head node
      • this will likely be the first task for 2016

UNIBE-ID

  • Xxx

UNIGE

  • Xxx

NGI_CH

  • Xxx

Other topics

  • Proposal to add to this meeting: T2 monthly pledge review (CSCS, UNIBE); GGUS open ticket review
  • Topic2
Next meeting date:

A.O.B.

Attendants

  • CSCS:
  • CMS:
  • ATLAS:Gianfranco
  • LHCb:
  • EGI:Gianfranco

Action items

  • Item1
Edit | Attach | Watch | Print version | History: r6 | r4 < r3 < r2 < r1 | Backlinks | Raw View | Raw edit | More topic actions...
Topic revision: r2 - 2015-12-09 - GianfrancoSciacca
 
  • Edit
  • Attach
This site is powered by the TWiki collaboration platform Powered by Perl This site is powered by the TWiki collaboration platformCopyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback