Tags:
create new tag
view all tags
<!-- keep this as a security measure:
* Set ALLOWTOPICCHANGE = TWikiAdminGroup,Main.LCGAdminGroup,Main.EgiGroup
* Set ALLOWTOPICRENAME = TWikiAdminGroup,Main.LCGAdminGroup
#uncomment this if you want the page only be viewable by the internal people
#* Set ALLOWTOPICVIEW = TWikiAdminGroup,Main.LCGAdminGroup,Main.ChippComputingBoardGroup
-->

Swiss Grid Operations Meeting on 2019-05-02 at 14:00

Site status

CSCS

Systems

  • Phoenix complete shutdown
  • Un-Racked all WN and Services nodes and cleaned from cables.

Storage

  • dCache
  1. - No special reports for the service
  2. - New (SE) servers installed, starting full data migration soon
  3. - SE servers will have GPFS as backend: this will give more performance and flexibility to the service and is the base for future steps (containers)
  4. - Planning the future of storage01-02: as first step they will be virtualized and then will be consolidated in one (v)server with external chimera (postgres) database

  • GPFS (scratch)
  1. - Servers moved to a new location, on a new network with no impact to the filesystem
  2. - Storage (SSD) moved to the new location
  3. - HDD Tier/pool moved from DDN SF12k to DELL SC9000
  4. - Will double the FC connections squeeze more performance (sometimes I see FC connections reaching the limit)

PSI

UNIBE-LHEP

  • Stable running despite lack of maintenance
  • Started decommissioning nodes, but re-deployment delayed, since Phoenix hardware is still at CSCS
    • this will very likely affect our ability to meet the deadline for CentOs 7 migration (1st June)
  • Monthly summary: Pledged: 42k, delivered 22k
  • Ubelix contributing ~45% (23% typical)
  • Running an average >2000 slots (2500 typical during the previous pledge period)
WC UNIBE-LHEP: WC_UNIBE-LHEP.png
5month_UNIBE-LHEP: 5month_UNIBE-LHEP.png


  • Accounting numbers (from scheduler) from last month
    • Omitted this month

Swiss ATLAS statistics

  • Hammercloud availability:

    ATLAS_HammerCloud.png
    • ANALY_CSCS-HPC: 80%
    • CSCS-LCG2-HPC_MCORE: 99%
    • ANALY_UNIBE*: 100%
    • UNIBE-* : 99%

  • Running slots
    ATLAS_slots.png

    • CSCS: 3.4k
    • UNIBE: 2.1k (38%)

  • CPU consumption
    ATLAS_CPU.png


  • Accounting Numbers from the ATLAS dashboard (April 2019) CSCS+UNIBE

Cluster Job Type Produced WC core-hours Good vs Bad WC % CPU eff good jobs %
CSCS Any 2'133'333; 59% 0.68 0.85
UniBe Any 1'502'777; 41% 0.75 0.80


  • Delivered vs pledged
    CSCS-LCG2: pledged 50k, delivered 39.4k
    UNIBE-LHEP: pledged 42k, delivered 22k

    ATLAS_delivered_vs_pledge.png

  • Update on VO shares at CSCS
    • We have discussed this at length ion the past. In that occasion the issue seems to afflict mostly Phoenix and much less or not at all Piz Daint.
    • Shares on Daint have been ok also after the decommissioning of Phoenix (40:40:20)
    • Following the recent outages however, the shares are largely wrong, much worse problem that before
    • RT ticket, Miguel following up

  • CSCS-slots.png


  • Further ATLAS CSCS topics
    • CentOS 7 migration
      • deadline 1st June, awaiting clarification from ATLAS on what to do
    • ARC monitoring
      • Dino will join the NorduGrid conference in June to discuss his ideas
      • There is alreday a feature request open since many weeks

UNIBE-ID

  • Smooth operation of ARC-CE services for UNIBE-LHEP-UBELIX * currently lots of failed jobs due to memory excess
  • Soon changing storage current storage in a downtime (one of the next two)
    • current: 575TB GPFS based, new: 3.9PB still GPFS based

UNIGE

  • ARC CE provisioning further delayed

NGI_CH

  • Status of CA / certificates

  • NGI-CH Open Tickets review
    4 Tickets found
    Ticket-ID Type VO Site Priority Resp. Unit Status Last Update Subject Scope
    140546 lhcb CSCS-LCG2 very urgent NGI_CH involved in progress 2019-04-23 Data transfers problem from CSCS-LCG2 WLCG
    139574 dteam CSCS-LCG2 less urgent NGI_CH in progress 2019-04-29 please configure mesh on ... EGI
    131965 none UNIBE-LHEP less urgent NGI_CH assigned on hold 2019-02-19 IPv6 deployment at WLCG Tier-2 sites EGI
    131432 none CSCS-LCG2 urgent NGI_CH assigned involved in progress 2019-04-18 Storage accounting deployment EGI

Other topics

  • Topic1
  • Topic2

    Next meeting date:

A.O.B.

Attendants

  • CSCS:
  • CMS:
  • ATLAS: Gianfranco
  • LHCb:
  • EGI: Gianfranco

Action items

  • Item1 *
Topic attachments
I Attachment HistorySorted ascending Action Size Date Who Comment
PNGpng 5month_UNIBE-LHEP.png r1 manage 250.9 K 2019-05-02 - 08:17 GianfrancoSciacca 5month_UNIBE-LHEP
PNGpng ATLAS_CPU.png r1 manage 216.3 K 2019-05-02 - 08:44 GianfrancoSciacca ATLAS_CPU
PNGpng ATLAS_HammerCloud.png r1 manage 274.7 K 2019-05-02 - 08:26 GianfrancoSciacca ATLAS_HammerCloud
PNGpng ATLAS_delivered_vs_pledge.png r1 manage 141.7 K 2019-05-02 - 09:33 GianfrancoSciacca  
PNGpng ATLAS_slots.png r1 manage 194.7 K 2019-05-02 - 08:33 GianfrancoSciacca ATLAS_slots
PNGpng CSCS-slots.png r1 manage 186.3 K 2019-05-02 - 09:38 GianfrancoSciacca CSCS-slots
PNGpng WC_UNIBE-LHEP.png r1 manage 78.9 K 2019-05-02 - 08:08 GianfrancoSciacca WC UNIBE-LHEP
Edit | Attach | Watch | Print version | History: r11 < r10 < r9 < r8 < r7 | Backlinks | Raw View | Raw edit | More topic actions
Topic revision: r11 - 2019-05-02 - DinoConciatore
 
This site is powered by the TWiki collaboration platform Powered by Perl This site is powered by the TWiki collaboration platformCopyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback