regular checking/cleaning of old ZFS snapshots to release user quota space (when the nightly update script failed to do this automatically due to the "cannot destroy snapshot ... dataset is busy" error)
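A cleanup of this kind can be sketched as a small shell script; the dataset path, retention period, and retry pause below are assumptions, not the site's actual values:

```shell
#!/bin/sh
# Hypothetical snapshot cleanup sketch (dataset and retention are assumptions).
CUTOFF_DAYS=${CUTOFF_DAYS:-14}
now=$(date +%s)

# is_old EPOCH -> true if the snapshot creation time is past the cutoff
is_old() {
    [ $(( (now - $1) / 86400 )) -ge "$CUTOFF_DAYS" ]
}

# -p prints creation time as epoch seconds, -H drops headers
zfs list -H -p -t snapshot -o name,creation -r data01/users 2>/dev/null |
while read -r snap created; do
    if is_old "$created"; then
        # "dataset is busy" usually means an open handle; retry once after a pause
        zfs destroy "$snap" || { sleep 10; zfs destroy "$snap"; }
    fi
done
```

Running this from cron in addition to the nightly update script gives the busy datasets a second chance to be released.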
set up dCache xrootd movers uniformly at up to 1000/pool; works stably (underpinned by 10*2 NIC bonding)
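For reference, on a dCache pool this kind of limit is persisted in the pool's setup file as a mover command; the fragment below is a sketch, and applying it per queue may need extra options:

```
# dCache pool setup fragment (hypothetical): cap concurrent movers at 1000
mover set max active 1000
```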
user management: new account for UniZ student
returned temporary UI nodes t3ui04,07 to batch as t3wn49,50
re-installation of t3wn48 due to an odd (test) partition table and a Puppet run failure
April-20
Slurm:
memory is configured as a consumable resource (default DefMemPerCPU is 2 GB/CPU) to prevent out-of-memory situations caused by user jobs
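The corresponding slurm.conf settings would look roughly like this (a sketch; the choice of the cons_res plugin is an assumption):

```
# slurm.conf fragment: schedule memory as a consumable resource
SelectType=select/cons_res
SelectTypeParameters=CR_CPU_Memory
# default memory per allocated CPU, in MB (2 GB)
DefMemPerCPU=2048
```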
added LANG environment variables to /etc/locale.conf on client nodes to suppress LC_CTYPE/UTF-8 errors in ssh sessions
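A minimal /etc/locale.conf along these lines (the en_US.UTF-8 choice is an assumption); ssh clients commonly forward LC_* variables, and the error appears when the forwarded locale is not defined on the server:

```
# /etc/locale.conf: define a UTF-8 locale so forwarded LC_CTYPE values resolve
LANG=en_US.UTF-8
```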
implemented a Postgres backup script that copies the DB to t3nfs02:/zfs/data01/swshare/postgres
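Such a backup script might be sketched as follows; the destination path comes from the note above, while the dump command, filename scheme, and retention are assumptions:

```shell
#!/bin/sh
# Hypothetical Postgres backup sketch (not the site's actual script).
DEST=/zfs/data01/swshare/postgres

# dump_name DATE -> backup filename for that date
dump_name() { printf 'pg_dumpall_%s.sql.gz' "$1"; }

backup() {
    # full-cluster logical dump, compressed, one file per day
    pg_dumpall -U postgres | gzip > "$DEST/$(dump_name "$(date +%F)")"
    # prune dumps older than two weeks (retention period is an assumption)
    find "$DEST" -name 'pg_dumpall_*.sql.gz' -mtime +14 -delete
}
```

pg_dumpall keeps roles and tablespace definitions in the dump, which a per-database pg_dump would miss.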
after the upgrade, dCache logging became too verbose and filled up the /var/log partition; to fix the problem, dCache was restarted on Sun Mar 15 (while there was no user activity)
Storage cleaning due to almost no free space on dCache:
deletion of leftover user data took several days: too many files (hundreds of thousands) in single directories, which dCache cannot handle well
overall cleanup freed ~30% of the space; as a next step, ~150 TB of mc and data dirs still need to be checked and cleaned
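One way to keep such deletions manageable is to remove files in small batches through the namespace mount instead of one huge recursive delete; the helper below is a sketch of that batching idea (its name and the batching approach are assumptions):

```shell
#!/bin/sh
# batch_rm DIR BATCH: delete plain files under DIR in batches of BATCH files,
# so no single namespace operation has to touch an enormous directory at once
batch_rm() {
    find "$1" -type f -print0 | xargs -0 -r -n "$2" rm -f
}
```

For example, `batch_rm /pnfs/.../user/leftovers 1000` issues rm in groups of 1000 files.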
Slurm:
added a QoS limit (500 CPUs/user) to the quick partition
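Assuming the limit is enforced via a Slurm QoS attached to the partition, the setup would look roughly like this (QoS name and exact commands are assumptions):

```
# create a QoS capping each user at 500 CPUs
sacctmgr add qos quick
sacctmgr modify qos quick set MaxTRESPerUser=cpu=500
# attach it to the partition in slurm.conf
PartitionName=quick ... QOS=quick
```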
EOS test configuration (enabled on worker nodes and UIs): no user feedback since February
Monitoring:
manually added the non-standard /work server t3nfs02 to Ganglia
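In gmetad terms this is just an extra data_source line; the cluster label and the default gmond port 8649 below are assumptions:

```
# /etc/ganglia/gmetad.conf fragment: poll t3nfs02 directly
data_source "NFS servers" t3nfs02:8649
```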
solved the SELinux problem (the cause of the HTTP access errors) on the Ganglia server; works stably now
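A typical diagnose-and-fix cycle for such an SELinux denial is sketched below (run as root; the specific paths and boolean are assumptions, and which fix applies depends on what audit2why reports):

```
# find the AVC denials behind the httpd errors
ausearch -m avc -ts recent | audit2why
# restore the expected contexts on the web content
restorecon -Rv /var/www/html
# or, if httpd needs to reach gmetad over the network:
setsebool -P httpd_can_network_connect on
```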