CMS Tier-3 Upgrade Planning Page
phase B restructuring
Summary
dcache upgrade to 1.9.5-16. Test evacuation of old X4500 pools. Research slow t3fs05 transfer speeds
--++ Details
dCache upgrade to 1.9.5-16
test on t3fs05 filesystem to find bottleneck
Pool migration from t3fs05 to a new Thor
List of open tasks
- Virtual machine infrastructure
- install a semi-permanent vmware-server host (t3wn08 has 1 broken NIC port. Should probably free this machine for repairs)
- test running VMs over NFS with the images residing on ZFS on a thumper (t3fs06?)
- migrate all virtual machines to this new installation DOWNTIME
- File servers and dCache
- find solution to upgrade problem
- prepare new Thors for dcache
- make standard configuration procedure where puppet takes over most of the config. We cannot do a full puppet host install, since there is no coupling between the JumpStart and puppet
- setup raidz2 ZFS structure for the pools
- install dCache and bring the Thors online with writes disabled
- migrate the data to the new pools to free up servers
- Reinstall the old thumpers through Jumpstart, standard config + puppet, so that we have everywhere the same Solaris version and a raidz2 configuration
- Migrate dcache to Chimera
- Home directories
- implement daily snapshots of shome on t3fs06 (cron based script, delete older snapshots)
- implement incremental snapshot transfers to a backup server
- Services
- convert the VM t3ui02 to a real physical machine (let's take t3wn01)
- Setup a new VM for the VO-Box (mostly phedex... I think that frontier should stay on a phys host with local HD)
- Lesser priority
- LDAP direcetory service
- should we move that onto a VM? Is the admin host indeed a good place for this? backup and failback? This is a critical system
- make use of the new extension fields, so that the can be used in automated scripts
- Attach system to NAGIOS (maybe a project for a practicum student?)