Phenix updated installation and configuration status
Minutes of the phone call held the 13th February 2007
Participants:
Sergio Maffioletti
Alessandro Usai
Tom Guptil
Derek Feichtinger
Sigve Haug
Zhiling Chen
Status of installation and configuration
WN (X2200)
SLC308 [ok]
LCG/GLite [ok]
WN integrated into old LRMS [ok]
we could have all WNs integrated in short time (sharing /apps via NFS)
CE (X4200)
SLC4 [ok]
for the time being we agreed on having the old ce01-lcg used and Torque as LRMS
SGE integration will be tested and plan to have it in production as soon as it will be stable
Nordugrid will have to be checked and tested too
Problem encoutered, solutions and workarounds
SLC306 does not work (missing controller drivers)
SLC4 does work but installing LCG/Glite sw is error prone
Thumper installation [ok] but we need to test ZFS functionalities
Tom proposed to change the current RAID configuration to use only 1 parity disk; this would give additional 4TB at the expense of reliability [still to be decided]
SUN N1 is not suitable for cluster management therefore we will use cfengine
planning to have all cluster management services on 1 X2200 on Linux (possibly with Solaris on a Virtual Machine)
2 X4200 --> should become free
Tests (tentative dates)
Reliability tests on Thumpers (Tom + Alessandro) --> 12 - 16 February
Performance test from WNs to Thumpers via dcache --> 14 - 21 February
Test different configurations of ZSF and dcache --> 12 - 23 February
Organisation of the dCache tests
functionality tests
VO codes
local load tests (mainly dcap):
writing files in parallel from multiple nodes
reading same file from multiple nodes
trying to write file that is being written by another process
erasing file that is being read by another process