Node Type: Mon

Firewall requirements

local port   open to            reason
8649/udp     192.33.123.0/24    ganglia collector
8670/udp     192.33.123.0/24    ganglia collector
8671/udp     192.33.123.0/24    ganglia collector
80/tcp       *                  ganglia web server
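
As a rough illustration, the table above could be turned into iptables commands along these lines. This is only a sketch: it assumes the node filters traffic with iptables on the INPUT chain and that "*" means any source address. The helper name iptables_command is hypothetical, and the script only prints the commands; it does not apply them.

```python
import shlex

# Firewall requirements copied from the table above:
# (local port, protocol, allowed source, reason). "*" is read as "any source".
RULES = [
    (8649, "udp", "192.33.123.0/24", "ganglia collector"),
    (8670, "udp", "192.33.123.0/24", "ganglia collector"),
    (8671, "udp", "192.33.123.0/24", "ganglia collector"),
    (80, "tcp", "*", "ganglia web server"),
]


def iptables_command(port, proto, source, reason):
    """Build one 'iptables -A INPUT ... -j ACCEPT' command line (hypothetical helper)."""
    parts = ["iptables", "-A", "INPUT", "-p", proto]
    if source != "*":
        # Restrict the rule to the listed source network.
        parts += ["-s", source]
    parts += ["--dport", str(port),
              "-m", "comment", "--comment", reason,
              "-j", "ACCEPT"]
    # shlex.join quotes arguments that contain spaces (here: the comment).
    return shlex.join(parts)


if __name__ == "__main__":
    # Print the commands for review; nothing is applied to the firewall.
    for rule in RULES:
        print(iptables_command(*rule))
```

Printing rather than executing keeps the sketch safe to run anywhere; the output can be reviewed and merged into whatever firewall configuration the node actually uses.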


Regular Maintenance work

Emergency Measures

Installation

Services

Ganglia

Installation details can be found here.

TODO for DerekFeichtinger (priority 2)
gmetad fails to start after reboot

After a reboot, gmetad sometimes fails to start and writes this log entry:

May 11 10:49:56 t3ce01 /usr/sbin/gmetad[5651]: Please make sure that /var/lib/ganglia/rrds is owned by nobody

It is not yet clear what causes /var/lib/ganglia/rrds to be owned by root after a reboot.
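
Until the cause is found, a small check could detect (and optionally repair) the wrong ownership before gmetad is started. The following is a minimal sketch, not a fix for the underlying problem. The path and the expected owner are taken from the log entry; the function names are hypothetical, only the top-level directory is checked (not the files below it), and changing the owner requires root.

```python
import os
import pwd

RRD_DIR = "/var/lib/ganglia/rrds"  # directory named in the gmetad log entry
EXPECTED_OWNER = "nobody"          # owner that gmetad asks for in the log entry


def owner_name(path):
    """Return the user name owning path, or the numeric uid as a string
    if that uid has no passwd entry."""
    uid = os.stat(path).st_uid
    try:
        return pwd.getpwuid(uid).pw_name
    except KeyError:
        return str(uid)


def ensure_owner(path, user, fix=False):
    """Return True if path is owned by user.

    With fix=True, try to change the owner first (needs root); the group
    is left unchanged. An unknown user name simply yields False.
    """
    if owner_name(path) == user:
        return True
    if not fix:
        return False
    try:
        uid = pwd.getpwnam(user).pw_uid
    except KeyError:
        return False
    os.chown(path, uid, -1)  # -1 keeps the current group
    return owner_name(path) == user
```

One possible use is to call ensure_owner(RRD_DIR, EXPECTED_OWNER, fix=True) as root from a boot-time script that runs before the gmetad service; whether that suits this node's startup order is an assumption to verify.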

Additional monitoring

See the AdditionalMonitoring page.

Backups

OS snapshots are taken nightly by the PSI VMWare Team (e.g. Peter Huesser). In addition, LinuxBackupsByLegato can be used to recover a single file.

NodeTypeForm
Hostnames                     t3mon01, DNS alias t3mon = t3mon01
Services                      ganglia collector, ganglia web front end
Hardware                      PSI DMZ VMWare cluster
Install Profile               mon
Guarantee/maintenance until   VM
Topic revision: r11 - 2014-01-21 - FabioMartinelli
 