[Beowulf] using Nagios to monitor compute nodes: NRPE vs check_by_ssh

Rahul Nabar rpnabar at gmail.com
Tue Dec 23 10:24:23 PST 2008


On Mon, Dec 22, 2008 at 10:23 PM, Alex Younts <ayounts at tinkergeek.com> wrote:
> At my employer, we use a variety of monitoring tools for our various
> clusters. Our Nagios box is a VM with a single processor and 512MB of
> memory. Currently, we monitor 1700 hosts, each with three or four
> service checks apiece (two of which SSH to nodes to run scripts). We
> check services about every 30 minutes.
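
To put those numbers in perspective, here is a rough back-of-the-envelope
sketch of the average check rate that setup implies. It assumes the checks
are spread evenly over the 30-minute interval, which is my assumption and
not something stated above; the function name is only illustrative.

```python
# Rough average check rate for the setup quoted above.
# Assumption: checks are spread evenly across the interval.
HOSTS = 1700
INTERVAL_SECONDS = 30 * 60  # "about every 30 minutes"


def checks_per_second(hosts, checks_per_host, interval_seconds):
    """Average number of service checks started per second."""
    return hosts * checks_per_host / interval_seconds


low = checks_per_second(HOSTS, 3, INTERVAL_SECONDS)   # three checks per host
high = checks_per_second(HOSTS, 4, INTERVAL_SECONDS)  # four checks per host
ssh = checks_per_second(HOSTS, 2, INTERVAL_SECONDS)   # the two SSH-based checks

print(f"all checks: {low:.1f} to {high:.1f} per second")
print(f"SSH checks: {ssh:.1f} per second")
```

So on average the Nagios box starts only a handful of checks per second,
roughly two of them over SSH.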

Thanks, Alex! I will give that a shot now. Are there any Nagios plugins
for monitoring Torque / PBS / Maui out there? I would like to avoid
reinventing the wheel if at all possible.

-- 
Rahul


