[Beowulf] kdump / kexec to optain crash dumps from randomly crashing nodes.

Rahul Nabar rpnabar at gmail.com
Thu Oct 9 11:57:39 PDT 2008


On my Centos system I installed kexec/kdump to investigate the cause of
some random system-crashes by getting access to a crash-dump. I installed
the rpm for kexec and then made the change to grub.conf that reserves the
additional memory for the new kernel.

Also configured kdump.conf. I start the kexec service.and then I tried to
simulate a crash by echo c to sysrq-trigger.

The system does crash and then after a while reboots itself. But I see no
vmcore when it coms back up. /var/crash is empty. This is when I tried to
write to local drive.

I also tried a nfs write but then still no success.

Any idea what could be missing in my steps? Or any other debug
suggestions? Any other kdump users on Beowulf?

-- 
Rahul



More information about the Beowulf mailing list