[Beowulf] Re: RRDtools graphs of temp from IPMI
Ashley Pittman
apittman at concurrent-thinking.com
Tue Nov 11 06:53:37 PST 2008
On Tue, 2008-11-11 at 14:41 +0000, Dave Love wrote:
> Chris Samuel <csamuel at vpac.org> writes:
>
> > The reason it worries about high load is that we
> > used to see processes hang trying to read from the
> > IPMI device, but haven't seen that with more recent
> > kernels..
>
> How recent? We've seen similar trouble on Supermicros with a SuSE 10.3
> (2.6.22.17) kernel, hence doing it out-of-band, as I just posted.
> (Sorry I basically duplicated the in-band one of yours.) It involves
> the kipmi0 kernel thread going CPU-bound and sometimes getting a huge
> load average from failed ipmitool instances hanging around.
Even when it does work, running "ipmitool sensor" in-band can often take
30 seconds to complete, which isn't great for performance.
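
For what it's worth, one way to keep slow or hung in-band reads from piling
up is to wrap the call in a timeout. What follows is only a rough sketch, not
something taken from a production setup: the function names and the 60-second
limit are made up for illustration, and the parser assumes the usual
pipe-separated "ipmitool sensor" output with the unit in the third column. A
timeout also can't help if the process is stuck in uninterruptible sleep
inside the kernel driver, since it can't be killed in that state.

```python
import subprocess


def read_sensors(cmd=("ipmitool", "sensor"), timeout=60):
    """Run the sensor command; return its stdout, or None on timeout or failure.

    The 60 second default is an arbitrary assumption, chosen to sit above
    the roughly 30 seconds a slow in-band read can take.
    """
    try:
        result = subprocess.run(
            list(cmd), capture_output=True, text=True, timeout=timeout
        )
    except (subprocess.TimeoutExpired, OSError):
        # Timed out, or the command could not be started at all.
        return None
    if result.returncode != 0:
        return None
    return result.stdout


def parse_temperatures(text):
    """Pick out temperature readings from pipe-separated sensor output.

    Assumes lines of the form: name | value | unit | status | thresholds...
    Returns a dict mapping sensor name to degrees C as a float.
    """
    temps = {}
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) < 3 or fields[2] != "degrees C":
            continue
        try:
            temps[fields[0]] = float(fields[1])
        except ValueError:
            # Sensors with no reading typically report "na"; skip them.
            continue
    return temps


if __name__ == "__main__":
    output = read_sensors()
    if output is None:
        print("sensor read failed or timed out; skipping this sample")
    else:
        for name, value in parse_temperatures(output).items():
            print(f"{name}: {value}")
```

Skipping a sample on timeout, rather than retrying straight away, is
deliberate here: retrying would just add more ipmitool instances to a BMC
that is already struggling.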
Ashley,