[Beowulf] Re: RRDtools graphs of temp from IPMI

Chris Samuel csamuel at vpac.org
Tue Nov 11 12:08:21 PST 2008


----- "Dave Love" <d.love at liverpool.ac.uk> wrote:

> Chris Samuel <csamuel at vpac.org> writes:
> 
> > The reason it worries about high load is that we
> > used to see processes hang trying to read from the
> > IPMI device, but haven't seen that with more recent
> > kernels.
> 
> How recent?  We've seen similar trouble on Supermicros with a SuSE
> 10.3 (2.6.22.17) kernel, hence doing it out-of-band, as I just posted.

I think the hangs went away somewhere around 2.6.27.

> (Sorry I basically duplicated the in-band one of yours.)  It involves
> the kipmi0 kernel thread going CPU-bound and sometimes getting a huge
> load average from failed ipmitool instances hanging around.

Sounds very much like what we were seeing on ours!
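
For anyone wanting to guard an in-band poller against this, here is a rough
sketch of the idea: skip the reading when the load is already high, and put a
timeout on ipmitool so ordinary hangs don't pile up. This is not our actual
script. The threshold, the timeout, and the function names are made up for
illustration, and the ipmitool arguments are just one plausible way to ask for
temperatures.

```python
import os
import subprocess

MAX_LOAD = 4.0   # hypothetical 1-minute load threshold; tune per node
TIMEOUT_S = 10   # hypothetical upper bound on one ipmitool call, in seconds


def load_ok(max_load=MAX_LOAD, getloadavg=os.getloadavg):
    """Return True if the 1-minute load average is below max_load."""
    return getloadavg()[0] < max_load


def read_sensors(cmd=("ipmitool", "sdr", "type", "Temperature"),
                 timeout=TIMEOUT_S, max_load=MAX_LOAD,
                 getloadavg=os.getloadavg, run=subprocess.run):
    """Return ipmitool's output, or None if the reading was skipped or failed."""
    # Don't add another reader when the node already looks overloaded.
    if not load_ok(max_load, getloadavg):
        return None
    try:
        result = run(list(cmd), capture_output=True, text=True,
                     timeout=timeout)
    except (subprocess.TimeoutExpired, OSError):
        # Timed out, or ipmitool is missing or not executable.
        return None
    if result.returncode != 0:
        return None
    return result.stdout
```

A caller would write the returned text to its RRD only when it is not None.
Note that a process stuck in uninterruptible sleep inside the kernel can't be
killed, so the timeout only helps with ordinary hangs; that is one reason
reading out-of-band is attractive on affected kernels.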

cheers,
Chris
-- 
Christopher Samuel - (03) 9925 4751 - Systems Manager
 The Victorian Partnership for Advanced Computing
 P.O. Box 201, Carlton South, VIC 3053, Australia
VPAC is a not-for-profit Registered Research Agency


