[Beowulf] Problems with Dell M620 and CPU power throttling

Bill Wichser bill at princeton.edu
Tue Sep 17 17:40:57 PDT 2013


One week ago, on Tuesday September 10, we had a system downtime for some 
work on our central filesystem. During that outage we decided to not 
only power of the nodes but to also power off the chassis across 10 
racks of these blades.

Every day we check the systems by running a "stress" benchmark and 
checking the CPU speeds with "turbostat" and every day we find that the 
cores are all exceeding the 2.6GHz E5-2670 rating, mostly settling in at 
3.0 GHz (with turbo mode enabled).  Every day I wait to find that things 
have again degraded but have been happily surprised.

If we get through a whole month then I would say that after all the 
firmware and iDrac and CMC updates that a chassis power cycle is the 
answer.  But tomorrow I will look again and hopefully be happily 
surprised for one more day.

Bill

On 9/17/2013 8:06 PM, Richard Hickey wrote
> You are not alone in seeing this. We discovered it via by some of our
> weather codes running slow. A co-worker started running single node Linpack
> runs and we saw individual nodes running slow. A reboot did not fix,
> however a power cycle did. We can see a 2 to 3 fold increase in
> performance.
>
>
>
> We found that you could either do a physical reseat of the blade, or a
> logical one through the cmc command line. Either way fixes the problem
> temporarily.
>
>
>
> It's good to see that someone else is seeing this. Well, maybe not good,
> but at least we're not the only ones fighting this.
>
>   
>
>
>
> Rich
>
>
>
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
>
>



More information about the Beowulf mailing list