Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] New member, upgrading our existing Beowulf cluster

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Gerald Creager gerry.creager at tamu.edu
Thu Dec 3 13:01:15 PST 2009


Prentice Bisbal wrote:
> Greg Lindahl wrote:
>> On Thu, Dec 03, 2009 at 10:40:12AM -0500, Prentice Bisbal wrote:
>>
>>> if a single node goes down, you need to take down all the
>>> nodes in the chassis before you can remove the dead node. Not very
>>> practical.
>> Eh? What's so hard about marking the other nodes as unusable in your
>> batch system, and waiting for them to become free?
>>
>> -- greg
>>
> 
> I didn't say it was hard - just impractical. ;)
> 
> I thought the same thing when HP told me the nodes weren't
> hot-swappable. But then when I learned the SuperMicros were hot
> swappable, I figured if SuperMicro can do it, why not HP?
> 
> I'm sure you'll agree that taking just one node down instead of 4 is
> more convenient, and is less likely to draw the ire of your
> number-crunchers.


Because of our ill-advised choice of a specific APC rack, whenever I've 
got to remove one of the supermicro's from the 2uTwin chassis, I have to 
power 'em all off.  I then have to ease the chassis out so I've enough 
room to get a node out.  I learned to "just do it" after a 2 month whine-in.

gerry



More information about the Beowulf mailing list