[Beowulf] Re: Beowulf of bare motherboards
Glen Gardner
Glen.Gardner at verizon.net
Mon Sep 27 20:16:36 PDT 2004
I've gone on a bit of a rant here, but I've done this, so I thought I
might share some of what I have learned.
You can see some pictures of what I'm discussing below in the mini
cluster links on my web site at:
http://members.bellatlantic.net/~vze24qhw/index.html
I have built a cluster on standoffs, and it has worked quite well. While
I am a relative newbie to Beowulf, I'm not new to computing and
electronics. I have roughly 40 years experience in electronics.
Hand fabricating enclosures is a lot of work. It also means you hand
fabricate a lot (or all) of the witing and power distribution circuitry.
I can tell you from experience that you will get very tired of drilling
hundreds of holes, and cutting hundreds of metal components by hand.
This needs to be a labor of love, because it is a ton of work.
You can build a cluster in this manner, but you have several problems to
consider. I've listed just a few of the "biggies" for your consideration.
1) thermal issues
Mounting power hungry boards close to each other can lead to thermal
issues due to heat radiating and/or convecting from one board to the next.
If boards are very close to each other, you are likely to need to add
additional fans to move air abundantly around the boards and cpu's to
keep the boards from cooking each other. One way to improve this is to
mount each board on a metal plate and stack the plates on metal
standoffs. Having metal plates between the boards will provide
significant thermal shielding and alleviate the tendency of the boards
to "cook" each other due to radiation and convection of heat. You will
still need to provide adequate air circulation (this means fans, no
matter what you do)
Also, motherboards have internal copper plane layers for grounds and
power which act as rf shielding and to help spread heat more evenly
across the board. You will notice that the mounting holes for most
motherboards have grounded pads around them , and there are sometimes
many tiny metal plated holes around the mounting hole pad. These are
"vias" which are connected to the intenral plane layers thermally ( and
to ground electrically). IF these are mechanically secured to the
chassis with a metal standoff, you not only benefit from an electrical
ground to the chassis (allowing the chassis to act as a faraday shield)
, you get a thermal path to the chassis to help dump some of the excess
heat from the board. This is no accident, this is a design feature of
the board, and a common trick used by engineers to provide for dumping
heat from the board itself. For this reason, and for mechanical reasons,
I'd suggest using ALL of the provided mounting holes. Believe it or not,
it costs money to add a hole to a board. (Yes a hole is a design
feature), so you can bet the engineer had a good reason for putting
"extra" holes in a board beyond those required for a standard
motherboard footprint.
2) RFI issues
The biggest RFI issue is interference to radio and TV reception. Believe
it or not there are FCC regulations governing unwanted radio emissions,
and if your machine is
making a real racket on radio frequencies, you could concievably have a
lot of headaches trying to fix the problem after the FCC shows up and
asks you to shut down.
RFI from board to board is something I have yet to encounter in moderm
pc's. I have encountered it many times in radio circuitry. The solution
is always the same.
Use shielding, decouple the power supply properly, and make sure all the
hardware and shielding is in place, and use a tight fitting metal
enclosure.
For us , that means mounting each motherboard on a metal plate, and
making sure that each board is grounded to that plate. In turn , the
plates are securely connected to each other and the chassis. For the
enclosure, it should be metal, tight fitting and not having large holes.
Power wiring should be dressed carefully, and long power leads should be
twisted and may require common mode chokes, etc.
Board to board interference
I can imagine a situation where motherboards having a high power demand
, in close proximity to each other might cause interference to each
other. But I think that mounting each board on a single metal plate
securely , with conductive standoffs for good grounding, would be more
than adequate to prevent motherboards from talking to each other in
virtually every situation. I'm presently stacking 3 VIA ( @ 15 W each)
motherboards on one metal plate, and I have 4 stacks of 3 motherboards.
They do not interfere with each other. A higher power cpu might cause
interference in my setup, but mounting each board on its own metal
"shielding" plate in the stack would alliveiate this situation.
3) power distribution issues (this part is only relevant if you are
using as odd a power supply setup as I am)
This is a problem I am presently wrestling with at the nusiance level. I
know the cure and am in the process of implimenting part of it now.
My computational nodes are in two racks of 6 nodes , each 6 node rach
has two stacks of 3 boards, with a metal plate on top , and a metal
plate on the bottom.
On the mettal plate is a DC-DC converter board which converts 12VDC to
the required ATX voltages used by the mini itx boards. All of the DC-DC
converters are fed from a single 12VDC regulated power supply at a power
distribution panel built into the top of each 6 node rack.
The issue I have encountered has caused two problems; First, when
powering up "cold", I occasionally have a node fail to boot correctly.
Also, I've observed that once in a while, at turnon, the NFS server hard
drive head will seek home , hard with an audible click, suggesting that
it is "unhappy" with something.
It turns out that with many nodes drawing current at turnon, across a
long power distribution, the nodes (and hard drive) are attemting to
start before the 12VDC power has stablilized. After almost a year of
operation, the mini cluster has devoured two full size ide hard drives
used for it's NFS mount. I also have 12 IBM microdrives in the nodes,
and have only lost one (after it was abused thermally during some
difficult testing). Interestingly enough the high failure rate IDE
drives are on the 12V supply, and the lower failure rate microdrives are
on a conditined 5V supply. It turns out that the 12VDC regulated supply
is not coming up fast enough at turnon, and as a result the hardware is
tending to glitch occasionally due to a "noisy" power supply.
My solution for this is going to involve the following: Replace the IDE
hard drive with a laptop drive that will operate on the much cleaner 5V
supply , and to save power and reduce inrush currents at turnon. Also
I'd like to sequence the turnon of the nodes such that the inrush
currents at turnon are lower, allowing the 12V supply to come up to a
stable 12VDC a bit faster.
At this point in time , my biggest concern is the occasional loss of the
ide drives, so I am focusing on the hard drive issue, but I have in mind
building a power control sequencer to bring up the nodes in a safe and
sane manner. (more on this later, if I actually do it).
4) mechanical integrity
Your design needs to be mechanically sound. Think things through and do
a good job. If you have never designed anything this large before you
might be wise to allow a lot of extra time and some extra materials so
you can get through the learning curve a bit easier (plan on mistakes).
The mini cluster racks were designed to be cheap and effective.
(Emphasis on cheap) A happy consequence of the design is that it is
compact, lightweight and strong. If I had it to do again , I'd have
built in a little more servicability at a slightly higher cost, because
the replacement of a node requires dismantling the racks completely.
Good luck;
Glen
Jack Wathey wrote:
>
>
> On Mon, 27 Sep 2004, Andrew Piskorski wrote:
>
>>> http://www.abo.fi/~physcomp/cluster/celeron.html
>>
>>
>> I recently bought a bunch of old ECS P4VXMS motherboards cheap on
>> Ebay, in order to build my own personal home cluster for testing and
>> experimentation. I plan to mount them on metal shelves somewhat like
>> the Celeron cluster shown above. However, that raises the following
>> questions:
>>
>> Normally, motherboards are mounted to the metal case using metal
>> standoffs, which touch the motherboard ONLY at the designed mounting
>> holes. Is this in fact necessary? Desirable?
>>
>> Is there any reason the standoffs need to be conductive?
>>
>> Would it be ok to instead simply screw the motherboard down to a flat
>> piece of wood or plastic? Or to wood or plastic strips which touch
>> all along much of the bottom of the motherboard?
>>
>
> Andrew,
>
> About a year ago I built a cluster much as you describe (bare
> motherboards on steel shelves), but with the motherboards mounted to
> aluminum sheets instead of wood or plastic. I used nylon standoffs
> (Mouser electronics part #561-A0250; www.mouser.com), and everything
> worked fine. The motherboards are properly grounded by virtue of the
> ground wires in the connector to the power supply, as are the aluminum
> sheets, but the standoffs are nonconducting. The standoffs just snap
> into the aluminum sheets (which are 1/16 inch thick) and snap into the
> motherboard holes. There are no threads and no nuts involved.
>
> Ken Chase (math at velocet.ca) has posted to this list, a year or 2 ago,
> about his experiments with packing motherboards close together in a
> plexiglas enclosure with no shielding between them. I believe he
> concluded that this was a bad idea, because radio-frequency interference
> between the boards caused some of them to fail to boot. Also large
> quantities of wood or plastic near a motherboard is a bad idea because
> of the risk of fire.
>
> Unfortunately I don't have a website describing my cluster, but, if
> you're interested, I could send you more details. One final word of
> warning: building a cluster like this (bare motherboards in a
> customized enclosure) is LOT more work than you probably expect. It
> was for me, anyway. I didn't have much choice, because of space
> constraints, but you might give
> serious thought to the more conventional approach.
>
> Hope this helps,
>
> Jack
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
--
Glen E. Gardner, Jr.
AA8C
AMSAT MEMBER 10593
http://members.bellatlantic.net/~vze24qhw/index.html
More information about the Beowulf
mailing list