[Beowulf] RE: A couple of interesting comments
Greg Keller
Greg at keller.net
Wed Sep 24 07:37:12 PDT 2008
Gerry,
As a former installer/patsy at one of those nameless clumsy hardware
vendors, I thought this *may* be useful for you:
> 1. We specified "No OS" in the purchase so that we could install
> CentOS
> as our base. We got a set of systems with a stub OS, and an EULA for
> the diagnostics embedded on the disk. After clicking thru the EULA,
> it
> tells us we have no OS on the disk, but does not fail to PXE.
Vendors get into all kinds of legal issues when they don't sell M$
licenses they are supposed to have EULA's for cover. They used to
clobber/corrupt partion tables etc. when fdisk'd ruthelessly but are
largely benign now if you handle issue #2 gracefully and get the
standard install/fdisk launched. The vendor I worked for had a EULA
free sku but only if you purchased RHEL WS licenses. Blank disks
Always had EULAs.
>
> 2. BIOS had a couple of interesting defaults, including warn on
> keyboard error (Keyboard? Not intentionally. This is a compute node,
> and should never require a keyboard. Ever.) We also find the BIOS is
> set to boot from hard disk THEN PXE. But due to item 1, above, we
> never
> can fail over to PXE unless we load up a keyboard and monitor, and hit
> F12 to drop to PXE.
A fast trick I've used:
a. Keyboard errors are usually just complaint messages on a console
you'll never see. I haven't seen them halt a system during boot in
years so hopefully this is cosmetic.
b. to get around keyboarding through the BIOS config, a system with
hot swap drives can be simple:
Eject the drives
Boot the system (with ethernet cabled to a powered on switch)
Wait long enough for the system to post and decide there's no hard
drive, which bumps the PXE up to first boot
Power down, reinstall the drives - Now PXE is the primary boot device
This sounds more painful than it is. I've deployed a few 500+ and a
couple 1200+ node clusters and life was much faster and simpler this
way.
Note: With many systems you can switch back to Disk as primary boot
device by powering off the ethernet switch and going through the 2
boot dance so the local drive becomes the primary boot device again.
We normally setup PXE to tell the node to boot local on the occasions
we need a stateful install. YMMV
Once the nodes are booted you can script the "Config" utility from the
vendor to tune the bios/bmc etc to your preferred favorite settings.
I use Perceus so the local drives are always scratch+swap space and we
partition them and format on the way up if they need it.
Hope this helps,
Cheers!
Greg
More information about the Beowulf
mailing list