[Beowulf] Remote console management
Joe Landman
landman at scalableinformatics.com
Thu Sep 22 20:51:19 PDT 2005
Hi Bruce:
We (my company) have set something like this up for one of our
customers with the cluster they bought, where the admins can be (in
theory) 1/2 world away and completely control the machines.
It is quite possible to do this, but you have to be ready for the
sticker shock on the KVM over IP. Lets address the remote power first.
There are lots of power strips out there, many folks use APC units of
one form or the other. We use NBB-1600 from WTI
(http://www.wti.com/nbb16.htm) to handle power control. They work,
quite well at that. The only issue is that you will need a few of them
for full racks, and you would need to set the bios to power on on
restoration of power. If you have an IPMI capable machine (Sun v20z),
you plug in to the service processor via a managment network (make it
separate from your main network, but routeable and visible from the
WAN), and you can power cycle the machine. You get some nice diagnostic
bits in there for free as well.
Now onto to the remote consoles. I am sure I am going to regret
saying this, but ... I have seen many people purchase cyclades, and I
cannot say I have ever seen them being used. One customer asked me what
it was for, as their last cluster vendor required it as part of their
package, and they (the customer) never used it. In part because it was
never attached. IPMI gives you the ability to do a console over IPMI
(text only). I am not sure how well this works in general. I prefer
KVM over IP. Yes, it is expensive. Rariton KX232's are about
6000-8000$US/unit these days, and the CIMs that you attach to the
machines are about $100/unit. 128 CIMs will run you 12k$. And will
require 4 of these Raritan units. The Raritans also have an IE-only
(grrr!!!) interface. Some others are saner and use either a java
applet, or some sort of portable console. I have heard a rumor that
someone uses vnc, but have not looked hard enough to find out if this is
true.
Combining these two technologies will get you that level of control.
That said, some of the higher end vendors management processors
integrate all/most of these things. It would be overkill to get one of
those units with the aforementioned power/kvm technologies. Raritan
also integrates power control if you prefer single vendors.
Joe
Bruce Allen wrote:
> We're getting ready to put together our next large Linux compute
> cluster. This time around, we'd like to be able to interact with the
> machines remotely. By this I mean that if a machine is locked up, we'd
> like to be able to see what's on the console, power cycle it, mess with
> BIOS settings, and so on, WITHOUT having to drive to work, go into the
> cluster room, etc.
>
> One possible solution is to buy nodes that have IPMI cards. These
> piggyback on the ethernet LAN and let you interact with the machine even
> in the absence of an OS. With the appropriate tools running on a remote
> machine, you can interact with the nodes even if they have no OS on them
> or are hung.
>
> Another solution is to use the DB9 serial ports of the nodes. You have
> an 'administrative' box containing lots of high-port-count serial cards
> (eg, Cyclades 32 or 64 port cards) and then run a serial cable from each
> node to this box. By remotely logging into this admin box you can
> access the serial ports of the machines, and if the BIOS has the right
> settings/support, this lets you have keyboard/console access.
>
> Or one can do both IPMI + remote serial port access.
>
> Could people on this list please report their experiences with these or
> other approaches? In particular, does someone have a simple and
> inexpensive solution (say < $100/node) which lets them remotely:
> - power cycle a machine
> - examine/set BIOS values
> - look at console output even for a dead/locked/unresponsive box
> - ???
>
> Thanks!
>
> Bruce Allen
> U. of Wisconsin - Milwaukee Physics Department
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
--
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax : +1 734 786 8452
cell : +1 734 612 4615
More information about the Beowulf
mailing list