[Beowulf] [EXTERNAL] Re: Have machine, will compute: ESXi or bare metal?

Skylar Thompson skylar.thompson at gmail.com
Wed Feb 19 17:17:22 PST 2020


Thanks for the pointers, Jim! I had this fear that there wasn't something I
was understanding with zeroconf addressing so it's good to know that other
people have had similar ideas. :)

With BCCD, we have a huge variety of NICs that we've supported over the
years, and even have had people run clusters over wifi (sometimes even
intentionally) though obviously a shared medium is bad for performance and latency.

On Wed, Feb 12, 2020 at 09:13:30PM +0000, Lux, Jim (US 337K) wrote:
> I've used zeroconf quite effectively on a couple clusters of beagleboards.  If you do it, then you can let the nodes use DHCP to get their IP addresses, which is handy if you're sharing a WiFi network, for instance. 
> 
> On the other hand, for a "training experience", having to go through the process of manually assigning IP addresses, and node identifiers (hostname) and keeping them all straight is a useful thing.  Having done that, you really appreciate zeroconf/bonjour and DHCP.
> 
> My two configurations are:
> Head node is a Macbook running OS X
> Configuration 1: Macbook using Wireless to connect to the "internet" and wired to connect to the cluster, which is 4 BeagleBoard Green with wired ethernet.
> Configuration 2: Macbook using wireless to connect to the "internet" and wired to an 802.11b/g access point, then the cluster is 4 BeagleBoard Green Wireless.
> 
> In both cases, I wind up setting up a bridge from the cluster to the outside world via the macbook, so that I can run things like "apt get" on the nodes to install software.
> I use a combination of screen, ssh, and pdsh to work with the nodes, and scp (with pdsh) to move files around.
> 
> Note that with beagles (and Rpis) you usually use a "network over USB" to get it up originally (the gadget interface)
> 
> On 2/11/20, 7:57 PM, "Beowulf on behalf of Skylar Thompson" <beowulf-bounces at beowulf.org on behalf of skylar.thompson at gmail.com> wrote:
> 
>     On Tue, Feb 11, 2020 at 06:25:24AM +0800, Benson Muite wrote:
>     > 
>     > 
>     > On Tue, Feb 11, 2020, at 9:31 AM, Skylar Thompson wrote:
>     > > On Sun, Feb 09, 2020 at 10:46:05PM -0800, Chris Samuel wrote:
>     > > > On 9/2/20 10:36 pm, Benson Muite wrote:
>     > > > 
>     > > > > Take a look at the bootable cluster CD here:
>     > > > > http://www.littlefe.net/
>     > > > 
>     > > > From what I can see BCCD hasn't been updated for just over 5 years, and the
>     > > > last email on their developer list was Feb 2018, so it's likely a little out
>     > > > of date now.
>     > > > 
>     > > > http://bccd.net/downloads
>     > > > 
>     > > > http://bccd.net/pipermail/bccd-developers/
>     > > > 
>     > > > On the other hand their TRAC does list some ticket updates a few months ago,
>     > > > so perhaps there are things going on but Skylar needs more hands?
>     > > > 
>     > > > https://cluster.earlham.edu/trac/bccd-ng/report/1?sort=created&asc=0&page=1
>     > > 
>     > > Wow, I had no idea people on the Beowulf list were still thinking of BCCD.
>     > > :)
>     > > 
>     > > I've been working on a major BCCD update for a while now (modern Debian,
>     > > better node auto-detection) but a combination of life interference and a
>     > > shift in focus for the project to curriculum development has slowed me
>     > > down.
>     > > 
>     > > At the end of the day, BCCD has three has three main goals:
>     > > 
>     > > 1. Non-destructive in its default mode
>     > > 2. Simple ("just press enter")
>     > > 3. Ready with pedagogically-useful ("validated, verified, and accredited")
>     > > curriculum modules
>     > > 
>     > > One thing I'm hoping can come out of this major update is to decouple the BCCD
>     > > from the underlying distribution, since that's been a barrier for some
>     > > people in using BCCD. That's just an aspiration right now, but we'll see
>     > > where it goes.
>     > 
>     > Can you give more details on how you expect to decouple it?
>     
>     There's a couple things that are tightly integrated with the init process
>     now:
>     
>     * Network setup - prompts the user for information (which NIC to run on,
>       whether to use DHCP to assign addresses, etc.), detects other BCCD
>       systems on the network, etc.
>     
>     * SSH public key broadcast (pkbcast) - needs to run after a user logs in to
>       ensure that authorized_keys is setup on other participating systems.
>     
>     The network setup in particular is a challenge in the systemd world, since
>     getting STDIN from systemd-invoked processes is not trivial. We've also had
>     some users wanting better integration between networking and desktop
>     applications, which pushed me to try to make use of the existing
>     systemd/networkd toolchain rather than rolling our own tooling.
>     
>     Right now the challenge is the node auto-detection, though I'm hoping that
>     we might be able to use mDNS and zeroconf-assigned addressing rather than
>     depending on custom DHCP tags which have been problematic in the past.
>     zeroconf might also mitigate the biggest problem we have in workshops:
>     someone jumping the gun and starting up a head node before we're ready to
>     go.
>     
>     While I'm not the biggest fan of systemd, it does have the potential to
>     allow us to get away from custom scripts and use functionality common
>     across more than one distribution.
>     
>     -- 
>     Skylar
>     _______________________________________________
>     Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
>     To change your subscription (digest mode or unsubscribe) visit https://beowulf.org/cgi-bin/mailman/listinfo/beowulf
>     
> 

-- 
Skylar


More information about the Beowulf mailing list