[Beowulf] Infiniband for MPI computations setup guide
Paul Edmon
pedmon at cfa.harvard.edu
Wed Oct 20 16:19:44 UTC 2021
Generally OpenMPI will be able to autodetect much of the IB setup you
just need to make sure you have UCX. With modern OpenMPI you will need
to build a version of PMIx to hook into Slurm. Slurm will also need to
be built against PMIx as well for best experience. Thus in terms of
order of operations:
1. Make sure UCX is installed and that it is detecting the IB.
2. Make sure to install PMIx
3. Get Slurm up and running built against PMIx (you don't need to build
it against UCX).
4. Build OpenMPI against PMIx, Slurm, and UCX. Generally OpenMPI will
autodetect these but you can provide commandline options to ensure that.
5. Test and verify it is working.
-Paul Edmon-
On 10/20/2021 6:08 AM, leo camilo wrote:
> I have recently acquired a few ConnectX-3 cards and an unmanaged IB
> switch (IS5022) to upgrade my department's beowulf cluster.
>
> Thus far, I have been able to verify that the cards and switch work
> via the MFT and opensource tools in ubuntu,
>
> Though, I was wondering if anyone knew of any guide or resources for
> setting up a cluster for MPI based computations in a linux/debian
> environment? Some guides about how to make it work with SLURM would
> also be appreciated.
>
> Thanks in advance for any suggestions, I am often a user of clusters,
> but it is my first time setting one up.
>
> Cheers,
>
> Leonardo
>
> _______________________________________________
> Beowulf mailing list,Beowulf at beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visithttps://beowulf.org/cgi-bin/mailman/listinfo/beowulf
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://beowulf.org/pipermail/beowulf/attachments/20211020/69ebea86/attachment.htm>
More information about the Beowulf
mailing list