[Beowulf] How to know if infiniband network works?
tegner at renget.se
tegner at renget.se
Wed Aug 2 23:59:16 PDT 2017
I often use
mpirun --np 2 --machinefile mpd.hosts mpitests-osu_latency
mpirun --np 2 --machinefile mpd.hosts mpitests-osu_bw
To test bandwidth and latency between to specific nodes (listed in mpd.hosts). On a CentOS/Redhat system these can be installed from the package mpitests-openmpi.
/jon
On 2 August 2017 at 18:44:17 +02:00, Faraz Hussain <info at feacluster.com> wrote:
> I have inherited a 20-node cluster that supposedly has an infiniband network. I am testing some mpi applications and am seeing no performance improvement with multiple nodes. So I am wondering if the Infiband network even works?
>
> The output of ifconfig -a shows an ib0 and ib1 network. I ran ethtools ib0 and it shows:
>
> Speed: 40000Mb/s
> Link detected: no
>
> and for ib1 it show:
>
> Speed: 10000Mb/s
> Link detected: no
>
> I am assuming this means it is down? Any idea how to debug further and restart it?
>
> Thanks!
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit <http://www.beowulf.org/mailman/listinfo/beowulf>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20170803/a4a41ff2/attachment.html>
More information about the Beowulf
mailing list