[Beowulf] How to know if infiniband network works?

tegner at renget.se tegner at renget.se
Wed Aug 2 23:59:16 PDT 2017


I often use
mpirun --np 2 --machinefile mpd.hosts mpitests-osu_latency
mpirun --np 2 --machinefile mpd.hosts mpitests-osu_bw
to test latency and bandwidth between two specific nodes (listed in mpd.hosts). On a CentOS/Red Hat system these tests can be installed from the package mpitests-openmpi.
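
To make the bandwidth result easier to read at a glance, the output can be piped through a small filter. The following is only a minimal sketch: the function name peak_bw is hypothetical, and it assumes the usual OSU output layout, in which header lines start with '#' and each data row is "message size, bandwidth in MB/s".

```shell
#!/bin/sh
# Sketch: print the largest bandwidth value found in osu_bw output on stdin.
# Assumption: header lines start with '#', data rows are "<size> <MB/s>".
peak_bw() {
    awk '
        # Skip header/comment lines and anything without two columns.
        !/^#/ && NF >= 2 {
            if ($2 + 0 > max) {
                max = $2 + 0   # numeric value used for comparison
                best = $2      # original text, printed unchanged
            }
        }
        END { print best }
    '
}

# Intended use (needs the two nodes listed in mpd.hosts):
#   mpirun --np 2 --machinefile mpd.hosts mpitests-osu_bw | peak_bw
```

As a rough guide, a peak close to 100 MB/s would suggest the traffic is going over gigabit Ethernet rather than InfiniBand, since 1 Gb/s is at most 125 MB/s.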

/jon


On 2 August 2017 at 18:44:17 +02:00, Faraz Hussain <info at feacluster.com> wrote:

> I have inherited a 20-node cluster that supposedly has an InfiniBand network. I am testing some MPI applications and am seeing no performance improvement with multiple nodes. So I am wondering if the InfiniBand network even works?
> 
> The output of ifconfig -a shows an ib0 and an ib1 interface. I ran ethtool ib0 and it shows:
> 
> Speed: 40000Mb/s
> Link detected: no
> 
> and for ib1 it shows:
> 
> Speed: 10000Mb/s
> Link detected: no
> 
> I am assuming this means it is down? Any idea how to debug further and restart it?
> 
> Thanks!
> 
