[Beowulf] How to know if infiniband network works?

Joe Landman joe.landman at gmail.com
Wed Aug 2 09:50:24 PDT 2017


Start with

     ibv_devinfo

     ibstat

     ibstatus


and see what (if anything) they report. You want the port state to be
Active and the physical state to be LinkUp.
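
If those tools are not installed on a node, the kernel exposes similar
per-port information under /sys/class/infiniband once the IB driver is
loaded. Here is a minimal sketch, assuming a Linux node with that sysfs
tree present. The function name ib_port_report and its optional root
argument are made up for illustration and are not part of any
InfiniBand package:

```shell
#!/bin/sh
# Print state, physical state and rate for every InfiniBand port in sysfs.
# ib_port_report is a hypothetical helper name; the optional first argument
# overrides the sysfs root (default: /sys/class/infiniband).
ib_port_report() {
    root="${1:-/sys/class/infiniband}"
    found=0
    for port in "$root"/*/ports/*; do
        # When nothing matches, the glob stays literal; skip it.
        [ -r "$port/state" ] || continue
        found=1
        dev=$(basename "$(dirname "$(dirname "$port")")")
        num=$(basename "$port")
        state=$(cat "$port/state")
        # phys_state and rate may be missing; treat them as empty then.
        phys=$(cat "$port/phys_state" 2>/dev/null || true)
        rate=$(cat "$port/rate" 2>/dev/null || true)
        echo "$dev port $num: state=$state phys_state=$phys rate=$rate"
    done
    if [ "$found" -eq 0 ]; then
        echo "no InfiniBand ports found under $root"
        return 1
    fi
}
```

Call ib_port_report with no arguments on each node. In these sysfs
files a healthy port reads "4: ACTIVE" with phys_state "5: LinkUp". A
port sitting at "2: INIT" has a physical link but usually means no
subnet manager (e.g. opensm) is running on the fabric.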

Second, how did you compile/run your MPI code?


On 08/02/2017 12:44 PM, Faraz Hussain wrote:
> I have inherited a 20-node cluster that supposedly has an InfiniBand 
> network. I am testing some MPI applications and am seeing no 
> performance improvement with multiple nodes. So I am wondering if the 
> InfiniBand network even works?
>
> The output of ifconfig -a shows an ib0 and ib1 network. I ran ethtool 
> ib0 and it shows:
>
>         Speed: 40000Mb/s
>         Link detected: no
>
> and for ib1 it shows:
>
>         Speed: 10000Mb/s
>         Link detected: no
>
> I am assuming this means it is down? Any idea how to debug further and 
> restart it?
>
> Thanks!

-- 
Joe Landman
e: joe.landman at gmail.com
t: @hpcjoe
w: https://scalability.org
g: https://github.com/joelandman
l: https://www.linkedin.com/in/joelandman


