[Beowulf] Looking for NFS Server performance comparison

Stuart Midgley sdm900 at gmail.com
Fri Dec 16 16:08:46 PST 2005


Hi Bill

Try having multiple servers and then put half the users on one server  
and half on the other (or whatever fraction you are able to do).   
Then automount each home directory from /home1 and /home2 on the two  
servers into /home on the compute nodes.  That way, your users still  
see /home/username as their home directory (even though it really  
lives under /home1 or /home2).
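
A minimal sketch of the autofs maps for that layout (the server  
names nfs1/nfs2 and the map locations are just placeholders here):

    # /etc/auto.master
    /home   /etc/auto.home

    # /etc/auto.home -- one entry per user, pointing at their
    # native server; split the users roughly in half
    alice   -rw,hard,intr   nfs1:/home1/alice
    bob     -rw,hard,intr   nfs2:/home2/bob

An explicit per-user map like this is what lets you split the load;  
a single wildcard entry would send everyone to the same server.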

You can then pull some tricks at login to ensure that a user (upon  
login) ends up on their native server (so that interactive tar/gzip  
etc. are fast).  Create a new login shell which checks which server  
is the native one for that user, does an ssh behind the scenes to  
the appropriate node without them knowing, and then runs their  
normal shell.  It's good to have a direct back-to-back gig-E link  
between the servers to do this across.
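
A rough sketch of that wrapper shell, assuming a little map file  
/etc/home-servers of "username servername" pairs (the file name and  
the bash fallback are my own invention, adjust to taste):

    #!/bin/sh
    # /usr/local/bin/homeshell -- set as the users' login shell
    NATIVE=$(awk -v u="$(id -un)" '$1 == u {print $2}' /etc/home-servers)
    if [ -n "$NATIVE" ] && [ "$NATIVE" != "$(hostname -s)" ]; then
        # silently hop to the user's native home server over the
        # back-to-back gig-E link; their shell there runs this same
        # wrapper, sees it is already on the right host, and drops
        # straight into the real shell
        exec ssh -t "$NATIVE"
    fi
    exec /bin/bash --login "$@"

A production version also has to pass "-c command" style invocations  
through to ssh so that scp and remote commands keep working.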

Other things I've done to improve performance are to increase the  
number of NFS server threads to 64 and set the read/write size  
(rsize/wsize) to 8192.  That was the optimal NFS configuration on a  
dual hyper-threaded CPU server.  We see roughly 40MB/s over gig-E  
for large writes (so 2 servers give you 80MB/s).
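
For reference, those knobs look roughly like this (the  
/etc/sysconfig path is Red Hat style; other distros keep the thread  
count elsewhere):

    # on the NFS server: /etc/sysconfig/nfs
    RPCNFSDCOUNT=64

    # on the compute nodes: mount options (in fstab or the autofs map)
    nfs1:/home1  /home1  nfs  rw,hard,intr,rsize=8192,wsize=8192  0 0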

You can also set up your DNS to round-robin your front-end servers:  
just create multiple A records for the one DNS name and give it the  
IP addresses of your multiple servers.  That way, the users'  
incoming/outgoing network traffic will (roughly) get spread over  
your multiple servers.
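
The round-robin bit is just multiple A records in the zone file,  
something like (names and addresses made up):

    ; zone file for example.edu
    login   IN  A   10.1.1.11
    login   IN  A   10.1.1.12

BIND, by default, hands the addresses back in rotating order, so  
successive logins land on alternate servers.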

I run a moderately sized cluster in this way and it works really,  
really well.  Most users are not even aware that we have multiple  
servers.

Stu.



On 08/12/2005, at 6:01, Bill Rankin wrote:

> Hi Gang,
>
> This may be a rehash, but I'm looking for any references to papers  
> that address the issue of NFS server scalability in a cluster  
> environment.  Our current cluster here at Duke is right at 500  
> nodes (and growing) and the single NFS server with attached SCSI  
> drives serving all data to all nodes just doesn't seem to hack it  
> anymore.
>
> I know that there are clustered solutions like PVFS, IBRIX, et al.,  
> as well as the dedicated filer approach (NetApp, EMC, etc.) and we  
> are looking at those. But first we need to convince the powers that  
> be that a Linux-based NFS server just ain't going to cut it as we  
> move forward.
>
> So if anyone knows of good documents (white papers, conference  
> papers, etc.) that will support my assertion, I would appreciate a  
> pointer to them.
>
> Thanks,
>
> -bill


--
Dr Stuart Midgley
sdm900 at gmail.com




