Thank you all so much for the advice so far. This has helped me see a few more of the things that I did not realize at first.<br><br>For a little info on the project, I developed this project as a tool to work on my Senior Thesis in a year or so. Doing computational nuclear physics requires such resources. It will also be used heavily for Monte Carlo Simulations and just about any other form of computational physics. The two named are definite projects that are already on the line up for when I do get the cluster up and functional.<br>
<br>I want to be able to make the cluster easily expandable, in that I will be starting with only a few machines (about 2-8), but will be acquiring more as time goes on. The university that I am attending surpluses out "old" machines every 4 years, and we have set up a program where we can get a percentage of the surplus machines for out cluster.<br>
<br>So, as for size. Initially it will be a smaller cluster, but will grow as time goes on.<br><br>Being new to the Beowulf world, I am just mainly looking for some advice as to what distro to use (I would never dream of setting up a cluster on windows) and if there were any little tricks that weren't mentioned in the setup how to guides.<br>
<br>Oh, and I would also like to know if there was a way to set up a task priority where if I had only only application running it would use all the processors on the cluster, but if I had two tasks sent to the cluster then it would split the load between them and run both simultaneously, but still using a maximum for the needed processors.<br>
<br>Thanks again so much,<br><br>Berkley<br><br><div class="gmail_quote">On Feb 8, 2008 9:11 AM, Robert G. Brown <<a href="mailto:rgb@phy.duke.edu">rgb@phy.duke.edu</a>> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div class="Ih2E3d">On Thu, 7 Feb 2008, Berkley Starks wrote:<br><br>> Hello all,<br>><br>> I've been a computer user for the past several years working in different<br>> areas of the IT world. I've recently been commissioned by my university to<br>
> set up the first operating Beowulf Cluster.<br>><br>> I'm am moderately familiar with the Linux OS, having ran it for the past<br>> several years using the distro's of Debian, Ubuntu, Fedora Core, and<br>
> Mandriva.<br>><br>> With setting up this new cluster I would like any advice possible on what OS<br>> to use, how to set it up, and any other pertinent information that I might<br>> need.<br><br></div>This question has been answered on-list in detail a few zillion times.<br>
I'd suggest consulting (in rough order):<br><br> a) The list archives (now that you're a member you can get to them,<br>although they are digested and googleable for the most part anyway).<br><br> b) Google. For example, there is a lovely howto here:<br>
<br> <a href="http://www.linux.org/docs/ldp/howto/Parallel-Processing-HOWTO.html" target="_blank">http://www.linux.org/docs/ldp/howto/Parallel-Processing-HOWTO.html</a><br><br>that is remarkably current and a good quick place to start.<br>
<br> c) Feel free to browse my free online book here:<br><br> <a href="http://www.phy.duke.edu/%7Ergb/Beowulf/beowulf_book.php" target="_blank">http://www.phy.duke.edu/~rgb/Beowulf/beowulf_book.php</a><br><br>I'm working on making it paper-printable via lulu, but I need time I<br>
don't have and so that project languishes a bit. You "can" get a paper<br>copy there if you want, but it is pretty much what is on the free<br>website including the holes.<br><div class="Ih2E3d"><br>> Oh, and the cluster will be used for computational physics. I am a physics<br>
> major making it for the physics department here. It will need to be able to<br>> use C++ and Fortran at a bare minimum.<br><br></div>C, C++ and Fortran are all no problem. The more important questions<br>are:<br>
<br> a) How coupled are the parallel tasks? That is, do you want a cluster<br>that can run N independent jobs on N independent nodes (where the jobs<br>don't communicate with each other at all), or do you want a cluster<br>
where the N nodes all do work on a common task as part of one massive<br>parallel program? If the former, you're in luck and cluster design is<br>easy and the cluster purchase will be cheap.<br><br> b) If they are coupled, are the tasks "tightly coupled" so each<br>
subtask can only advance a little bit before communications are required<br>in order to take the next step? "Synchronous" so all steps have to be<br>completed on all nodes before any can advance? Are the messages really<br>
big (bandwidth limited) or tiny and frequent (latency limited)?<br><br>If any of these latter answers are "yes", post a detailed description of<br>the tasks (as best you can) to get some advice on choosing a network, as<br>
that's the design parameter that is largely controlled by the answers.<br><br> rgb<br><br>><br>> Thanks again<br>><br><font color="#888888"><br>--<br>Robert G. Brown Phone(cell): 1-919-280-8443<br>
Duke University Physics Dept, Box 90305<br>Durham, N.C. 27708-0305<br>Web: <a href="http://www.phy.duke.edu/%7Ergb" target="_blank">http://www.phy.duke.edu/~rgb</a><br>Book of Lilith Website: <a href="http://www.phy.duke.edu/%7Ergb/Lilith/Lilith.php" target="_blank">http://www.phy.duke.edu/~rgb/Lilith/Lilith.php</a><br>
Lulu Bookstore: <a href="http://stores.lulu.com/store.php?fAcctID=877977" target="_blank">http://stores.lulu.com/store.php?fAcctID=877977</a><br></font></blockquote></div><br>