<div dir="ltr"><br><br><div class="gmail_quote">2008/10/9 Rahul Nabar <span dir="ltr"><<a href="mailto:rpnabar@gmail.com">rpnabar@gmail.com</a>></span><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
I<br>
<br>
I posted this problem on a PowerEdge mailing list but haven't gotten<br>
very far yet. Any suggestions are appreciated!<br>
<font color="#888888"><br>
</font></blockquote><div>Yes.<br><br>(1) Tell your Dell salesman that you have asked for help on this problem on a public mailing list for High Performance Computing. Tell him/her that you need high level Dell support on this. There are Dell customers on this list.<br>
<br>(2) Suspect the RAM. Ask some serious questions of your Dell support about RAM compatibility - HPC applications stress the RAM. Ask, and ask again, if the specific RAM chips you have are certified for that motherboard. Use dmidecode to read out the manufacturer codes of the RAM modules - do you have a mix of manufacturers?<br>
Ask and ask again about BIOS updates being available for these machines.<br>We had a case once of HP machines - even though the BIOSes were versioned the same on 200 machines, there were some differences - turns out you had to go as far as checking the build date.<br>
Get the very latest BIOS version you can.<br><br>(3) The RAM will be the problem - but if you can keep notes and there are specific machines which crash more than others point this out to Dell and maybe suspect the PSUs being weak on those machines.<br>
<br><br><br><br><br><br><br><br><br><br><br><br><br></div></div><br></div>