[Beowulf] Please help test compiler/hardware issue

Orion Poplawski orion at cora.nwra.com
Thu May 3 15:48:46 PDT 2007


Okay, I have a test case for the problem I reported before that I've 
attached.

We have two pairs of identical machines:

- 2 Tyan S2882 Dual Processor 244 stepping 10
- 2 Tyan S2882-D Dual processor dual core Opteron 275 stepping 2

The attached code when compiled with the Portland Group Fortran compiler 
with -O2 and run on either of the 244's will abort in random locations:

[orion at coop00 rams.debug]$ pgf95 -O2 -o testatob testatob.f90
[orion at coop00 rams.debug]$ ./testatob
  checkatob abort n=       246500 , i=         4685  a(i)=    8712085.
   b(i)=    8465585.
Abort
[orion at coop00 rams.debug]$ ./testatob
  checkatob abort n=       246500 , i=       145817  a(i)=    9592717.
   b(i)=    8853217.
Abort

[orion at coop01 rams.debug]$ time ./testatob
  checkatob abort n=       246500 , i=       118169  a(i)=    9565069.
   b(i)=    8825569.
Aborted

real    0m31.842s
user    0m16.476s
sys     0m0.060s


Haven't seen it run longer than 1 minute yet.

However, it runs fine on the 275's (or at least I haven't seen it crash 
yet).  It also runs fine on the 244's when compiled with -O1.

So, I guess this points to a hardware issue, but it may be a somewhat 
generalized hardware issue.  I'd love to hear reports on other 
(particularly other Tyan S2882 dual 244's) systems.

-- 
Orion Poplawski
Technical Manager                     303-415-9701 x222
NWRA/CoRA Division                    FAX: 303-415-9702
3380 Mitchell Lane                  orion at cora.nwra.com
Boulder, CO 80301              http://www.cora.nwra.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: testatob.f90
Type: text/x-fortran
Size: 844 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070503/3a67034e/attachment.bin>


More information about the Beowulf mailing list