[Beowulf] Re: Strange hardware? problems
Orion Poplawski
orion at cora.nwra.com
Tue May 1 14:41:28 PDT 2007
David Mathog wrote:
> Since the same code runs differently on two different Opteron models
> it's probably either a memory access issue or the use of a compiler
> flag that enables some feature on one model that is not present
> on the other. For instance, SSE3 vs. SSE2, although I don't know
> enough about these models to tell you what the most likely flag would
> be. (The fact that it runs ok on the newer one and blows up on the
> older one is consistent with this type of error.)
>
> Assuming gcc, recompile with:
>
> -O0 -g -std=c99 -Wall
>
> and clean up any warnings that result until you get a clean build.
> Repeat with -O3 and -O2, since higher optimization levels sometimes
> uncover logic problems not seen at -O0. Then run the resulting binary
> within valgrind. Fix any memory access violations which are found.
> Valgrind can also alert you to the use of unsupported operations.
>
The code compiles quite cleanly, but I am seeing different behavior with
different compiler flags and different compilers. We'll see if I can
bisect the problem into a small enough box.
Thanks for the poke in that direction...
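
For reference, here is a minimal sketch in C of one class of bug that the
procedure quoted above (building at -O0 and again at -O2/-O3, then running
under valgrind) is aimed at: an accumulator that is never initialized. It
is a hypothetical illustration, not the code under discussion; the function
name mean and the macro DEMO_BUG are made up for this sketch.

```c
#include <assert.h>
#include <stddef.h>

/* Mean of n doubles.
 * DEMO_BUG is a hypothetical macro used only in this sketch: building with
 * -DDEMO_BUG leaves the accumulator uninitialized on purpose. */
double mean(const double *x, size_t n)
{
    assert(n > 0);      /* caller must pass at least one value */
#ifdef DEMO_BUG
    double sum;         /* indeterminate value: result may vary with -O level */
#else
    double sum = 0.0;   /* correct: accumulation starts from a defined value */
#endif
    for (size_t i = 0; i < n; i++)
        sum += x[i];
    return sum / (double)n;
}
```

Built without -DDEMO_BUG, mean() returns 2.5 for the values 1, 2, 3, 4.
Built with -DDEMO_BUG, the behavior is undefined, so the result may differ
between -O0 and -O2 or between compilers. Printing that result from a small
driver run under valgrind should typically produce a report about the use
of an uninitialised value.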
--
Orion Poplawski
Technical Manager       303-415-9701 x222
NWRA/CoRA Division      FAX: 303-415-9702
3380 Mitchell Lane      orion at cora.nwra.com
Boulder, CO 80301       http://www.cora.nwra.com