[Beowulf] gpu benchmark

Massimiliano Fatica mfatica at gmail.com
Wed Aug 31 06:23:00 PDT 2016

Hpcg will check the results and can run for an arbitrary time and in parallel.


Sent from my BlackBerry 10 smartphone.
  Original Message  
From: Michael Di Domenico
Sent: Wednesday, August 31, 2016 5:38 AM
To: Beowulf Mailing List
Subject: [Beowulf] gpu benchmark

I'm looking for a benchmark that can keep a gpu busy (ideally both
compute and memory) for 2 or 3 hours. but here's the kick, at the end
of the benchmark it needs to check it's answers

i'm trying to hunt down some potentially bad hardware. linpack works
great for the bulk of it, but the nodes i have, have more gpu power
then ram available, so a linpack run using the full ram of a single
box doesn't run long enough. and successive runs one after another
doesn't seem to trigger it

i can trigger the error using large mpi jobs across many nodes, but
that doesn't let me isolate which gpu was at fault. and since these
are GTX level cards, no ecc and no error diagnostics on the console...

i'm going through my bag of tricks, but haven't come up with anything just yet.
Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

More information about the Beowulf mailing list