[Beowulf] GPU diagnostics?
M J Harvey
m.j.harvey at imperial.ac.uk
Tue Mar 31 09:05:19 PDT 2009
David Mathog wrote:
> Have any of you CUDA folks produced diagnostic programs you run during
> "burn in" of new GPU based systems, in order to weed out problem units
> before putting them into service?
A while ago I wrote a CUDA implementation of a subset of the Memtest86+
algorithms,to test the reliability of the consumer GPUs used by our
distributed computing project, GPUGRID. You can get them here:
http://ccs.chem.ucl.ac.uk/~matt/cudamemtest.tgz
That said, we never really used it in anger (most of the stability
problems we were having turned out to be due to 'factory-overclocked'
GPUs) so YMMV.
MJH
--
Matt Harvey Email: m.j.harvey at imperial.ac.uk
HPC Systems Support Analyst
Imperial College London
PGP Key ID: 0xD234302E
http://www.imperial.ac.uk/ict/services/highperformancecomputing
More information about the Beowulf
mailing list