<meta http-equiv="Content-Type" content="text/html; charset=utf-8"><meta name="ProgId" content="Word.Document"><meta name="Generator" content="Microsoft Word 11"><meta name="Originator" content="Microsoft Word 11"><link rel="File-List" href="file:///C:%5CDOCUME%7E1%5CMUHAMMAD%5CLOCALS%7E1%5CTemp%5Cmsohtml1%5C01%5Cclip_filelist.xml"><style>
<!--
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{mso-style-parent:"";
margin:0in;
margin-bottom:.0001pt;
mso-pagination:widow-orphan;
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-font-family:"Times New Roman";}
@page Section1
{size:8.5in 11.0in;
margin:1.0in 1.25in 1.0in 1.25in;
mso-header-margin:.5in;
mso-footer-margin:.5in;
mso-paper-source:0;}
div.Section1
{page:Section1;}
/* List Definitions */
@list l0
{mso-list-id:1954289362;
mso-list-type:hybrid;
mso-list-template-ids:1976578426 67698705 67698713 67698715 67698703 67698713 67698715 67698703 67698713 67698715;}
@list l0:level1
{mso-level-text:"%1\)";
mso-level-tab-stop:.5in;
mso-level-number-position:left;
text-indent:-.25in;}
ol
{margin-bottom:0in;}
ul
{margin-bottom:0in;}
-->
</style>
<p class="MsoNormal">Hello All,</p>
<p class="MsoNormal"> </p>
<p class="MsoNormal">I perceive following computing setups for GP-GPUs,</p>
<p class="MsoNormal"> </p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><span style="">1)<span style="font-family: "Times New Roman"; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span>ONE
PC with ONE CPU and ONE GPU,</p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><span style="">2)<span style="font-family: "Times New Roman"; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span>ONE
PC with more than one CPUs and ONE GPU</p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><span style="">3)<span style="font-family: "Times New Roman"; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span>ONE
PC with one CPU and more than ONE GPUs</p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><span style="">4)<span style="font-family: "Times New Roman"; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span>ONE
PC with TWO CPUs (e.g. Xeon Nehalems) and more than ONE GPUs (e.g. Nvidia
C1060)</p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><span style="">5)<span style="font-family: "Times New Roman"; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span>Cluster
of PCs with each node having ONE CPU and ONE GPU</p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><span style="">6)<span style="font-family: "Times New Roman"; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span>Cluster
of PCs with each node having more than one CPUs and ONE GPU</p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><span style="">7)<span style="font-family: "Times New Roman"; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span>Cluster
of PCs with each node having ONE CPU and more than ONE GPUs</p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><span style="">8)<span style="font-family: "Times New Roman"; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span>Cluster
of PCs with each node having more than one CPUs and more than ONE GPUs.</p>
<p class="MsoNormal"> <br></p>
<p class="MsoNormal">Which of these are good/realistic/practical; which are not?
Which are quite “natural” to use for CUDA based programs?</p>
<p class="MsoNormal"> </p>
<p class="MsoNormal">IMPORTANT QUESTION: Will a cuda based program will be
equally good for some/all of these setups or we need to write different CUDA
based programs for each of these setups to get good efficiency?</p>
<p class="MsoNormal"> </p>
<p class="MsoNormal">Comments are welcome also for <span style="background: yellow none repeat scroll 0% 0%; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;">AMD/ATI FireStream</span>.</p>
<p class="MsoNormal"> </p>
<p class="MsoNormal">With best regards,<br>
AMJAD ALI. </p>