[Beowulf] Threaded code

Joe Landman landman at scalableinformatics.com
Tue Aug 17 10:26:21 PDT 2004


Art Edwards wrote:

>It is conceivable, but unlikely. Including blas3 in the build increases
>the speed of the code by a factor of 2-3. The jobs take either days or
>weeks.
>  
>

Do you mean Atlas increases speed by 2-3x?  Or simply the BLAS3 bits 
(non-Atlas)?  There are some BLAS3 libs running around in various 
distributions.  SuSE will happily install BLAS for you if you request 
it, in which case you need to worry about link order.

I would look at the output of ldd.  Make sure it is linking in the right 
code.  Next, while the code is running, do an

    ps -fjg

after starting he code in another window.  You should see stuff like 
this ...

crunch:/var/log/httpd # ps fjg
 PPID   PID  PGID   SID TTY      TPGID STAT   UID   TIME COMMAND
25381 25384 25384 25384 pts/7    27532 S        0   0:00 -bash
25384 27532 27532 25384 pts/7    27532 S        0   0:00  \_ tail -f 
mail auth
25242 25245 25245 25245 pts/4    29077 S        0   0:00 -bash
25245 29077 29077 25245 pts/4    29077 R        0   0:00  \_ ps fjg
24959 24962 24962 24962 pts/2    28718 S        0   0:00 -bash
24962 27588 27588 24962 pts/2    28718 S        0   0:04  \_ nedit 
login.html
24962 27618 27618 24962 pts/2    28718 S        0   0:00  \_ nedit 
../etc/bbs.ini
16116 16118 16118 16118 pts/1    23153 S        0   0:00 bash
16118 23153 23153 16118 pts/1    23153 S        0   5:26  \_ ping indy

Your threads will have different process IDs.

also try an

    ps fjma

and look for your process threads.

Joe

>Art
>
>On Tue, Aug 17, 2004 at 01:05:13PM -0400, Joe Landman wrote:
>  
>
>>Art:
>>
>> Is it possible that the parallel section completes very quickly?  Or 
>>that code path is not taken?
>>
>>Joe
>>
>>Art Edwards wrote:
>>
>>    
>>
>>>I mean that when we run the threaded code and look at top, don't see two
>>>processors running. The load is always less than or equal to one. When
>>>we run two separate jobs, we see a load of 2. 
>>>
>>>Art Edwards
>>>
>>>On Tue, Aug 17, 2004 at 07:45:32AM -0700, Alvin Oga wrote:
>>>
>>>
>>>      
>>>
>>>>hi ya
>>>>
>>>>yup.. that is dual cpu
>>>>
>>>>On Tue, 17 Aug 2004, Art Edwards wrote:
>>>>
>>>>  
>>>>
>>>>        
>>>>
>>>>>We are running SuSE 9.1 on a system with 8 GB of ram and two scsi disks.
>>>>>Here is /proc/cpuinfo
>>>>>processor       : 0
>>>>>    
>>>>>
>>>>>          
>>>>>
>>>>good
>>>>
>>>>  
>>>>
>>>>        
>>>>
>>>>>processor       : 1
>>>>>    
>>>>>
>>>>>          
>>>>>
>>>>good
>>>>
>>>>  
>>>>
>>>>        
>>>>
>>>>>Linux bladerunner 2.6.4-52-smp #1 SMP Wed Apr 7 01:58:54 UTC 2004 x86_64
>>>>>x86_64 x86_64 GNU/Linux
>>>>>    
>>>>>
>>>>>          
>>>>>
>>>>good
>>>>
>>>>
>>>>  
>>>>
>>>>        
>>>>
>>>>>>Newbie question:
>>>>>>
>>>>>>We just purchased a dual opteron box that is running an SMP kernel. We
>>>>>>have built a fortran code using g77 and a pthreaded atlas. Currently, we
>>>>>>see only one processor. What really simple thing have we missed?
>>>>>>      
>>>>>>
>>>>>>            
>>>>>>
>>>>so what do you mean by "see only one processor" ??
>>>>
>>>>c ya
>>>>alvin
>>>>  
>>>>
>>>>        
>>>>
>>>
>>>      
>>>
>>-- 
>>Joseph Landman, Ph.D
>>Founder and CEO
>>Scalable Informatics LLC,
>>email: landman at scalableinformatics.com
>>web  : http://www.scalableinformatics.com
>>phone: +1 734 612 4615
>>    
>>
>
>  
>


-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 612 4615




More information about the Beowulf mailing list