<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<META content="MSHTML 5.50.4933.1800" name=GENERATOR></HEAD>
<BODY>
<DIV>
<DIV><SPAN class=043122919-05072005><FONT face=Arial><FONT size=2><SPAN
class=037250413-06072005>Hello all</SPAN></FONT></FONT></SPAN></DIV>
<DIV><SPAN class=043122919-05072005><FONT face=Arial><FONT size=2><SPAN
class=037250413-06072005>I have some general Beowulf/Ganglia configuration woes
that I am seeking help with!</SPAN></FONT></FONT></SPAN></DIV>
<DIV><SPAN class=043122919-05072005><FONT face=Arial><FONT size=2><SPAN
class=037250413-06072005></SPAN></FONT></FONT></SPAN> </DIV>
<DIV><SPAN class=043122919-05072005><FONT face=Arial><FONT size=2><SPAN
class=037250413-06072005>1></SPAN> I have two Beowulf-style
clusters.</FONT></FONT></SPAN></DIV>
<DIV><FONT face=Arial size=2><SPAN class=043122919-05072005>I would like to use
Cluster A to monitor Cluster B. Cluster A has 18 nodes; Cluster B has 90
nodes.</SPAN></FONT></DIV>
<DIV><FONT face=Arial size=2><SPAN
class=043122919-05072005></SPAN></FONT> </DIV>
<DIV><FONT face=Arial size=2><SPAN class=043122919-05072005>Monitoring on
Cluster A is no problem. But on Cluster B, for whatever reason, the
gmetad running on the headnode only "sees" about half of the gmonds
running on the corresponding compute nodes. I know the gmonds are running
on<SPAN class=037250413-06072005> each of the </SPAN>90 <SPAN
class=037250413-06072005>compute </SPAN>nodes, as a simple ps tells me
so. Further, I can go to each compute node in turn, connect to localhost port 8649,
and see the XML dump. Yet the gmetad on the headnode only sees
about half of the compute nodes. Any idea why?</SPAN></FONT></DIV>
<DIV><FONT face=Arial size=2><SPAN
class=043122919-05072005></SPAN></FONT> </DIV>
<DIV><SPAN class=043122919-05072005><FONT face=Arial><FONT size=2><SPAN
class=037250413-06072005>2</SPAN>> Does a gmetad need to be running on the
headnode of Cluster B if I wish to monitor Cluster B from Cluster A? Also,
in general, should a gmond be running on my headnodes? I have seen that
when a gmond is also running on the headnode, the corresponding gmetad ignores
all the other gmonds and only reports the one on the
headnode.</FONT></FONT></SPAN></DIV>
<DIV><FONT face=Arial size=2><SPAN
class=043122919-05072005></SPAN></FONT> </DIV>
<DIV><SPAN class=043122919-05072005><FONT face=Arial><FONT size=2><SPAN
class=037250413-06072005>3</SPAN>> On Cluster B, should the data_source line
in the gmetad.conf file list the IP addresses of all the corresponding
compute nodes? I seem to get a variety of results and behaviors depending
on what I put there.</FONT></FONT></SPAN></DIV>
<DIV><FONT face=Arial size=2><SPAN
class=043122919-05072005></SPAN></FONT> </DIV>
<DIV><SPAN class=043122919-05072005><FONT face=Arial><FONT size=2><SPAN
class=037250413-06072005>4></SPAN> The Ganglia conf files seem much
happier if I use IP addresses instead of FQDNs. Is this really the
case?</FONT></FONT></SPAN></DIV>
<DIV><SPAN class=043122919-05072005><FONT face=Arial
size=2></FONT></SPAN> </DIV>
<DIV><SPAN class=043122919-05072005><SPAN class=037250413-06072005><FONT
face=Arial size=2>5> In general, what should be on the data_source line of my
gmetad.conf file? The IP addresses of every single gmond running on the
corresponding compute nodes?</FONT></SPAN></SPAN></DIV>
<DIV><FONT face=Arial size=2><SPAN
class=043122919-05072005></SPAN></FONT> </DIV>
<DIV><FONT face=Arial size=2><SPAN class=043122919-05072005>If you have some
general docs on how to correctly set up Ganglia on a grid of Beowulf clusters
that would be great to have!</SPAN></FONT></DIV>
<DIV><FONT face=Arial size=2><SPAN class=043122919-05072005>Thanks for any and
all help!</SPAN></FONT></DIV>
<DIV><FONT face=Arial size=2><SPAN class=043122919-05072005>Sincerely<BR>Dan
Roberts</SPAN></FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV></DIV></BODY></HTML>