Clock Synchronization
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
Mike Snitzer msnitzer at plogic.comMon Jun 24 09:09:28 PDT 2002
- Previous message: Beowulf usage statistics?
- Next message: Clock Synchronization
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Josip Loncaric (josip at icase.edu) said: > "Robert G. Brown" wrote: > > > > I personally think that networked systems, nodes or not, should have the > > time network synchronized if at all possible. > > > > We use ntpd to keep everything sync'd. > > I concur -- ntpd is very good, and it can be very easy on your internal > network (within your cluster, just use broadcast client/server model > instead of polling). > > Unsynchronized systems can misbehave, particularly with schedulers etc. Josip et. al., I've tried to configure a test cluster with the broadcast client/server model and I've yet to get it working... ntpd yields little output that is of any help. So if you or anyone else on the list can offer some assistance based on the following data, I'd really appreciate it. I have Redhat's ntp-4.1.1-1 running on all nodes. /etc/ntp.conf on the master has: server clock.psu.edu # arbitrary choice for external NTP server server 127.127.1.0 # local clock fudge 127.127.1.0 stratum 10 broadcast 192.168.0.255 ttl 6 broadcastdelay 0.008 enable bclient driftfile /etc/ntp/drift authenticate no /etc/ntp.conf on the slaves has: broadcastclient # also tried with broadcastclient 192.168.0.255 broadcastdelay 0.008 enable bclient driftfile /etc/ntp/drift authenticate no /etc/ntp/step-tickers on the slaves has: LC1 NOTE: this really shouldn't needed in a properly configured broadcast client/server config correct? I've read that when ntpd starts as a client it handshakes a bit with an available ntpd server like a normal polling config and then reverts to using the broadcasts. starting ntpd on a slave yields: [root at LC1 ~]# rsh LC2 /sbin/service ntpd restart Shutting down ntpd: [ OK ] ntpd: Synchronizing with time server: [ OK ] Starting ntpd: [ OK ] /var/log/messages on the slave has: Jun 24 11:23:27 LC2 ntpd: ntpd shutdown succeeded Jun 24 11:23:27 LC2 ntpdate[3080]: step time server 192.168.0.1 offset -0.005725 sec Jun 24 11:23:27 LC2 ntpd: succeeded Jun 24 11:23:27 LC2 ntpd[3085]: ntpd 4.1.1 at 1.786 Mon Apr 8 06:30:52 EDT 2002 (1) Jun 24 11:23:27 LC2 ntpd: ntpd startup succeeded tcpdump from the master shows: [root at LC1 ~]# tcpdump -i eth0 "port 123" tcpdump: listening on eth0 11:35:46.125187 LC1.ntp > 192.168.0.255.ntp: v4 bcast strat 3 poll 6 prec -17 (DF) [tos 0x10] 11:36:49.125308 LC1.ntp > 192.168.0.255.ntp: v4 bcast strat 3 poll 6 prec -17 (DF) [tos 0x10] tcpdump from a slave shows: [root at LC2 ~]# tcpdump -i eth0 "port 123" tcpdump: listening on eth0 11:35:46.108483 LC1.ntp > 192.168.0.255.ntp: v4 bcast strat 3 poll 6 prec -17 (DF) [tos 0x10] 11:36:49.110939 LC1.ntp > 192.168.0.255.ntp: v4 bcast strat 3 poll 6 prec -17 (DF) [tos 0x10] SO... the master's ntpd appears to be doing what it should be and it would appear the client (slave) is getting the packet(s) it needs.. I suppose now it's just a question of whether or not the ntpd on the slave is actually listening for the broadcast. ntpq -p on a slave: [root at LC2 ~]# ntpq -p No association ID's returned NOTE: shouldn't the client (slave) ntpd see 192.168.0.255 as a source _IF_ it's ntpd were doing what it should? If anyone any ideas why the clients aren't reporting any association IDs or can provide the details on how you got your broadcast client/server ntp config working please let me know. Thanks. Mike
- Previous message: Beowulf usage statistics?
- Next message: Clock Synchronization
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
