[Beowulf] Anyone know the LinkAggregation(Trunking) on a switch?

Michael Will mwill at penguincomputing.com
Tue Sep 6 08:00:12 PDT 2005


What is the exact definition of a failed machine.
- turned off
- kernel paniced, halted but still on
- above certain limit load ?

Do you use heartbeat to detect those and STONITH in order to turn off a
failed machine?

Michael
Zhang Hui wrote:

>Hi,all
>
>	I have got a problem with the trunking failover on a switch.  
>	I have implemented the multi-machine Trunking(one link per machine).The packets can be distributed between the links/machines and be at last forwarded to a real server via the IPVS(by Zhang Wensong),like this:
> _
>| |	      |------|	   ________
>|.|__/\___|  1   |_____|      |
>| |  ||   |______|     | real |
>| |  ||trunk           |      |
>| |	 ||	  |------|	   |server|
>|.|__||___|  2   |_____|      |
>|_|  \/   |______|     |______|
>switch
> 
>	And when one link(to one machine) is down, the connection will be transfered to another link/machine, and to "server" at last,session kept.
>	The problem is, when, for example, "1" is down, but the link to "1" is still up(judge from the LED for "1" on the switch).So the "switch" won't think "1" is down, and distribute packets to "1" as usual.Therefore the connection is down.
>	Can't the switch sense the death of a machine,intrinsically? Or something wrone with the configuration of Trunking in the "switch"?
>	By the way, the "switch" is a 3com SS3300TM 16986A one.
>	Can anyone help me? Great appreciation to any reply.
> 					
>
>        Zhang Hui
>        spacetiller at 163.com
>          2005-09-02
>
>_______________________________________________
>Beowulf mailing list, Beowulf at beowulf.org
>To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
>  
>


-- 
Michael Will
Penguin Computing Corp.
Sales Engineer
415-954-2822
415-954-2899 fx
mwill at penguincomputing.com 





More information about the Beowulf mailing list