From owner-freebsd-stable@FreeBSD.ORG Tue Aug 12 11:40:13 2008 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3C6181065677 for ; Tue, 12 Aug 2008 11:40:13 +0000 (UTC) (envelope-from petefrench@ticketswitch.com) Received: from constantine.ticketswitch.com (constantine.ticketswitch.com [IPv6:2002:57e0:1d4e:1::3]) by mx1.freebsd.org (Postfix) with ESMTP id 02A068FC1E for ; Tue, 12 Aug 2008 11:40:13 +0000 (UTC) (envelope-from petefrench@ticketswitch.com) Received: from dilbert.rattatosk ([10.64.50.6] helo=dilbert.ticketswitch.com) by constantine.ticketswitch.com with esmtps (TLSv1:AES256-SHA:256) (Exim 4.69 (FreeBSD)) (envelope-from ) id 1KSsEK-000HNQ-Ks; Tue, 12 Aug 2008 12:40:00 +0100 Received: from petefrench by dilbert.ticketswitch.com with local (Exim 4.69 (FreeBSD)) (envelope-from ) id 1KSsEK-000AWQ-JL; Tue, 12 Aug 2008 12:40:00 +0100 To: eugen@kuzbass.ru, peterjeremy@optushome.com.au In-Reply-To: <20080812112430.GC64458@server.vk2pj.dyndns.org> Message-Id: From: Pete French Date: Tue, 12 Aug 2008 12:40:00 +0100 Cc: stable@freebsd.org, mh@kernel32.de Subject: Re: lagg(4) and failover X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 Aug 2008 11:40:13 -0000 > As far as I can tell, not especially well :-(. It doesn't seem to detect > much short of layer 1 failure. In particular, shutting down the switch > port will not trigger a failover. Are you using bce devices as your phsyical interfaces ? Take a look at the thread from last week about ifconfig - with the patch posted a port shutdown now *does* trigger a failover quite happily. If you are using e devices then I suggest you try it. > With lacp, all the physical interfaces must be connected to a single > switch. With failover, the physical interfaces will normally be > connected to different switches (so a failure in one switch will not > cause the loss of all connectivity. This is true - with the caveat that certain pairs of switches can be made to appear as a single phsyical device for the purposes of LACP, in which case it works fine for failover. We have two farms here - an old one using a pair of Cisco 3560s and a new one using a pair of 3750-Es. The 3750s will act as a single device and we use LACP on the machines connected to those, but the 3560s appear as a pair of devices, so for those we use failover mode. LACP failover always worked fine, and with the bce patch from last week the normal failover now also works. Nore that you can enable LACP on the 3560,s and it does appear to negotiate and work, but the switches keep changing their idea of which port to use every few seconds. So the connection works, but with high rates of packet loss as a few go missing every time the switch pair flip-flops. -pete.