From owner-freebsd-stable@FreeBSD.ORG Mon May 27 07:59:39 2013 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id 141A9F78 for ; Mon, 27 May 2013 07:59:39 +0000 (UTC) (envelope-from danny@cs.huji.ac.il) Received: from kabab.cs.huji.ac.il (kabab.cs.huji.ac.il [132.65.16.84]) by mx1.freebsd.org (Postfix) with ESMTP id C18A7A6E for ; Mon, 27 May 2013 07:59:38 +0000 (UTC) Received: from pampa.cs.huji.ac.il ([132.65.80.32]) by kabab.cs.huji.ac.il with esmtp id 1UgsL2-000DBa-El; Mon, 27 May 2013 10:59:28 +0300 X-Mailer: exmh version 2.7.2 01/07/2005 with nmh-1.3 To: pyunyh@gmail.com Subject: Re: SunFire X2200 ilo's bge1 DOWN/UP In-reply-to: Your message of Mon, 27 May 2013 15:43:20 +0900. Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Date: Mon, 27 May 2013 10:59:28 +0300 From: Daniel Braniss Message-ID: Cc: freebsd-stable@freebsd.org X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 27 May 2013 07:59:39 -0000 > On Fri, May 24, 2013 at 05:31:13PM +0300, Daniel Braniss wrote: > > hi, after upgrading to 9.1-stable, this particular hardware - SunFire X2200, > > Show me dmesg(bge(4) and brgphy(4) only) and 'ifconfig bge1' output. > bge0: mem 0xfdff0000-0xfdffffff,0xfdfe0000-0xfdfeffff irq 17 at device 4.0 on pci6 bge0: CHIP ID 0x00009003; ASIC REV 0x09; CHIP REV 0x90; PCI-X 133 MHz miibus2: on bge0 brgphy0: PHY 1 on miibus2 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow bge0: Ethernet address: 00:1b:24:5d:5b:bd bge1: mem 0xfdfc0000-0xfdfcffff,0xfdfb0000-0xfdfbffff irq 18 at device 4.1 on pci6 bge1: CHIP ID 0x00009003; ASIC REV 0x09; CHIP REV 0x90; PCI-X 133 MHz miibus3: on bge1 brgphy1: PHY 1 on miibus3 brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow bge1: Ethernet address: 00:1b:24:5d:5b:be sf-10> ifconfig bge1 bge1: flags=8802 metric 0 mtu 1500 options=8009b ether 00:1b:24:5d:5b:be nd6 options=21 media: Ethernet autoselect (100baseTX ) status: active > > is toggeling bge1 DOWN/UP every few hours, this port is being used by the ILO. > > To check, I upgraded another identical host, and the same problem appears. > > What is the last known working revision? I have no idea, but I have older versions, and ill start from the oldets (9.1-prerelease), but it will take time, since it takes hours till it happens. > > > There > > is not correlation with time, since they happend at totaly different times. > > I rebooted both hosts at almost the same time. > > one host : > > uptime: 5:24PM up 6:15, 0 users, load averages: 0.00, 0.00, 0.00 > > May 24 12:53:52 sf-04 kernel: bge1: link state changed to DOWN > > May 24 12:53:55 sf-04 kernel: bge1: link state changed to UP > > May 24 15:34:25 sf-04 kernel: bge1: link state changed to DOWN > > May 24 15:34:28 sf-04 kernel: bge1: link state changed to UP > > > > and > > uptime: 5:24PM up 6:14, 0 users, load averages: 0.00, 0.00, 0.00 > > > > May 24 16:30:44 sf-10 kernel: bge1: link state changed to DOWN > > May 24 16:30:44 sf-10 kernel: bge1: link state changed to UP > > > > this is not serious, the ilo (ssh) connection is ok, but it's anoying, we have > > more > > than 10 of this hosts, and if I upgrade all of them, the logs will fill up > > with this :-) > > > > any ideas? > > > > cheers, > > danny