From owner-freebsd-current@FreeBSD.ORG Sat May 8 09:40:23 2004
Delivered-To: freebsd-current@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
    by hub.freebsd.org (Postfix) with ESMTP id 5A06616A4CE;
    Sat, 8 May 2004 09:40:23 -0700 (PDT)
Received: from mail2.pubnix.net (mail2.pubnix.net [192.172.250.12])
    by mx1.FreeBSD.org (Postfix) with ESMTP id A3EF743D6B;
    Sat, 8 May 2004 09:40:21 -0700 (PDT)
    (envelope-from ahebert@pubnix.net)
Received: (from root@localhost)
    by mail2.pubnix.net (8.12.3/8.12.3) id i48GbB40001268
    for freebsd-current@freebsd.org; Sat, 8 May 2004 12:37:11 -0400 (EDT)
    (envelope-from ahebert@pubnix.net)
Received: from pubnix.net (aal.pubnix.net [64.235.216.13]) (authenticated bits=0)
    by mail2.pubnix.net (8.12.3/8.12.3) with ESMTP id i48Gb9HR001207;
    Sat, 8 May 2004 12:37:09 -0400 (EDT)
    (envelope-from ahebert@pubnix.net)
Message-ID: <409D0D7A.8040604@pubnix.net>
Date: Sat, 08 May 2004 12:40:26 -0400
From: Alain Hebert
Organization: PubNIX, Inc.
User-Agent: Mozilla/5.0 (X11; U; FreeBSD i386; en-US; rv:1.6) Gecko/20040502
X-Accept-Language: en-us, en
MIME-Version: 1.0
To: freebsd-current@freebsd.org
Content-Type: multipart/mixed; boundary="------------050606010006000004010202"
X-scanner: scanned by Yeti v1.0 - (http://www.kinovations.com)
X-Mailman-Approved-At: Sun, 09 May 2004 05:08:36 -0700
Subject: FreeBSD/Current: As of 2004/05/08 at 11:00am EST - FSCK lockup
X-BeenThere: freebsd-current@freebsd.org
X-Mailman-Version: 2.1.1
Precedence: list
Reply-To: ahebert@pubnix.net
List-Id: Discussions about the use of FreeBSD-current
X-List-Received-Date: Sat, 08 May 2004 16:40:23 -0000

This is a multi-part message in MIME format.
--------------050606010006000004010202
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit

Hi,

Situation:

    The computer locks up hard while running fsck (no keyboard lights,
    no kernel debugger, no network, no disk activity).
    (Dual P4 Xeon 2.4GHz, 2GB RAM, using a Promise PDC20378 SATA150
    controller in mirror mode on two identical Seagate ST3160023AS drives.)

    This first happened right after a lockup with XFree, so I don't have
    a theory, nor any information that would help me figure out which
    fix/patch introduced this problem.

Work done:

    Platform tests: CPUs, memory, disk, temperature, power supply. All OK.
    Was using a 2-week-old current; updated to the latest current this
    morning, no luck.
    Changed the kernel from SMP to UP, no luck.
    Tried different methods/parameters with fsck, no luck.
    Been googling for references or similar situations, without luck.

Things to do:

    Have yet to try with RELENG_5_2.
    Use strace to see where this happens.
    Add debugging code to fsck to find where this happens.

Status:

    The file system seems OK: all files are there, find works, there is
    no obvious inconsistency, and dd if=/dev/ar0s1 reads the entire disk
    without any errors.

Request:

    Any hints?

    This combination of hardware has been deployed (10+ servers) for
    months now, and this is the first problem I have encountered. But
    this is also the only machine running current.
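For the "Things to do" item about finding where fsck stops, one possible approach is to capture its output line by line and force every line to stable storage before the machine freezes. The following is only a minimal sketch under assumptions that are not part of this report: the script name, the `run_logged` helper, and the log path are hypothetical, and the log file is assumed to live on a device other than the one being checked. It also assumes the checker prints progress lines (such as phase banners) often enough to be useful; a program that buffers its output when writing to a pipe may deliver lines late.

```python
import os
import subprocess
import sys


def run_logged(cmd, log_path):
    """Run cmd, append each output line to log_path, and sync it to disk.

    Hypothetical helper: after a hard lockup, the last line found in the
    log is the last output the command produced before the freeze.
    """
    with open(log_path, "a") as log:
        with subprocess.Popen(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # merge stderr into the same stream
            text=True,
            bufsize=1,                 # line-buffered on the reading side
        ) as proc:
            for line in proc.stdout:
                sys.stdout.write(line)
                log.write(line)
                log.flush()             # hand the line to the OS
                os.fsync(log.fileno())  # ask the OS to commit it to the device
            return proc.wait()


if __name__ == "__main__":
    # Hypothetical usage (paths are placeholders):
    #   python3 runlog.py /mnt/other/fsck.log fsck -n /dev/ar0s1a
    if len(sys.argv) < 3:
        sys.exit("usage: runlog.py LOGFILE COMMAND [ARGS...]")
    sys.exit(run_logged(sys.argv[2:], sys.argv[1]))
```

Calling fsync after every line is slow, but speed matters little when the goal is only to see how far the run gets before the lockup.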
Config:

-----
# disklabel -r /dev/ar0s1
# /dev/ar0s1:
8 partitions:
#        size    offset    fstype   [fsize bsize bps/cpg]
  a: 304107709   8388608    4.2BSD     2048 16384 28552
  b:   8388608         0      swap
  c: 312496317         0    unused        0     0         # "raw" part, don't edit
-----
# tunefs -p /dev/ar0s1a
tunefs: ACLs: (-a)                                         disabled
tunefs: MAC multilabel: (-l)                               disabled
tunefs: soft updates: (-n)                                 enabled
tunefs: maximum blocks per file in a cylinder group: (-e)  2048
tunefs: average file size: (-f)                            16384
tunefs: average number of files in a directory: (-s)       64
tunefs: minimum percentage of free space: (-m)             8%
tunefs: optimization preference: (-o)                      time
tunefs: volume label: (-L)

-- 
Alain Hebert                                ahebert@pubnix.net
PubNIX Inc.
P.O. Box 175       Beaconsfield, Quebec     H9W 5T7
tel 514-990-5911   http://www.pubnix.net    fax 514-990-9443

--------------050606010006000004010202
Content-Type: text/plain; name="a1"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline; filename="a1"

Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 5.2-CURRENT #1: Sat May 8 10:56:30 EDT 2004
    root@aal2.pubnix.net:/usr/src/sys/i386/compile/AAL
Preloaded elf kernel "/boot/kernel/kernel" at 0xc0945000.
Preloaded elf module "/boot/kernel/acpi.ko" at 0xc09451f4.
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2405.46-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0xf27  Stepping = 7
  Features=0xbfebfbff
  Hyperthreading: 2 logical CPUs
real memory  = 2147418112 (2047 MB)
avail memory = 2095943680 (1998 MB)
ACPI APIC Table:
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 cpu0 (BSP): APIC ID: 0
 cpu1 (AP): APIC ID: 1
 cpu2 (AP): APIC ID: 6
 cpu3 (AP): APIC ID: 7
ioapic0: Changing APIC ID to 4
ioapic0 irqs 0-23 on motherboard
Pentium Pro MTRR support enabled
random:
npx0: [FAST]
npx0: on motherboard
npx0: INT 16 interface
acpi0: on motherboard
acpi0: [GIANT-LOCKED]
pcibios: BIOS version 2.10
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0
cpu0: port 0x530-0x537 on acpi0
cpu1: port 0x530-0x537 on acpi0
cpu2: port 0x530-0x537 on acpi0
cpu3: port 0x530-0x537 on acpi0
acpi_tz0: port 0x530-0x537 on acpi0
acpi_button0: on acpi0
acpi_button1: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
agp0: mem 0xf0000000-0xf7ffffff at device 0.0 on pci0
agp0: Reserved 0x8000000 bytes for rid 0x10 type 3 at 0xf0000000
pcib1: at device 1.0 on pci0
pci1: on pcib1
pcib1: slot 0 INTA is routed to irq 16
pci1: at device 0.0 (no driver attached)
pci1: at device 0.1 (no driver attached)
pcib2: at device 3.0 on pci0
pcib2: could not get PCI interrupt routing table for \\_SB_.PCI0.CSAB - AE_NOT_FOUND
pci2: on pcib2
pci2: at device 1.0 (no driver attached)
uhci0: port 0xbc00-0xbc1f irq 16 at device 29.0 on pci0
uhci0: Reserved 0x20 bytes for rid 0x20 type 4 at 0xbc00
uhci0: [GIANT-LOCKED]
usb0: on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: port 0xb000-0xb01f irq 19 at device 29.1 on pci0
uhci1: Reserved 0x20 bytes for rid 0x20 type 4 at 0xb000
uhci1: [GIANT-LOCKED]
usb1: on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2: port 0xb400-0xb41f irq 18 at device 29.2 on pci0
uhci2: Reserved 0x20 bytes for rid 0x20 type 4 at 0xb400
uhci2: [GIANT-LOCKED]
usb2: on uhci2
usb2: USB revision 1.0
uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhci3: port 0xb800-0xb81f irq 16 at device 29.3 on pci0
uhci3: Reserved 0x20 bytes for rid 0x20 type 4 at 0xb800
uhci3: [GIANT-LOCKED]
usb3: on uhci3
usb3: USB revision 1.0
uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
ehci0: mem 0xfc100000-0xfc1003ff irq 23 at device 29.7 on pci0
ehci0: Reserved 0x400 bytes for rid 0x10 type 3 at 0xfc100000
ehci0: [GIANT-LOCKED]
ehci_pci_attach: companion usb0
ehci_pci_attach: companion usb1
ehci_pci_attach: companion usb2
ehci_pci_attach: companion usb3
usb4: EHCI version 1.0
usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3
usb4: on ehci0
usb4: USB revision 2.0
uhub4: (0x8086) EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub4: 8 ports with 8 removable, self powered
pcib3: at device 30.0 on pci0
pci3: on pcib3
fwohci0: mem 0xfb120000-0xfb123fff,0xfb126000-0xfb1267ff irq 20 at device 3.0 on pci3
fwohci0: Reserved 0x800 bytes for rid 0x10 type 3 at 0xfb126000
fwohci0: [GIANT-LOCKED]
fwohci0: OHCI version 1.10 (ROM=1)
fwohci0: No. of Isochronous channel is 4.
fwohci0: EUI64 00:e0:18:00:00:43:2b:be
fwohci0: Phy 1394a available S400, 2 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: on fwohci0
sbp0: on firewire0
fwe0: on firewire0
if_fwe0: Fake Ethernet address: 02:e0:18:43:2b:be
fwe0: Ethernet address: 02:e0:18:43:2b:be
fwohci0: Initiate bus reset
fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
atapci0: port 0x8400-0x847f,0x8000-0x800f,0x8c00-0x8c3f mem 0xfb100000-0xfb11ffff,0xfb125000-0xfb125fff irq 23 at device 4.0 on pci3
atapci0: failed: rid 0x20 is memory, requested 4
atapci0: Reserved 0x20000 bytes for rid 0x20 type 3 at 0xfb100000
atapci0: Reserved 0x1000 bytes for rid 0x1c type 3 at 0xfb125000
ata2: at 0xfb125000 on atapci0
ata3: at 0xfb125000 on atapci0
ata4: at 0xfb125000 on atapci0
ahc0: port 0x8800-0x88ff mem 0xfb124000-0xfb124fff irq 22 at device 10.0 on pci3
ahc0: Reserved 0x100 bytes for rid 0x10 type 4 at 0x8800
ahc0: [GIANT-LOCKED]
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
pcib4: at device 11.0 on pci3
pci4: on pcib4
pcib4: slot 4 INTA is routed to irq 23
pcib4: slot 5 INTA is routed to irq 20
fxp0: port 0x7000-0x703f mem 0xfb000000-0xfb01ffff,0xfb041000-0xfb041fff irq 23 at device 4.0 on pci4
fxp0: Reserved 0x1000 bytes for rid 0x10 type 3 at 0xfb041000
miibus0: on fxp0
inphy0: on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:02:b3:ce:19:3f
fxp0: [GIANT-LOCKED]
fxp1: port 0x7400-0x743f mem 0xfb020000-0xfb03ffff,0xfb040000-0xfb040fff irq 20 at device 5.0 on pci4
fxp1: Reserved 0x1000 bytes for rid 0x10 type 3 at 0xfb040000
miibus1: on fxp1
inphy1: on miibus1
inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp1: Ethernet address: 00:02:b3:ce:19:40
fxp1: [GIANT-LOCKED]
ohci0: mem 0xfb127000-0xfb127fff irq 20 at device 12.0 on pci3
ohci0: Reserved 0x1000 bytes for rid 0x10 type 3 at 0xfb127000
ohci0: [GIANT-LOCKED]
usb5: OHCI version 1.0, legacy support
usb5: on ohci0
usb5: USB revision 1.0
uhub5: OPTi OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub5: 2 ports with 2 removable, self powered
isab0: at device 31.0 on pci0
isa0: on isab0
atapci1: port 0xf000-0xf00f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 31.1 on pci0
atapci1: Reserved 0x10 bytes for rid 0x20 type 4 at 0xf000
atapci1: Reserved 0x8 bytes for rid 0x10 type 4 at 0x1f0
atapci1: Reserved 0x1 bytes for rid 0x14 type 4 at 0x3f6
ata0: at 0x1f0 irq 14 on atapci1
atapci1: Reserved 0x8 bytes for rid 0x18 type 4 at 0x170
atapci1: Reserved 0x1 bytes for rid 0x1c type 4 at 0x376
ata1: at 0x170 irq 15 on atapci1
atapci2: port 0xd000-0xd00f,0xcc00-0xcc03,0xc800-0xc807,0xc400-0xc403,0xc000-0xc007 irq 18 at device 31.2 on pci0
atapci2: Reserved 0x10 bytes for rid 0x20 type 4 at 0xd000
atapci2: Reserved 0x8 bytes for rid 0x10 type 4 at 0xc000
atapci2: Reserved 0x4 bytes for rid 0x14 type 4 at 0xc400
ata5: at 0xc000 on atapci2
atapci2: Reserved 0x8 bytes for rid 0x18 type 4 at 0xc800
atapci2: Reserved 0x4 bytes for rid 0x1c type 4 at 0xcc00
ata6: at 0xc800 on atapci2
ichsmb0: port 0x500-0x51f irq 17 at device 31.3 on pci0
ichsmb0: Reserved 0x20 bytes for rid 0x20 type 4 at 0x500
ichsmb0: [GIANT-LOCKED]
smbus0: on ichsmb0
smb0: on smbus0
pcm0: port 0xdc00-0xdc3f,0xd800-0xd8ff mem 0xfc102000-0xfc1020ff,0xfc101000-0xfc1011ff irq 17 at device 31.5 on pci0
pcm0: Reserved 0x200 bytes for rid 0x18 type 3 at 0xfc101000
pcm0: Reserved 0x100 bytes for rid 0x1c type 3 at 0xfc102000
pcm0: [GIANT-LOCKED]
pcm0:
fdc0: port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on acpi0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0 port 0x3f8-0x3ff irq 4 on acpi0
sio0: type 16550A
sio1 port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0 port 0x778-0x77b,0x378-0x37f irq 7 drq 3 on acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/9 bytes threshold
ppbus0: on ppc0
ppbus0: IEEE1284 device found
Probing for PnP devices on ppbus0:
plip0: on ppbus0
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
atkbdc0: port 0x64,0x60 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
psm0: irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: model IntelliMouse, device ID 3
orm0: