Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 23 Feb 2018 15:11:21 +0100
From:      Andrea Venturoli <ml@netfence.it>
To:        freebsd-questions@freebsd.org
Subject:   11.1/amd64 hangs on boot
Message-ID:  <854f2a4e-1264-7c7f-60c4-7f65efa290f7@netfence.it>

next in thread | raw e-mail | index | archive | help
Hello.

I've had this machine for eons and it went through almost every FreeBSD 
version in the last years without a hitch.

After upgrading to 11.1, however, it will often (not always) hang on boot.

Some extract from dmesg (I think these are the relevant parts):

> ...
> FreeBSD 11.1-RELEASE-p6 #1 r327994M: Mon Jan 15 09:50:57 CET 2018
>     root@alamar.ventu:/usr/obj/usr/src/sys/ALAMAR amd64
> FreeBSD clang version 4.0.0 (tags/RELEASE_400/final 297347) (based on LLVM 4.0.0)
> ...
> ahci0: <AMD SB7x0/SB8x0/SB9x0 AHCI SATA controller> port 0xa000-0xa007,0x9000-0x9003,0x8000-0x8007,0x7000-0x7003,0x6000-0x600f mem 0xfe6ffc00-0xfe6fffff irq 22 at device 17.0 on pci0
> ahci0: AHCI v1.10 with 6 3Gbps ports, Port Multiplier supported
> ahci0: quirks=0x22000<ATI_PMP_BUG,1MSI>
> ahcich0: <AHCI channel> at channel 0 on ahci0
> ahcich1: <AHCI channel> at channel 1 on ahci0
> ahcich2: <AHCI channel> at channel 2 on ahci0
> ahcich3: <AHCI channel> at channel 3 on ahci0
> ahcich4: <AHCI channel> at channel 4 on ahci0
> ahcich5: <AHCI channel> at channel 5 on ahci0
> ohci0: <AMD SB7x0/SB8x0/SB9x0 USB controller> mem 0xfe6fe000-0xfe6fefff irq 16 at device 18.0 on pci0
> usbus0 on ohci0
> usbus0: 12Mbps Full Speed USB v1.0
> ohci1: <AMD SB7x0/SB8x0/SB9x0 USB controller> mem 0xfe6fd000-0xfe6fdfff irq 16 at device 18.1 on pci0
> usbus1 on ohci1
> usbus1: 12Mbps Full Speed USB v1.0
> ehci0: <AMD SB7x0/SB8x0/SB9x0 USB 2.0 controller> mem 0xfe6ff800-0xfe6ff8ff irq 17 at device 18.2 on pci0
> usbus2: EHCI version 1.0
> usbus2 on ehci0
> usbus2: 480Mbps High Speed USB v2.0
> ohci2: <AMD SB7x0/SB8x0/SB9x0 USB controller> mem 0xfe6fc000-0xfe6fcfff irq 18 at device 19.0 on pci0
> usbus3 on ohci2
> usbus3: 12Mbps Full Speed USB v1.0
> ohci3: <AMD SB7x0/SB8x0/SB9x0 USB controller> mem 0xfe6f7000-0xfe6f7fff irq 18 at device 19.1 on pci0
> usbus4 on ohci3
> usbus4: 12Mbps Full Speed USB v1.0
> ehci1: <AMD SB7x0/SB8x0/SB9x0 USB 2.0 controller> mem 0xfe6ff400-0xfe6ff4ff irq 19 at device 19.2 on pci0
> usbus5: EHCI version 1.0
> usbus5 on ehci1
> usbus5: 480Mbps High Speed USB v2.0
> atapci0: <ATI IXP700/800 UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xff00-0xff0f at device 20.1 on pci0
> ata1: <ATA channel> at channel 1 on atapci0
> ...
> ahd0: <Adaptec 29320 Ultra320 SCSI adapter> port 0xe800-0xe8ff,0xe400-0xe4ff mem 0xfebfe000-0xfebfffff irq 20 at device 5.0 on pci3
> aic7902: Ultra320 Wide Channel A, SCSI Id=7, PCI 33 or 66MHz, 512 SCBs
> ahd1: <Adaptec 29320 Ultra320 SCSI adapter> port 0xe000-0xe0ff,0xd800-0xd8ff mem 0xfebfc000-0xfebfdfff irq 21 at device 5.1 on pci3
> aic7902: Ultra320 Wide Channel B, SCSI Id=7, PCI 33 or 66MHz, 512 SCBs
> ...
> ugen4.1: <ATI OHCI root HUB> at usbus4
> ugen1.1: <ATI OHCI root HUB> at usbus1
> ugen0.1: <ATI OHCI root HUB> at usbus0
> ugen6.1: <ATI OHCI root HUB> at usbus6
> uhub0: <ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus4
> uhub1: <ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus1
> uhub2: <ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus6
> uhub3: ugen3.1: <ATI OHCI root HUB> at usbus3
> <ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus0
> uhub4: <ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus3
> ugen5.1: <ATI EHCI root HUB> at usbus5
> ugen2.1: <ATI EHCI root HUB> at usbus2
> uhub5: <ATI EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus5
> uhub6: <ATI EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus2
> uhub2: 2 ports with 2 removable, self powered
> uhub0: 3 ports with 3 removable, self powered
> uhub1: 3 ports with 3 removable, self powered
> uhub3: 3 ports with 3 removable, self powered
> uhub4: 3 ports with 3 removable, self powered
> uhub5: 6 ports with 6 removable, self powered
> uhub6: 6 ports with 6 removable, self powered
> ...
> ugen5.2: <Generic Mass Storage Device> at usbus5
> umass0 on uhub5
> umass0: <Generic Mass Storage Device, class 0/0, rev 2.00/1.29, addr 2> on usbus5
> umass0:  SCSI over Bulk-Only; quirks = 0x4001
> umass0:9:0: Attached to scbus9
> ...
> da1 at ahd1 bus 0 scbus8 target 0 lun 0
> da1: <HITACHI HUS103014FL3600 SA1B> Fixed Direct Access SCSI-3 device
> da1: Serial Number V5X9S3AA0051P65D0441
> da1: 320.000MB/s transfers (160.000MHz DT, offset 80, 16bit)
> da1: Command Queueing enabled
> da1: 140205MB (287140277 512 byte sectors)
> da0 at ahd0 bus 0 scbus7 target 4 lun 0
> da0: <IOMEGA ZIP 100 E.08> Removable Direct Access SCSI-2 device
> da0: 3.300MB/s transfers
> da0: Attempt to query device size failed: NOT READY, Medium not present
> da2 at ahd1 bus 0 scbus8 target 2 lun 0
> da2: <HITACHI HUS103014FL3600 SA1B> Fixed Direct Access SCSI-3 device
> da2: Serial Number V5XABRAA0051P65L0026
> da2: 320.000MB/s transfers (160.000MHz DT, offset 80, 16bit)
> da2: Command Queueing enabled
> da2: 140205MB (287140277 512 byte sectors)
> cd0 at ahcich2 bus 0 scbus2 target 0 lun 0
> cd0: <HL-DT-ST BD-RE  BH16NS40 1.03> Removable CD-ROM SCSI device
> cd0: Serial Number K9HE8TD3540
> cd0: 150.000MB/s transfers (SATA 1.x, UDMA5, ATAPI 12bytes, PIO 8192bytes)
> cd0: Attempt to query device size failed: NOT READY, Medium not present - tray closed
> cd1 at ata1 bus 0 scbus6 target 0 lun 0
> cd1: <BENQ DVD DC DW1670 103> Removable CD-ROM SCSI device
> cd1: 66.700MB/s transfers (UDMA4, ATAPI 12bytes, PIO 65534bytes)
> cd1: Attempt to query device size failed: NOT READY, Medium not present
> da3 at umass-sim0 bus 0 scbus9 target 0 lun 0
> da3: <Generic USB SD Reader 1.00> Removable Direct Access SCSI device
> da3: Serial Number 058F312D81B
> da3: 40.000MB/s transfers
> da3: Attempt to query device size failed: NOT READY, Medium not present
> da3: quirks=0x2<NO_6_BYTE>
> Trying to mount root from ufs:/dev/mirror/gm1a [rw]...
> da4 at umass-sim0 bus 0 scbus9 target 0 lun 1
> da4: <Generic USB CF Reader 1.01> Removable Direct Access SCSI device
> da4: Serial Number 058F312D81B
> da4: 40.000MB/s transfers
> da4: Attempt to query device size failed: NOT READY, Medium not present
> da4: quirks=0x2<NO_6_BYTE>
> da5 at umass-sim0 bus 0 scbus9 target 0 lun 2
> da5: <Generic USB SM Reader 1.02> Removable Direct Access SCSI device
> da5: Serial Number 058F312D81B
> da5: 40.000MB/s transfers
> da5: Attempt to query device size failed: NOT READY, Medium not present
> da5: quirks=0x2<NO_6_BYTE>
> da6 at umass-sim0 bus 0 scbus9 target 0 lun 3
> da6: <Generic USB MS Reader 1.03> Removable Direct Access SCSI device
> da6: Serial Number 058F312D81B
> da6: 40.000MB/s transfers
> da6: Attempt to query device size failed: NOT READY, Medium not present
> da6: quirks=0x2<NO_6_BYTE>

The last part is the most relevant: starting from "da1:" onwards, the 
order of appearance of daX and cdX is not always the same.
Often, after a few of daX and/or cdX are output, the machine hangs, the 
SCSI controller light staying on: sometimes it will spontaneusly reboot 
after several seconds, sometimes I'll have to press the reset button.

There's no rules to how many times I have to try booting every morning: 
some days it starts right the first time, some days I have to reboot 
several times until it gets it right and goes ahead.

While it might be an hardware problem (being the box quite old), the 
fact that it started doing this after a 10.3->11.1 upgrade is suspicious.
Is there a known race-condition/LOR/whatever? Is there a way to get 
better info (e.g. some option to put in the kernel config or in 
/boot/loader.conf)?
Should I enable DDB, INVARIANTS, WITNESS & co.? I'd like to avoid 
breaking performance, but I'll do if it's the only way.

  bye & Thanks
	av.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?854f2a4e-1264-7c7f-60c4-7f65efa290f7>