Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 23 Feb 2010 17:50:18 +0200
From:      Alexander Motin <mav@FreeBSD.org>
To:        George Liaskos <geo.liaskos@gmail.com>
Cc:        Harald Schmalzbauer <h.schmalzbauer@omnilan.de>, freebsd-stable@freebsd.org
Subject:   Re: ahcich timeouts, only with ahci, not with ataahci
Message-ID:  <4B83F93A.4030805@FreeBSD.org>
In-Reply-To: <1266938581.00222692.1266925802@10.7.7.3>
References:  <1266934981.00222684.1266922202@10.7.7.3> <1266938581.00222692.1266925802@10.7.7.3>

next in thread | previous in thread | raw e-mail | index | archive | help
George Liaskos wrote:
> I am having the same issue since Feb 20 which was my last update.

Which version was before?

> ahcich1: Timeout on slot 3
> ahcich1: is 00000000 cs 00000018 ss 00000000 rs 00000018 tfd d0 serr 00000000
> ahcich1: Timeout on slot 5
> ahcich1: is 00000000 cs 00000060 ss 00000000 rs 00000060 tfd d0 serr 00000000
> ahcich1: Timeout on slot 7
> ahcich1: is 00000000 cs 00000180 ss 00000000 rs 00000180 tfd d0 serr 00000000
> ahcich1: Timeout on slot 9
> ahcich1: is 00000000 cs 00000600 ss 00000000 rs 00000600 tfd d0 serr 00000000

Situation looks alike, except it is CD drive and so without NCQ. I am
still don't see bugs from driver side here. Do you see delays before
timeout messages?

You may try to enable verbose kernel messages to get some more info.

>> dmesg | grep ahci
> ahci0: <Intel ICH9M AHCI SATA controller> port
> 0x1c40-0x1c47,0x1834-0x1837,0x1838-0x183f,0x1830-0x1833,0x1c20-0x1c3f
> mem 0xfc226000-0xfc2267ff irq 16 at device 31.2 on pci0
> ahci0: [ITHREAD]
> ahci0: AHCI v1.20 with 4 3Gbps ports, Port Multiplier not supported
> ahcich0: <AHCI channel> at channel 0 on ahci0
> ahcich0: [ITHREAD]
> ahcich1: <AHCI channel> at channel 1 on ahci0
> ahcich1: [ITHREAD]
> ada0 at ahcich0 bus 0 scbus0 target 0 lun 0
> cd0 at ahcich1 bus 0 scbus1 target 0 lun 0

Show this also with verbose.

> On Tue, Feb 23, 2010 at 10:40 AM, Harald Schmalzbauer
> <h.schmalzbauer@omnilan.de> wrote:
>> Hello,
>>
>> I'm frequently getting my machine locked with ahcichX timeouts:
>> ahcich2: Timeout on slot 0
>> ahcich2: is 00000000 cs 00000001 ss 00000000 rs 00000001 tfd c0 serr
>> 00000000
>> ahcich2: Timeout on slot 8
>> ahcich2: is 00000000 cs 00000100 ss 00000000 rs 00000100 tfd c0 serr
>> 00000000
>> ahcich2: Timeout on slot 8
>> ahcich2: is 00000000 cs fffff07f ss ffffff7f rs ffffff7f tfd c0 serr
>> 00000000
>> ...
>>
>> This happens when backup over GbE overloads ZFS/HDD capabilities.
>> I reduced vfs.zfs.txg.timeout to 1 to prevent the machine from locking up
>> almost immediately, but from it still happens.
>> When I don't use ahci but ataahci (the old driver if I understand things
>> correct) I also see the ZFS burst write congestion, but this doesn't lead to
>> controller timeouts, thus blocking the machine.
>>
>> Sometimes the machine recovers from the disk lock, but most often I have to
>> reboot.
>>
>> Kernel is from Feb. 19, so recent ahci improovements are active.
>> Controller is ICH9R with 3 Samsung F3 SpinPoints.
>>
>> Any ideas how to work arround the hangs other than using the old ahci
>> driver?

-- 
Alexander Motin



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4B83F93A.4030805>