Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 24 Oct 2011 20:27:49 +0200
From:      "C. P. Ghost" <cpghost@cordula.ws>
To:        Alexey Shuvaev <shuvaev@physik.uni-wuerzburg.de>
Cc:        freebsd-current@freebsd.org
Subject:   Re: Panics after AHCI timeouts
Message-ID:  <CADGWnjXR%2BX9WaTJLutSxtLmJHuZq6j54wwj4UdZbqgL5fKvmtg@mail.gmail.com>
In-Reply-To: <20111018131353.GA83797@lexx.ifp.tuwien.ac.at>
References:  <20111008201456.GA3529@lexx.ifp.tuwien.ac.at> <20111017190027.GA9873@lexx.ifp.tuwien.ac.at> <CAJ-Vmokbm5z3GPbKjc6_o0_Ea6u_b7twDu=xLeYpORiUpp6Z=Q@mail.gmail.com> <20111018131353.GA83797@lexx.ifp.tuwien.ac.at>

next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, Oct 18, 2011 at 3:13 PM, Alexey Shuvaev
<shuvaev@physik.uni-wuerzburg.de> wrote:
> On Tue, Oct 18, 2011 at 06:19:19AM +0800, Adrian Chadd wrote:
>> On 18 October 2011 03:00, Alexey Shuvaev
>> <shuvaev@physik.uni-wuerzburg.de> wrote:
>> > On Sat, Oct 08, 2011 at 10:14:56PM +0200, Alexey Shuvaev wrote:
>> >> Hello list!
>> >>
>> > Errr... Replying to myself... Ping? Should I file a PR and put it
>> > in the back burner? :)
>>
>> I think filing a PR is a good move. Then just be proactive and poke
>> people about it. It'd be good to get this fixed. :)
>>
> Done, kern/161768.
>
> Question to the list: does anybody see successful recovery from AHCI
> timeout an a recent CURRENT? Recent means June 2011 or newer, so 9.0
> branch counts also. That is, there are some kernel messages like this:
>
> ahcich0: Timeout on slot 29 port 0
> ahcich0: is 00000000 cs 00000000 ss ffffffff rs ffffffff tfd 40 serr 00000000 cmd 0000fc17
>
> but then AHCI recovers and the system does not panic?

I'm seeing these timeouts too on an 8.2-STABLE amd64 r222832
from June 7. The system hangs partially -- or, more precisely, all
processes that attempt to access the disk on this channel hang,
everything else continues as normal.

I suspect a faulty cable, but I don't have physical access to the system
to replace parts right now. A panic would be a regression, so I'm holding
off updates on that server until AHCI becomes more tolerant and somewhat
self-healing. :(

> Poking Alexey.

-cpghost.

-- 
Cordula's Web. http://www.cordula.ws/



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CADGWnjXR%2BX9WaTJLutSxtLmJHuZq6j54wwj4UdZbqgL5fKvmtg>