Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 18 Jun 2010 09:51:01 +0200
From:      Pieter de Boer <pieter@thelostparadise.com>
To:        Matthew Lear <matt@bubblegen.co.uk>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: 7.2-RELEASE-p4, IO errors & RAID1 failure
Message-ID:  <4C1B2565.9010509@thelostparadise.com>
In-Reply-To: <1276844904.7519.19.camel@almscliff.bubblegen.co.uk>
References:  <1276844904.7519.19.camel@almscliff.bubblegen.co.uk>

next in thread | previous in thread | raw e-mail | index | archive | help
Hi Matthew,

> I'm running 7.2-RELEASE-p4 on an i386 HP server (ML G5) in RAID1
> configuration. Very recently, I've seen IO errors such as:
>
> ad0: TIMEOUT - READ_DMA retrying (1 retry left) LBA=20472527
>
> reported and the RAID mirror is now offline.
>
> ad0: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=395032335
> ad0: FAILURE - WRITE_DMA48 status=51<READY,DSC,ERROR>
> error=10<NID_NOT_FOUND>  LBA=395032335
> ar0: WARNING - mirror protection lost. RAID1 array in DEGRADED mode
>
I had more or less the same timeout issues on my 8.0-RELEASE box on a 
Dell R300 with SATA disks. What I did was raise the ata timeout from 5 
seconds to 20. I did this by patching the kernel code while running, but 
I'm not sure you'd like that approach ;)

In http://www.freebsd.org/cgi/query-pr.cgi?pr=111023 a patch is 
presented that raises the timeouts by patching a few ATA kernel source 
files. This has been committed to RELENG_7 as well, so by upgrading your 
7.2-install to the latest RELENG_7 (or RELENG_8), you'll have that 
timeout fix.

Why ATA commands can take longer than 5 seconds although the disks 
appear to be fine.. wouldn't know ..

-- 
Pieter






Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4C1B2565.9010509>