From owner-freebsd-stable@FreeBSD.ORG Tue Aug 5 16:24:57 2008 Return-Path: Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 323E91065672 for ; Tue, 5 Aug 2008 16:24:57 +0000 (UTC) (envelope-from scf@FreeBSD.org) Received: from mail.farley.org (farley.org [67.64.95.201]) by mx1.freebsd.org (Postfix) with ESMTP id F39408FC12 for ; Tue, 5 Aug 2008 16:24:56 +0000 (UTC) (envelope-from scf@FreeBSD.org) Received: from thor.farley.org (HPooka@thor.farley.org [192.168.1.5]) by mail.farley.org (8.14.3/8.14.3) with ESMTP id m75FjGNG041188 for ; Tue, 5 Aug 2008 10:45:16 -0500 (CDT) (envelope-from scf@FreeBSD.org) Date: Tue, 5 Aug 2008 10:45:16 -0500 (CDT) From: "Sean C. Farley" To: freebsd-stable@FreeBSD.org Message-ID: User-Agent: Alpine 1.10 (BSF 962 2008-03-14) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; format=flowed; charset=US-ASCII X-Spam-Status: No, score=-4.4 required=3.0 tests=ALL_TRUSTED,BAYES_00 autolearn=ham version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on mail.farley.org Cc: Subject: Stuck in geli X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 05 Aug 2008 16:24:57 -0000 Rarely, a geli partition I have freezes a process in bufwait state. It occurs after an ATA timeout message: Aug 5 03:47:13 thor kernel: ad10: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=219028637 The geli partition resides on an Intel MatrixRAID RAID1 mirror using the ICH9R chipset (Asus P5K-E/WIFI). Killing (even -9) the process does not work. Rebooting is the only solution, yet the system is unable to flush the buffers and complete a clean unmounting. Both drives in the mirror have both survived a smartctl -t offline scan. Also, a previous time it was with ad12, so I strongly doubt it is the drive. It seems like a geli partition is unable to handle a timeout from a drive. ad10: Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family Device Model: ST3160827AS ad12: Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family Device Model: ST3160827AS Sean P.S. I could not help myself with the subject line. :) -- scf@FreeBSD.org