From owner-freebsd-hardware@FreeBSD.ORG Tue Jul 19 21:39:03 2011 Return-Path: Delivered-To: freebsd-hardware@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D8004106564A; Tue, 19 Jul 2011 21:39:03 +0000 (UTC) (envelope-from lev@FreeBSD.org) Received: from onlyone.friendlyhosting.spb.ru (onlyone.friendlyhosting.spb.ru [IPv6:2a01:4f8:131:60a2::2]) by mx1.freebsd.org (Postfix) with ESMTP id 330628FC16; Tue, 19 Jul 2011 21:39:02 +0000 (UTC) Received: from lion.home.serebryakov.spb.ru (unknown [IPv6:2001:470:923f:1:d4eb:36bb:6cb8:7401]) (Authenticated sender: lev@serebryakov.spb.ru) by onlyone.friendlyhosting.spb.ru (Postfix) with ESMTPA id B45F14AC1C; Wed, 20 Jul 2011 01:38:59 +0400 (MSD) Date: Wed, 20 Jul 2011 01:38:56 +0400 From: Lev Serebryakov Organization: FreeBSD X-Priority: 3 (Normal) Message-ID: <1981757790.20110720013856@serebryakov.spb.ru> To: freebsd-hardware@freebsd.org MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="----------968014E387BE7D2" X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Alexander Motin Subject: ahci.ko / geom_mirror / zfs hangs up system when one of HDDs fauilts. X-BeenThere: freebsd-hardware@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: lev@FreeBSD.org List-Id: General discussion of FreeBSD hardware List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 19 Jul 2011 21:39:03 -0000 ------------968014E387BE7D2 Content-Type: text/plain; charset=windows-1251 Content-Transfer-Encoding: quoted-printable Hello, Freebsd-hardware. I've have two identical live locks when HDD becomes broken on 8.2-STABLE system with two SATA HDDs withgmirror and ZFS on them. It is Hetzner-based server, so only access I have is LARA console, but symptoms are identical in both cases: HDD becomes bad, ahci.ko complains about timeouts, and after that server stops to respond on high-level access attempts (ssh/HTTP/SMTP), but can be pinged both with IPv4 and IPv6 addresses. HDDs are identical, and they are splitted into several (BSD)partions. Some partitions are mirrired with geom_mirror and one pair of partitions are added to (mirrored) ZFS pool like this (I proved output on rebooted one-HDD-only system, but, I think, it is clear how it looks when both HDDs are Ok): =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D # gmirror status onlyone# gmirror status Name Status Components mirror/root DEGRADED ada0s1a mirror/var DEGRADED ada0s1d mirror/tmp DEGRADED ada0s1e mirror/usr DEGRADED ada0s1f mirror/databases DEGRADED ada0s1g # zpool status pool: pool state: DEGRADED status: One or more devices could not be opened. Sufficient replicas exist= for the pool to continue functioning in a degraded state. action: Attach the missing device and online it using 'zpool online'. see: http://www.sun.com/msg/ZFS-8000-2Q scrub: none requested config: NAME STATE READ WRITE CKSUM pool DEGRADED 0 0 0 mirror DEGRADED 0 0 0 ada0s1h ONLINE 0 0 0 ada0s1h UNAVAIL 0 0 0 cannot open errors: No known data errors =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Screenshot of LARA console in such case is attached. --=20 // Black Lion AKA Lev Serebryakov ------------968014E387BE7D2--