From owner-freebsd-questions Mon Jul 31 15:43:40 2000 Delivered-To: freebsd-questions@freebsd.org Received: from relay.lppi.com (relay.lppi.com [208.49.169.66]) by hub.freebsd.org (Postfix) with ESMTP id EFF2B37B726 for ; Mon, 31 Jul 2000 15:43:35 -0700 (PDT) (envelope-from lew@lppi.com) Date: Mon, 31 Jul 2000 15:43:34 -0700 (PDT) Message-Id: <200007312243.PAA09543@relay.lppi.com> From: Lew Payne To: freebsd-questions@FreeBSD.ORG Subject: Soft Errors Reading EIDE ccd Cluster. MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Mailer: Becky! ver 1.23 Sender: owner-freebsd-questions@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG One of our new FreeBSD 3.5-REL systems is periodically locking up, due to an apparent disk error. These are brand-new IBM 7200 RPM 60 GB ATA/66 EIDE drives, in a ccd configuration as follows: Filesystem 1K-blocks Used Avail Capacity Mounted on /dev/da0s1a 79359 20357 52654 28% / /dev/da0s1f 8025325 1023010 6360289 14% /usr /dev/da0s1e 119055 3401 106130 3% /var procfs 4 4 0 100% /proc /dev/ccd0c 239854317 127105965 93560007 58% /spool cat /etc/ccd.conf # ccd ileave flags component devices ccd0 128 none /dev/wd0s1e /dev/wd2s1e /dev/wd1s1e /dev/wd3s1e and the error message (which repeats indefinately) is: Jul 31 01:02:06 news /kernel: wd0s1e: soft error reading fsbn 81057521 of 81057520-81057551 (wd0s1 bn 81057521; cn 64331 tn 7 sn 20) (status 58 error 1) Jul 31 01:02:16 news /kernel: wd0s1e: soft error reading fsbn 81057521 of 81057520-81057551 (wd0s1 bn 81057521; cn 64331 tn 7 sn 20) (status 58 error 1) The system just gets stuck doing this seek over and over again, at which point it becomes impossible to log in via the console, or do anything else (I/O bound). Is there a trick to getting soft-recovery working with EIDE devices? Better yet, how can I get rid of this problem without getting rid of the new drives? It seems that it sits there trying to recover from this "soft error" but never does, and never maps it in the replacement block table as "bad". Is there a way to "reformat" the drive (low level, perhaps) so that it maps out the appropriate replacement table for the darn fsbn? Or how about a way of adding bad blocks to it? Any help in solving this would be appreciated !! Regards, Lew Payne --- Lew Payne Publishing, Inc. Dunn & Bradstreet listed 994 San Antonio Road DUNS # 055037852 Palo Alto, CA 94303 To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-questions" in the body of the message