From owner-freebsd-questions@FreeBSD.ORG Sun Mar 16 06:22:24 2014 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id B0AD5978 for ; Sun, 16 Mar 2014 06:22:24 +0000 (UTC) Received: from alogt.com (alogt.com [69.36.191.58]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id 85CB239C for ; Sun, 16 Mar 2014 06:22:24 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=alogt.com; s=default; h=Content-Transfer-Encoding:Content-Type:MIME-Version:References:In-Reply-To:Message-ID:Subject:Cc:To:From:Date; bh=DYpPazUoBrnQ/p3khe4NrSybC6JeyqbowgicKyffSVs=; b=Sz0wgnaYir+FyHPu1hV9nlQ2BwPOQy3mmQfXYDjsNp9/XewLXmyCvFgWnq5QGW546QDnyGUE+omR3i0FWW/E4LcNevRYudbZ6xt4SX0oyKmW7sCHtyzjsAOLTAqlcoq54aBbP/CWV2rBVpRP9+QW60RBrcA6deZ0d5eLW1s5x9A=; Received: from [182.12.48.234] (port=30294 helo=X220.alogt.com) by sl-508-2.slc.westdc.net with esmtpsa (SSLv3:DHE-RSA-AES128-SHA:128) (Exim 4.82) (envelope-from ) id 1WP4Sk-000yhK-H8; Sun, 16 Mar 2014 00:22:23 -0600 Date: Sun, 16 Mar 2014 14:22:13 +0800 From: Erich Dollansky To: cruxpot Subject: Re: Another case of the vanishing disk Message-ID: <20140316142213.459009dc@X220.alogt.com> In-Reply-To: References: <20140316130936.3f2d18e0@X220.alogt.com> <20140316134309.2edc258a@X220.alogt.com> Organization: ALO Green Technologies X-Mailer: Claws Mail 3.9.3 (GTK+ 2.24.22; amd64-portbld-freebsd10.0) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - sl-508-2.slc.westdc.net X-AntiAbuse: Original Domain - freebsd.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - alogt.com X-Get-Message-Sender-Via: sl-508-2.slc.westdc.net: authenticated_id: erich@alogt.com X-Source: X-Source-Args: X-Source-Dir: Cc: freebsd-questions@freebsd.org X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.17 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 16 Mar 2014 06:22:24 -0000 Hi, On Sun, 16 Mar 2014 01:04:05 -0500 cruxpot wrote: > Back in December, it was the power supply. That was a cheap Rosewill > 300W PSU. The new is a Corsair CX500 (500W). The system basically just > has an old SCSI card and 4 Green Barracuda 2TB disks and a low end > pci-e video card and pci-e gigabit NIC. How can the PSU be the problem > since I replaced it and it's more than adequate? the power supply has to regulate the supplied voltages withing a given range. If this does not work, drives tend to have problems. Your problem will be that you do not have the tools to check for this. The problem is that it is a rare thing. It is as rare that four drives go together. Can you run the machine with another power supply to test? Store the SMART values of each disk when you start the test and compare after some time. Erich > > On Sun, Mar 16, 2014 at 12:43 AM, Erich Dollansky > wrote: > > Hi, > > > > On Sun, 16 Mar 2014 00:28:31 -0500 > > cruxpot wrote: > > > >> All four disks have similar smartctl stats as far as those alarms > >> go. Are you trying to tell me that all four of my disks are about > >> to die? The sudden crashes have already been happening. > > > > it also could a problem with the motherboard or power supply. It is > > only hard to believe that a problem from the motherboard affects raw > > error rate. It is a bit more likely that your power supply is just > > on its limits and small drops in the 5/12V supply lines cause the > > problem. > > > > Erich > >> > >> On Sun, Mar 16, 2014 at 12:09 AM, Erich Dollansky > >> wrote: > >> > Hi, > >> > > >> > get a new disk as fast as possible. > >> > > >> > On Sat, 15 Mar 2014 23:48:58 -0500 > >> > cruxpot wrote: > >> > > >> >> messages:Mar 13 03:03:11 bsdbox kernel: ata4: port is not ready > >> >> (timeout 15000ms) tfd = 0000ffff > >> > > >> > First alarm bell is on. > >> > > >> >> UPDATED WHEN_FAILED RAW_VALUE > >> >> 1 Raw_Read_Error_Rate 0x000f 100 099 006 Pre-fail > >> >> Always - 1476032 > >> > > >> > Second alarm bell. > >> > > >> >> 7 Seek_Error_Rate 0x000f 078 060 030 Pre-fail > >> >> Always - 64570250 > >> > > >> > Third alarm bell. > >> > > >> >> 9 Power_On_Hours 0x0032 077 077 000 Old_age > >> >> Always - 20524 > >> > > >> > Warranty should be still on then. > >> > > >> >> 188 Command_Timeout 0x0032 100 097 000 Old_age > >> >> Always - 50 > >> > > >> > Fourth alarm bell. > >> > > >> >> 195 Hardware_ECC_Recovered 0x001a 037 004 000 Old_age > >> >> Always - 1476032 > >> > > >> > I think I cannot count that far. > >> > > >> > A disk with raw errors is not dead yet but it is a clear sign > >> > that something is wrong. Be prepared for a sudden crash. > >> > > >> > Erich > >> _______________________________________________ > >> freebsd-questions@freebsd.org mailing list > >> http://lists.freebsd.org/mailman/listinfo/freebsd-questions > >> To unsubscribe, send any mail to > >> "freebsd-questions-unsubscribe@freebsd.org" > > > _______________________________________________ > freebsd-questions@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > To unsubscribe, send any mail to > "freebsd-questions-unsubscribe@freebsd.org"