From owner-freebsd-questions@FreeBSD.ORG Fri May 29 20:59:41 2009 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C5102106566C for ; Fri, 29 May 2009 20:59:41 +0000 (UTC) (envelope-from claudiu.vasadi@gmail.com) Received: from mail-ew0-f164.google.com (mail-ew0-f164.google.com [209.85.219.164]) by mx1.freebsd.org (Postfix) with ESMTP id 3F4558FC1D for ; Fri, 29 May 2009 20:59:40 +0000 (UTC) (envelope-from claudiu.vasadi@gmail.com) Received: by ewy8 with SMTP id 8so3073660ewy.43 for ; Fri, 29 May 2009 13:59:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=yRCkoYzueG5pmuQn8nNKyN4SwrX0hzqZ9+9vzmB15Hc=; b=HN0lkJTfKCBRDLdhG29YtIB78ZoeiW/zlWVCpOpNjsLa+TqHiJBPiYJy18NfrchgAf x5vDwoWGSre9Xm09dHT8f2B+almoPeCjUs3KK9z5aiHyV6m/31F3kQ1BVpFbcpdWRiJX SN2Y/dWbU5+stkjsJ3mwtHItpa0DJcIqnbPEg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=hqTT+CDjOVPSF5FsOy9YYcOBbGps7ET7tEUzQVg02N0Mrlnogf5pzIQM5/DASOC6+q ddK8ogd/3FtfKdWw4pBLlt45BleZ/4w3hF8uT/D17xB/gECPEcRj+nUeG8TnMc0LTwCy IUkYf009pkhbjd13c+qytLT3PAPC+4O34C2cc= MIME-Version: 1.0 Received: by 10.216.30.71 with SMTP id j49mr1094545wea.89.1243628943266; Fri, 29 May 2009 13:29:03 -0700 (PDT) Date: Fri, 29 May 2009 22:29:03 +0200 Message-ID: <4f760c6a0905291329w31d0df93he5aa004a3ab9bd87@mail.gmail.com> From: claudiu vasadi To: freebsd-questions@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: system reboot because of hdd X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 29 May 2009 20:59:42 -0000 Hello ppl. Straight to business. FreeBSD 7.1-stable 2 hdd. 1 is ad2 and the other is ad6. ad2 is the BSD hdd, and ad6 is just for data (movies, music, etc). ad2 is a 80GB Samsung P-ata133 and ad6 is a WD 250GB S-ata2. While running a process that was trying to create a 25GB file on a 30 GB partition on the second hdd (ad2) I experienced ssh outage. Everything came back to life after a short perioud of ~2 minutes. So, again I started the process. This time, the outage was about 5 minutes. I was busy with something else and did not run the process again. 2 minutes after that i get a call from a customer that some thing is not working. so I check it and surprize, the OS rebooted itself. so, went to the logs and this is what i found out (/var/log/messages): May 29 22:26:30 da1 kernel: ad6: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=419468447 May 29 22:26:35 da1 kernel: ad6: TIMEOUT - WRITE_DMA48 retrying (0 retries left) LBA=419468447 May 29 22:26:41 da1 kernel: ad6: FAILURE - WRITE_DMA48 timed out LBA=419468447 May 29 22:26:41 da1 kernel: g_vfs_done():ad6s1f[WRITE(offset=19447808, length=16384)]error = 5 May 29 22:26:35 da1 syslogd: kernel boot file is /boot/kernel/kernel May 29 22:26:35 da1 kernel: ad6: FAILURE - device detached May 29 22:26:35 da1 kernel: subdisk6: detached May 29 22:26:35 da1 kernel: ad6: detached May 29 22:26:35 da1 kernel: g_vfs_done():ad6s1f[WRITE(offset=36683776, length=16384)]error = 6 May 29 22:26:35 da1 kernel: g_vfs_done():ad6s1f[WRITE(offset=16908288, length=16384)]error = 6 May 29 22:26:35 da1 kernel: g_vfs_done():ad6s1f[WRITE(offset=36700160, length=16384)]error = 6 May 29 22:26:35 da1 kernel: g_vfs_done():ad6s1f[WRITE(offset=114688, length=16384)]error = 6 May 29 22:26:35 da1 kernel: panic: vinvalbuf: dirty bufs May 29 22:26:35 da1 kernel: cpuid = 0 May 29 22:26:35 da1 kernel: Uptime: 45d22h15m29s May 29 22:26:35 da1 kernel: Physical memory: 1003 MB May 29 22:26:35 da1 kernel: Dumping 232 MB: 217 201 185 169 153 137 121 105 89 73 57 41 25 9 and (/var/log/all.log): May 29 22:54:49 da1 fsck: /dev/ad6s1f: 6 files, 12 used, 17132271 free (31 frags, 2141530 blocks, 0.0% fragmentation) exacly where the file was created. but it was 1 not 6 files that i wanted to create but 1. the process that I run is "dsmfmt" of TSM server for Sun. it creates a file volume of a specific size for use in tsm server itself for defining storage pool capacity. so, I know that the hdd was to the limit. It could be a hardware issue I know, but right now dnt have resources to try somewere else so I'm asking a oppinion. Has anyone dealt with this situation before ? OS reboot because of high hdd load ?