From owner-freebsd-stable@FreeBSD.ORG Fri Mar 26 01:19:50 2004 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id BDB9016A4CE for ; Fri, 26 Mar 2004 01:19:50 -0800 (PST) Received: from sparcs.kaist.ac.kr (sparcs.kaist.ac.kr [143.248.234.102]) by mx1.FreeBSD.org (Postfix) with ESMTP id 2AB8143D41 for ; Fri, 26 Mar 2004 01:19:50 -0800 (PST) (envelope-from netj@sparcs.kaist.ac.kr) Received: from localhost ([127.0.0.1] helo=sab.mazic.org) by sparcs.kaist.ac.kr with esmtp (Exim 3.35 #1 (Debian)) id 1B6nVF-0002dt-00 for ; Fri, 26 Mar 2004 18:19:49 +0900 Received: from netj by sab.mazic.org with local (Exim 4.24) id HV6F90-0001B4-KU for freebsd-stable@freebsd.org; Fri, 26 Mar 2004 18:19:48 +0900 Date: Fri, 26 Mar 2004 18:19:48 +0900 From: Jaeho Shin To: freebsd-stable@freebsd.org Message-ID: <20040326091948.GA2948@sab.mazic.org> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="Y5rl02BVI9TCfPar" Content-Disposition: inline User-Agent: Mutt/1.4.1i Organization: SPARCS, KAIST X-Content-Filtered-By: Mailman/MimeDel 2.1.1 Subject: "panic: ffs_blkfree: bad size" with HPT374 IDE RAID X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 26 Mar 2004 09:19:51 -0000 --Y5rl02BVI9TCfPar Content-Type: multipart/mixed; boundary="gatW/ieO32f1wygP" Content-Disposition: inline --gatW/ieO32f1wygP Content-Type: text/plain; charset=euc-kr Content-Disposition: inline Content-Transfer-Encoding: quoted-printable I'm getting unpredictable kernel panics from a 4.9-STABLE box. I use a RAID5 volume consisting of 8 disks, which are connected to a HPT374 IDE RAID controller. This box is a public FTP/Rsync server whose files are all stored in da0. There are read I/O from da0 for most of the time, and some write I/O occur when synchronizing files from other mirrors. I have experienced about 3 times of kernel panic for the last five days. I could only capture the coredump of which occurred last night. I couldn't find how to reproduce this panic yet, but the only possible cause seems to be some heavy write I/O to da0. Here are some info about my machine: hpt3740: port 0xa000-0xa0ff,0xa400-0xa403,0 xa800-0xa807,0xb000-0xb003,0xb400-0xb407 irq 11 at device 12.0 on pci0 hpt3741: port 0x8400-0x84ff,0x8800-0x8803,0 x9000-0x9007,0x9400-0x9403,0x9800-0x9807 irq 11 at device 12.1 on pci0 =2E.. da0 at hpt3740 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-0 device=20 da0: 1094341MB (2241210496 512 byte sectors: 255H 63S/T 8436C) $ uname -a FreeBSD XXXX 4.9-STABLE FreeBSD 4.9-STABLE #8: Thu Mar 25 04:24:56 KST 2004= root@XXXX:/usr/obj/usr/src/sys/EGO i386 $ disklabel /dev/da0s1e # /dev/da0s1e: type: SCSI disk: da0s1 label: flags: bytes/sector: 512 sectors/track: 63 tracks/cylinder: 255 sectors/cylinder: 16065 cylinders: 139508 sectors/unit: 2241210496 rpm: 3600 interleave: 1 trackskew: 0 cylinderskew: 0 headswitch: 0 # milliseconds track-to-track seek: 0 # milliseconds drivedata: 0 8 partitions: # size offset fstype [fsize bsize bps/cpg] c: 2241210496 0 unused 0 0 # (Cyl. = 0 - 139508*) e: 2241210496 0 4.2BSD 2048 16384 89 # (Cyl. = 0 - 139508*) $ tunefs -p /dev/da0s1e tunefs: soft updates: (-n) enabled tunefs: maximum contiguous block count: (-a) 7 tunefs: rotational delay between contiguous blocks: (-d) 0 ms tunefs: maximum blocks per file in a cylinder group: (-e) 4096 tunefs: average file size: (-f) 16384 tunefs: average number of files in a directory: (-s) 64 tunefs: minimum percentage of free space: (-m) 0% tunefs: optimization preference: (-o) space Now I have disabled softupdates to check whether it could be a problem. This box ran 4-STABLE for over 3 years with no problem until the RAID controller was added. It seems there was a similar problem last November: See http://docs.freebsd.org/cgi/getmsg.cgi?fetch=3D106291+0+/usr/local/www/db/t= ext/2003/freebsd-stable/20031123.freebsd-stable A short kgdb log is attached, so if any kernel hacker can help me solving this problem, please tell me what to do. Thank you. --=20 =BD=C5=C0=E7=C8=A3 | Jaeho Shin | http://netj.org/ System Programmers' Association for Researching Computer Systems Division of Computer Science, Department of EECS, KAIST --gatW/ieO32f1wygP-- --Y5rl02BVI9TCfPar Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.2 (Cygwin) iD8DBQFAY/W0eGASkZ411HcRArxUAJ0WRPkGqzsEhixu1O4rkUKR+mhVNwCeIooD DBLAlZsaxH80PD5oO0QbBjw= =mI1p -----END PGP SIGNATURE----- --Y5rl02BVI9TCfPar--