From owner-freebsd-current@FreeBSD.ORG Fri Feb 27 01:35:59 2009 Return-Path: Delivered-To: current@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 87F121065672; Fri, 27 Feb 2009 01:35:59 +0000 (UTC) (envelope-from null@pozo.com) Received: from pozo.com (pozo.com [216.101.162.50]) by mx1.freebsd.org (Postfix) with ESMTP id 51ECC8FC08; Fri, 27 Feb 2009 01:35:59 +0000 (UTC) (envelope-from null@pozo.com) Received: from T41p.pozo.com (t41p.pozo.com [192.168.0.4]) (authenticated bits=0) by pozo.com (8.14.3/8.14.3) with ESMTP id n1R1ZmU7001445 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NOT); Thu, 26 Feb 2009 17:35:49 -0800 (PST) (envelope-from null@pozo.com) Message-Id: <200902270135.n1R1ZmU7001445@pozo.com> X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9 Date: Thu, 26 Feb 2009 17:33:09 -0800 To: Alexander Motin From: Manfred Antar In-Reply-To: <49A7107A.5080208@FreeBSD.org> References: <200902250259.n1P2xvYh001449@pozo.com> <1235559230.15704.4.camel@buffy.york.ac.uk> <200902251410.n1PEAla5010605@pozo.com> <1235573928.15704.10.camel@buffy.york.ac.uk> <200902251508.n1PF8ORo001737@pozo.com> <1235576175.15704.15.camel@buffy.york.ac.uk> <200902260208.n1Q280jg001474@pozo.com> <49A7107A.5080208@FreeBSD.org> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" X-Spam-Status: No, score=-101.4 required=5.0 tests=ALL_TRUSTED,MISSING_MID, USER_IN_WHITELIST autolearn=failed version=3.2.5, No X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on pozo.com X-Virus-Scanned: ClamAV version 0.94.2-exp, clamav-milter version 0.94.2-exp on pozo.com X-Virus-Status: Clean X-pozo.com-MailScanner-Information: Please contact the ISP for more information X-MailScanner-ID: n1R1ZmU7001445 X-pozo.com-MailScanner: Found to be clean X-pozo.com-MailScanner-From: null@pozo.com Cc: current@FreeBSD.org Subject: Re: Fatal trap 18 on Current 1386 since Saturday 21 Feb 2009 X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 27 Feb 2009 01:35:59 -0000 At 13:58 2/26/2009, you wrote: >Manfred Antar wrote: >>At 07:36 2/25/2009, Gavin Atkinson wrote: >>>My suspicion is that it is this commit: >>> >>>http://docs.freebsd.org/cgi/getmsg.cgi?fetch=1637857+0+archive/2009/svn-src-all/20090222.svn-src-all >>> >>>Might be worthwhile trying the source from just before, and just after >>>it and seeing if that is the cause. >>Yes that's it I put: >>/sys/dev/ata/ata-all.c >>/sys/dev/ata/ata-all.h >>/sys/dev/ata/ata-pci.h >>/sys/dev/ata/ata-sata.c >>/sys/dev/ata/chipsets/ata-promise.c >>from before the changes on the 21st in current /sys and kernel builds and boots fine >>My original date of the 14th was wrong, I looked at the calendar thought it was the 14th >>but actually uname -a of the last working kernel was the 21st 7am. > >Looking on on your first and later posts I would say that this commit may happen not a root of your problem. It may just trigger something else. > >Problem begins when some of your drives on ata1 channel timeouts request that leads to several controller reinits. I think that durung that driver somehow gets wrong drive parameters that leads to division by zero error inside ata_tf_write(). > >I have just committed one patch that probably does not fix the problem, but change process a bit. Could you try it and send me all ata related verbose messages starting from controller detection. > >-- >Alexander Motin Sure. I only have one Hard Drive and a DVD drive in this computer. atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x24c0-0x24cf irq 10 at device 31.1 on pci0 atapci0: Reserved 0x10 bytes for rid 0x20 type 4 at 0x24c0 ata0: on atapci0 atapci0: Reserved 0x8 bytes for rid 0x10 type 4 at 0x1f0 atapci0: Reserved 0x1 bytes for rid 0x14 type 4 at 0x3f6 ata0: reset tp1 mask=03 ostat0=50 ostat1=00 ata0: stat0=0x50 err=0x01 lsb=0x00 msb=0x00 ata0: stat1=0x00 err=0x01 lsb=0x00 msb=0x00 ata0: reset tp2 stat0=50 stat1=00 devices=0x1 ata0: [MPSAFE] ata0: [ITHREAD] ata1: on atapci0 atapci0: Reserved 0x8 bytes for rid 0x18 type 4 at 0x170 atapci0: Reserved 0x1 bytes for rid 0x1c type 4 at 0x376 ata1: reset tp1 mask=03 ostat0=00 ostat1=50 ata1: stat0=0x7f err=0x01 lsb=0x7f msb=0x7f ata1: stat1=0x90 err=0x01 lsb=0x14 msb=0xeb ata1: stat1=0x90 err=0x01 lsb=0x14 msb=0xeb ata1: stat1=0x00 err=0x01 lsb=0x14 msb=0xeb ata1: reset tp2 stat0=7f stat1=00 devices=0x20001 ata1: [MPSAFE] ata1: [ITHREAD] atapci1: port 0x24f8-0x24ff,0x2810-0x2813,0x2800-0x2807,0x2814-0x2817,0x24d0-0x24df irq 10 at device 31.2 on pci0 atapci1: Reserved 0x10 bytes for rid 0x20 type 4 at 0x24d0 atapci1: [MPSAFE] atapci1: [ITHREAD] ata2: on atapci1 atapci1: Reserved 0x8 bytes for rid 0x10 type 4 at 0x24f8 atapci1: Reserved 0x4 bytes for rid 0x14 type 4 at 0x2810 ata2: reset tp1 mask=01 ostat0=7f ostat1=00 ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: stat0=0x7f err=0xff lsb=0xff msb=0xff ata2: reset tp2 stat0=ff stat1=00 devices=0x0 ata2: [MPSAFE] ata2: [ITHREAD] ata3: on atapci1 atapci1: Reserved 0x8 bytes for rid 0x18 type 4 at 0x2800 atapci1: Reserved 0x4 bytes for rid 0x1c type 4 at 0x2814 ata3: reset tp1 mask=01 ostat0=7f ostat1=00 ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: stat0=0x7f err=0xff lsb=0xff msb=0xff ata3: reset tp2 stat0=ff stat1=00 devices=0x0 ata3: [MPSAFE] ata3: [ITHREAD] ata0-master: pio=PIO4 wdma=WDMA2 udma=UDMA100 cable=80 wire ad0: setting PIO4 on ICH5 chip ad0: setting UDMA100 on ICH5 chip ad0: 76319MB at ata0-master UDMA100 ad0: 156301488 sectors [155061C/16H/63S] 16 sectors/interrupt 1 depth queue ad0: Intel check1 failed ad0: Adaptec check1 failed ad0: LSI (v3) check1 failed ad0: LSI (v2) check1 failed ad0: FreeBSD check1 failed ata1: Identifying devices: 00020001 ata1: New devices: 00020001 ushub0: 2 ports with 2 removable, self powered ushub1: 2 ports with 2 removable, self powered ushub2: 2 ports with 2 removable, self powered GEOM: new disk ad0 ata1: reiniting channel .. ata1: reset tp1 mask=03 ostat0=00 ostat1=00 ata1: stat0=0x7f err=0x01 lsb=0x7f msb=0x7f ata1: stat1=0x80 err=0x01 lsb=0x14 msb=0xeb ata1: stat1=0x80 err=0x01 lsb=0x14 msb=0xeb ata1: stat1=0x00 err=0x01 lsb=0x14 msb=0xeb ata1: reset tp2 stat0=7f stat1=00 devices=0x20001 ata1: Identifying devices: 00020001 ata1: New devices: 00020000 ushub3: 8 ports with 8 removable, self powered unknown: WARNING - ATA_IDENTIFY taskqueue timeout - completing request directly ata1: reiniting channel .. ata1: reset tp1 mask=03 ostat0=7f ostat1=50 ata1: stat0=0x7f err=0x01 lsb=0x7f msb=0x7f ata1: stat1=0x90 err=0x01 lsb=0x14 msb=0xeb ata1: stat1=0x90 err=0x01 lsb=0x14 msb=0xeb ata1: stat1=0x00 err=0x01 lsb=0x14 msb=0xeb ata1: reset tp2 stat0=7f stat1=00 devices=0x20001 ata1: Identifying devices: 00020001 ata1: New devices: 00000000 ata1: reinit done .. unknown: FAILURE - ATA_IDENTIFY timed out LBA=0 unknown: WARNING - ATAPI_IDENTIFY taskqueue timeout - completing request directly ata1-slave: pio=PIO4 wdma=WDMA2 udma=UDMA66 cable=40 wire ad2: WARNING - ATA_IDENTIFY taskqueue timeout - completing request directly ata1: reiniting channel .. ata1: reset tp1 mask=03 ostat0=00 ostat1=00 ata1: stat0=0x7f err=0x01 lsb=0x7f msb=0x7f ata1: stat1=0x80 err=0x01 lsb=0x14 msb=0xeb ata1: stat1=0x80 err=0x01 lsb=0x14 msb=0xeb ata1: stat1=0x00 err=0x01 lsb=0x14 msb=0xeb ata1: reset tp2 stat0=7f stat1=00 devices=0x20001 ata1: Identifying devices: 00020001 ata1: New devices: 00000000 ata1: reinit done .. Fatal trap 18: integer divide fault while in kernel mode instruction pointer = 0x20:0xc08e837b stack pointer = 0x28:0xc0c209e8 frame pointer = 0x28:0xc0c20a5c code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 0 (swapper) [thread pid 0 tid 100000 ] Stopped at __qdivrem+0x3b: divl %ecx,%eax db> db> bt Tracing pid 0 tid 100000 td 0xc0aa07d0 __qdivrem(0,0,0,0,c0c20a78,...) at __qdivrem+0x3b __umoddi3(0,0,0,0,9e8f3044,...) at __umoddi3+0x2d ata_tf_write(c6fae190,c04efc80,c6fce800,1,c6fae190,...) at ata_tf_write+0x2ff ata_generic_command(c6fae190,6,c6d6ea00,c6d6ea00,c0c20b48,...) at ata_generic_command+0x2d9 ata_begin_transaction(c6fae190,1,c6cef000,c6d6ea00,c0c20b60,...) at ata_begin_transaction+0x7a ata_start(c6d6ea00,c09e6ae1,0,0,0,...) at ata_start+0x222 ata_reinit(c6d6ea00,5,ffffffff,0,0,...) at ata_reinit+0x501 ata_completed(c6fae000,0,c09e723c,0,20001,...) at ata_completed+0xb4 ata_queue_request(c6fae000,0,101,c6c31e00,101,...) at ata_queue_request+0x578 ata_getparam(c6fce800,1,2,0,0,...) at ata_getparam+0xf3 ata_identify(c6d6ea00,1,0,0,0,...) at ata_identify+0x24a ata_boot_attach(0,40000,c6d58c50,c6d198e0,c6c84340,...) at ata_boot_attach+0x6f run_interrupt_driven_config_hooks(0,c1ec00,c1ec00,c1e000,c25000,...) at run_interrupt_driven_config_hooks+0x91 mi_startup() at mi_startup+0x96 begin() at begin+0x2c ================================== || null@pozo.com || || Ph. (415) 681-6235 || ==================================