From owner-cvs-all@FreeBSD.ORG Thu Oct 23 05:04:49 2003 Return-Path: Delivered-To: cvs-all@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 8925C16A4B3; Thu, 23 Oct 2003 05:04:49 -0700 (PDT) Received: from mailout03.sul.t-online.com (mailout03.sul.t-online.com [194.25.134.81]) by mx1.FreeBSD.org (Postfix) with ESMTP id F0C2643FB1; Thu, 23 Oct 2003 05:04:47 -0700 (PDT) (envelope-from Alexander@Leidinger.net) Received: from fwd01.aul.t-online.de by mailout03.sul.t-online.com with smtp id 1ACeCj-0008WI-05; Thu, 23 Oct 2003 14:04:37 +0200 Received: from Andro-Beta.Leidinger.net (bpYRuvZerehEZBeQpAktqfrAOn-wCGEX0ClEFVrjzTKLukmvcJmQUf@[80.131.114.27]) by fmrl01.sul.t-online.com with esmtp id 1ACeCS-1PgOtk0; Thu, 23 Oct 2003 14:04:20 +0200 Received: from Magelan.Leidinger.net (Magellan [192.168.1.1]) h9NC4Jf5079499; Thu, 23 Oct 2003 14:04:19 +0200 (CEST) (envelope-from Alexander@Leidinger.net) Received: from Magelan.Leidinger.net (netchild@localhost [127.0.0.1]) h9NC4IpP023130; Thu, 23 Oct 2003 14:04:18 +0200 (CEST) (envelope-from Alexander@Leidinger.net) Date: Thu, 23 Oct 2003 14:04:18 +0200 From: Alexander Leidinger To: Soren Schmidt Message-Id: <20031023140418.7510414c.Alexander@Leidinger.net> In-Reply-To: <200310230919.h9N9J3eR001115@spider.deepcore.dk> References: <3F9799EF.9050509@Leidinger.net> <200310230919.h9N9J3eR001115@spider.deepcore.dk> X-Mailer: Sylpheed version 0.9.6claws (GTK+ 1.2.10; i386-portbld-freebsd5.1) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Seen: false X-ID: bpYRuvZerehEZBeQpAktqfrAOn-wCGEX0ClEFVrjzTKLukmvcJmQUf@t-dialin.net cc: cvs-src@FreeBSD.ORG cc: src-committers@FreeBSD.ORG cc: cvs-all@FreeBSD.ORG cc: John Baldwin Subject: Re: cvs commit: src/sys/dev/ata ata-dma.c ata-pci.c X-BeenThere: cvs-all@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: CVS commit messages for the entire tree List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 23 Oct 2003 12:04:49 -0000 On Thu, 23 Oct 2003 11:19:03 +0200 (CEST) Soren Schmidt wrote: > > > Hmm, are you sure those drives are OK ? have you tested them on another > > > system/controller ? > > > > I've only one system with a SATA controller. The drives are new. > > OK, have it ever worked for you ? can you test it in some other way ? I've only tested a recent -current on the system, as I got it just some days ago. What other way do you have in mind? I have no idea how to test it further. > > ata3: spurious interrupt - status=0x50 error=0x00 [a lot of them] > > as2: FAILURE - already active DMA on this device > > ad2: setting up DMA failed > > > > Fatal trap 12: page fault... > > Fault virtual address = 0x8 > > Fault code = supervisor read, page not present > > instruction pointer = 0x8:0xc042d4bb > > stack pointer = 0x10:0xe008ccac > > frame pointer = 0x10:0xe008ccb0 > > code segment = base 0x0, limit 0xfffff, type 0x1b > > = DPL 0, pres 1, def32 1, gran 1 > > processor eflags = interrupt enabled, resume, IOPL = 0 > > current process = 27 (irq5: uhci2 atapci1) > > kernel: type 12 trap, code=0 > > Stopped at ata_prtdev+0x7: movl 0x8(%ecx),%eax > > db> trace > > ata_prtdev > > ata_interrupt > > ata_generic_intr > > ithread_loopp > > fork_exit > > fork_trampoline > > --- trap 0x1, eip = 0, esp = 0xe008cd7c, ebp = 0 --- > > Hmm, this looks as if interrupts are not delivered to the right channel > or something, does it work if you only attach one drive ? Running "iozone -a" on one drive while the other drive is still attached results in spurious interrupts, and after a while in a panic too: ---snip--- ata2: resetting devices... ad1: FAILURE - already active DMA on this device ad1: Setting up DMA failed done ad1: TIMEOUT - WRITE_DMA retrying (2 retries left) ata2: resetting devices... FEOM: destroy disk ad1 handle workitem freeblocks: block count ad1: WARNING - removed from confguration done ata2-master: FAILURE - WRITE_DMA device lockup/removed Fatal trap 12: page fault Fatal virtual address = 0xff Fault code = supervisor read, page not present ... current process = 41 (bufdaemon) kernel: type 12 trap, code 0 Stopped at ---snip--- I open the case, remove one drive and test again... > > You are aware of the fact, that this is an ICH5R system (RAID BIOS > > disabled; the RAID isn't supposed to be supported with the actual code, > > correct?)? > > Yes, and I know of at least a handfull of other ich5 based system that > runs just fine, so I'm a bit puzzled here. Can I enable some kind of debug code to find the cause? Bye, Alexander. -- There's no place like ~ http://www.Leidinger.net Alexander @ Leidinger.net GPG fingerprint = C518 BC70 E67F 143F BE91 3365 79E2 9C60 B006 3FE7