From owner-freebsd-smp Mon Jan 11 15:09:35 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id PAA15961
        for freebsd-smp-outgoing; Mon, 11 Jan 1999 15:09:35 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from psv.oss.uswest.net (psv.oss.uswest.net [204.147.85.6])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id PAA15953;
        Mon, 11 Jan 1999 15:09:29 -0800 (PST)
        (envelope-from greg@psv.oss.uswest.net)
Received: (from greg@localhost)
        by psv.oss.uswest.net (8.8.8/8.8.5) id RAA12571;
        Mon, 11 Jan 1999 17:08:51 -0600 (CST)
Message-ID:
X-Mailer: XFMail 1.3 [p0] on FreeBSD
X-Priority: 3 (Normal)
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 8bit
MIME-Version: 1.0
Date: Mon, 11 Jan 1999 17:08:51 -0600 (CST)
Reply-To: greg@uswest.net
Organization: US WEST !NTERACT
From: Greg Rowe
To: freebsd-scsi@FreeBSD.ORG, freebsd-smp@FreeBSD.ORG
Subject: Fatal Trap
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

I have a Tyan mb dual 400mhz system I'm building that's getting repeatable
crashes under disk load. This didn't occur until I compiled an SMP kernel. I'm
running -current as of last Friday and the system has 512mb of memory and 4
drives on Adaptec controllers (1 - 4gig on one controller and 3 - 9gig on the
other). Untarring a file will interestingly enough crash the system at
approximately the same place in the tar image every time, although it's
crashed while building ports also.

The system drops to the debugger with the following:

Fatal trap 12: page fault while in kernel mode
mp_lock = 01000002; cpuid=1; lapic.id = 01000000 (I've also seen it for cpuid0)
fault virtual address   = 0x0
fault code              = supervisor write, page not present
instruction pointer     = 0x8:0xf0216357
stack pointer           = 0x10:0xff618ce8
frame pointer           = 0x10:0xff618d0c
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 243 (tar)
interrupt mask          = net tty bio cam <-SMP:XXX
kernel: type 12 trap, code 0
Stopped at generic_bzero+0xf: repe stosl %es:(%edi)

Any ideas as to what to look for ? Thanks.

Greg

Greg Rowe
US WEST - Internet Service Operations

"The telephone, for those of you who have forgotten, was a commonly used
communications technology in the days before electronic mail. They're still
easy to find in most large cities."
        -- Nathaniel Borenstein

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-smp" in the body of the message

From owner-freebsd-smp Mon Jan 11 16:25:18 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id QAA25838
        for freebsd-smp-outgoing; Mon, 11 Jan 1999 16:25:18 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from bright.fx.genx.net (bright.fx.genx.net [206.64.4.154])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id QAA25820
        for ; Mon, 11 Jan 1999 16:25:12 -0800 (PST)
        (envelope-from bright@hotjobs.com)
Received: from localhost (bright@localhost)
        by bright.fx.genx.net (8.9.1/8.9.1) with ESMTP id TAA86533
        for ; Mon, 11 Jan 1999 19:29:36 -0500 (EST)
        (envelope-from bright@hotjobs.com)
X-Authentication-Warning: bright.fx.genx.net: bright owned process doing -bs
Date: Mon, 11 Jan 1999 19:29:35 -0500 (EST)
From: Alfred Perlstein
X-Sender: bright@bright.fx.genx.net
To: freebsd-smp@FreeBSD.ORG
Subject: Re: Fatal Trap
In-Reply-To:
Message-ID:
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

On Mon, 11 Jan 1999, Greg Rowe wrote:

>
> Fatal trap 12: page fault while in kernel mode
> mp_lock = 01000002; cpuid=1; lapic.id = 01000000(I've also seen it for cpuid0)
> fault virtual address = 0x0
> fault code = supervisor write, page not present
> instruction pointer = 0x8:0xf0216357
> stack pointer = 0x10:0xff618ce8
> frame pointer = 0x10:0xff618d0c
> code segment = base 0x0, limit 0xfffff, type 0x1b
> = DPL 0, pres 1, def32 1, gran1
> processor eflags = interrupt enabled, resume, IOPL = 0
> current process = 243 (tar)
> interrupt mask = net tty bio cam <-SMP:XXX

why is this XXX? is it just an indication that using spl() is a bad thing
for SMP and that movement away from the spl() model of interrupt
servicing is a good thing?

-Alfred

> kernel: type 12 trap, code 0
> Stopped at generic_bzero+0xf: repe stosl %es:(%edi)

From owner-freebsd-smp Tue Jan 12 01:36:46 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id BAA23312
        for freebsd-smp-outgoing; Tue, 12 Jan 1999 01:36:46 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from isbalham.ist.co.uk (isbalham.ist.co.uk [192.31.26.1])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id BAA23284;
        Tue, 12 Jan 1999 01:36:35 -0800 (PST)
        (envelope-from rb@gid.co.uk)
Received: from gid.co.uk (uucp@localhost)
        by isbalham.ist.co.uk (8.8.7/8.8.7) with UUCP id JAA26451;
        Tue, 12 Jan 1999 09:35:37 GMT
        (envelope-from rb@gid.co.uk)
Received: from [194.32.164.2] by seagoon.gid.co.uk;
        Tue, 12 Jan 1999 09:32:18 GMT
X-Sender: rb@194.32.164.1
Message-Id:
In-Reply-To:
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Date: Tue, 12 Jan 1999 09:32:15 +0000
To: greg@uswest.net
From: Bob Bishop
Subject: Re: Fatal Trap
Cc: freebsd-scsi@FreeBSD.ORG, freebsd-smp@FreeBSD.ORG
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

Hi,

At 5:08 pm -0600 11/1/99, Greg Rowe wrote:
> I have a Tyan mb dual 400mhz system I'm building that's getting repeatable
>crashes under disk load. This didn't occur until I compiled an SMP kernel. I'm
>running -current as of last Friday and the system has 512mb of memory and 4
>drives on Adaptec controllers (1 - 4gig on one controller and 3 - 9gig on the
>other). [etc]

What Adaptec controllers, precisely?

--
Bob Bishop (0118) 977 4017 international code +44 118
rb@gid.co.uk fax (0118) 989 4254 between 0800 and 1800 UK

From owner-freebsd-smp Tue Jan 12 06:29:18 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id GAA25751
        for freebsd-smp-outgoing; Tue, 12 Jan 1999 06:29:18 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from psv.oss.uswest.net (psv.oss.uswest.net [204.147.85.6])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id GAA25744;
        Tue, 12 Jan 1999 06:29:16 -0800 (PST)
        (envelope-from greg@psv.oss.uswest.net)
Received: (from greg@localhost)
        by psv.oss.uswest.net (8.8.8/8.8.5) id IAA13923;
        Tue, 12 Jan 1999 08:28:01 -0600 (CST)
Message-ID:
X-Mailer: XFMail 1.3 [p0] on FreeBSD
X-Priority: 3 (Normal)
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 8bit
MIME-Version: 1.0
In-Reply-To:
Date: Tue, 12 Jan 1999 08:28:01 -0600 (CST)
Reply-To: greg@uswest.net
Organization: US WEST !NTERACT
From: Greg Rowe
To: Bob Bishop
Subject: Re: Fatal Trap
Cc: freebsd-scsi@FreeBSD.ORG, freebsd-smp@FreeBSD.ORG
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

They are 2940UW's Revision C. I had 2940UW revision E's in, but I replaced
them thinking that was the issue. I can build the system with a Generic
kernel, single CPU, and do the tar load with no problems, so it appears to be
an SMP issue ???

A dmesg output follows:

Copyright (c) 1992-1999 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 3.0-CURRENT #1: Mon Jan 11 09:02:31 CST 1999
    root@spare2.oss.uswest.net:/usr/src/sys/compile/TAHITI
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium II (quarter-micron) (686-class CPU)
  Origin = "GenuineIntel" Id = 0x652 Stepping=2
  Features=0x183fbff>
real memory = 536870912 (524288K bytes)
config> quit
avail memory = 518963200 (506800K bytes)
Programming 24 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xf0309000.
Probing for devices on PCI bus 0:
chip0: rev 0x02 on pci0.0.0
chip1: rev 0x02 on pci0.1.0
chip2: rev 0x02 on pci0.7.0
ide_pci0: rev 0x01 on pci0.7.1
chip3: rev 0x02 on pci0.7.3
Freeing (NOT implemented) redirected ISA irq 11.
ahc0: rev 0x00 int a irq 16 on pci0.16.0
ahc0: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs
Freeing (NOT implemented) redirected ISA irq 9.
ahc1: rev 0x00 int a irq 17 on pci0.17.0
ahc1: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs
Freeing (NOT implemented) redirected ISA irq 10.
vga0: rev 0x01 int a irq 18 on pci0.18.0
Freeing (NOT implemented) redirected ISA irq 5.
fxp0: rev 0x05 int a irq 19 on pci0.19.0
fxp0: Ethernet address 00:90:27:1d:8e:08
Probing for devices on PCI bus 1:
Probing for devices on the ISA bus:
sc0 at 0x60-0x6f irq 1 on motherboard
sc0: VGA color <16 virtual consoles, flags=0x0>
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
lpt0 at 0x378-0x37f irq 7 on isa
lpt0: Interrupt-driven port
lp0: TCP/IP capable interface
psm0 at 0x60-0x64 irq 12 on motherboard
psm0: model Generic PS/2 mouse, device ID 0
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 not found at 0x1f0
wt0 not found at 0x300
mcd0 not found at 0x300
matcdc0 not found at 0x230
scd0 not found at 0x230
adv0 not found at 0x330
bt0 not found at 0x134
aha0 not found at 0x134
npx0 on motherboard
npx0: INT 16 interface
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: routing 8254 via pin 2
Waiting 15 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
changing root device to da0s1a
da0 at ahc0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-2 device
da0: 40.0MB/s transfers (20.0MHz, offset 8, 16bit), Tagged Queueing Enabled
da0: 4340MB (8888924 512 byte sectors: 255H 63S/T 553C)
da1 at ahc1 bus 0 target 0 lun 0
da1: Fixed Direct Access SCSI-2 device
da1: 40.0MB/s transfers (20.0MHz, offset 8, 16bit), Tagged Queueing Enabled
da1: 8683MB (17783240 512 byte sectors: 255H 63S/T 1106C)
da3 at ahc1 bus 0 target 2 lun 0
da3: Fixed Direct Access SCSI-2 device
da3: 40.0MB/s transfers (20.0MHz, offset 8, 16bit), Tagged Queueing Enabled
da3: 8683MB (17783240 512 byte sectors: 255H 63S/T 1106C)
da2 at ahc1 bus 0 target 1 lun 0
da2: Fixed Direct Access SCSI-2 device
da2: 40.0MB/s transfers (20.0MHz, offset 8, 16bit), Tagged Queueing Enabled
da2: 8683MB (17783240 512 byte sectors: 255H 63S/T 1106C)
WARNING: / was not properly dismounted
(da3:ahc1:0:2:0): tagged openings now 64
(da0:ahc0:0:0:0): tagged openings now 61
(da0:ahc0:0:0:0): tagged openings now 60
(da0:ahc0:0:0:0): tagged openings now 59
(da0:ahc0:0:0:0): tagged openings now 58
(da0:ahc0:0:0:0): tagged openings now 57
(da0:ahc0:0:0:0): tagged openings now 56
(da0:ahc0:0:0:0): tagged openings now 55
(da0:ahc0:0:0:0): tagged openings now 54
(da0:ahc0:0:0:0): tagged openings now 53
(da0:ahc0:0:0:0): tagged openings now 52
(da0:ahc0:0:0:0): tagged openings now 51
(da0:ahc0:0:0:0): tagged openings now 50
(da0:ahc0:0:0:0): tagged openings now 49
(da1:ahc1:0:0:0): tagged openings now 64
(da1:ahc1:0:0:0): tagged openings now 63
(da1:ahc1:0:0:0): tagged openings now 62
(da1:ahc1:0:0:0): tagged openings now 61
(da1:ahc1:0:0:0): tagged openings now 60
(da1:ahc1:0:0:0): tagged openings now 59
(da1:ahc1:0:0:0): tagged openings now 58
(da1:ahc1:0:0:0): tagged openings now 57
(da1:ahc1:0:0:0): tagged openings now 56
(da1:ahc1:0:0:0): tagged openings now 55
(da1:ahc1:0:0:0): tagged openings now 54
(da1:ahc1:0:0:0): tagged openings now 53
(da1:ahc1:0:0:0): tagged openings now 52
(da1:ahc1:0:0:0): tagged openings now 51
(da1:ahc1:0:0:0): tagged openings now 50
(da1:ahc1:0:0:0): tagged openings now 49

On 12-Jan-99 Bob Bishop wrote:
> Hi,
>
> At 5:08 pm -0600 11/1/99, Greg Rowe wrote:
>> I have a Tyan mb dual 400mhz system I'm building that's getting repeatable
>>crashes under disk load. This didn't occur until I compiled an SMP kernel.
>>I'm
>>running -current as of last Friday and the system has 512mb of memory and 4
>>drives on Adaptec controllers (1 - 4gig on one controller and 3 - 9gig on the
>>other). [etc]
>
> What Adaptec controllers, precisely?
>
>
> --
> Bob Bishop (0118) 977 4017 international code +44 118
> rb@gid.co.uk fax (0118) 989 4254 between 0800 and 1800 UK
>
>
>
> To Unsubscribe: send mail to majordomo@FreeBSD.org
> with "unsubscribe freebsd-smp" in the body of the message

Greg Rowe
US WEST - Internet Service Operations

"The telephone, for those of you who have forgotten, was a commonly used
communications technology in the days before electronic mail. They're still
easy to find in most large cities."
        -- Nathaniel Borenstein

From owner-freebsd-smp Tue Jan 12 07:47:53 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id HAA04158
        for freebsd-smp-outgoing; Tue, 12 Jan 1999 07:47:53 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from Ilsa.StevesCafe.com (Ilsa.StevesCafe.com [205.168.119.129])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id HAA04153
        for ; Tue, 12 Jan 1999 07:47:51 -0800 (PST)
        (envelope-from fbsd@Ilsa.StevesCafe.com)
Received: from Ilsa.StevesCafe.com (localhost.StevesCafe.com [127.0.0.1])
        by Ilsa.StevesCafe.com (8.8.8/8.8.5) with ESMTP id IAA22317;
        Tue, 12 Jan 1999 08:49:40 -0700 (MST)
Message-Id: <199901121549.IAA22317@Ilsa.StevesCafe.com>
X-Mailer: exmh version 2.0.2 2/24/98
From: Steve Passe
To: Alfred Perlstein
cc: freebsd-smp@FreeBSD.ORG
Subject: Re: Fatal Trap
In-reply-to: Your message of "Mon, 11 Jan 1999 19:29:35 EST."
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Tue, 12 Jan 1999 08:49:40 -0700
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

Hi,

> On Mon, 11 Jan 1999, Greg Rowe wrote:
...
> > interrupt mask = net tty bio cam <-SMP:XXX
>
> why is this XXX? is it just an indication that using spl() is a bad thing
> for SMP and that movement away from the spl() model of interrupt
> servicing is a good thing?

I'm the one responsible for that line, but I can't honestly remember its
exact significance. I'm not even sure if it stands for "Symmetric Multi
Processor", or if it's my initials (probably the former). I put it there at
the time I was attempting to decouple the intr aspect of spl() from the
process aspect (i.e. cpl vs. ipl).

Anyway, your statement is accurate in that spl() must go away in favor of
mutex() style programming before the "giant lock" problem can be fixed. I beat
my head on that wall for quite a while before giving up...

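The contrast Steve draws between the "giant lock" and mutex-style programming
can be illustrated with a rough user-space analogy. The Python sketch below is
not kernel code and is not taken from FreeBSD: GiantLockKernel,
FineGrainedKernel and run are hypothetical names invented for illustration,
and the subsystem names are simply borrowed from the "interrupt mask" line
quoted above. It only shows the structural difference between one lock
guarding everything and one mutex per subsystem.

```python
import threading


class GiantLockKernel:
    """Hypothetical model: one lock serializes every subsystem."""

    def __init__(self, subsystems):
        self.giant = threading.Lock()
        self.counters = {name: 0 for name in subsystems}

    def work(self, name, iterations):
        for _ in range(iterations):
            # Work in unrelated subsystems still queues behind this one lock.
            with self.giant:
                self.counters[name] += 1


class FineGrainedKernel:
    """Hypothetical model: each subsystem is protected by its own mutex."""

    def __init__(self, subsystems):
        self.locks = {name: threading.Lock() for name in subsystems}
        self.counters = {name: 0 for name in subsystems}

    def work(self, name, iterations):
        for _ in range(iterations):
            # Only threads touching the same subsystem contend here.
            with self.locks[name]:
                self.counters[name] += 1


def run(kernel, subsystems, iterations=1000, workers=2):
    """Start `workers` threads per subsystem and return the final counters."""
    threads = [
        threading.Thread(target=kernel.work, args=(name, iterations))
        for name in subsystems
        for _ in range(workers)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return dict(kernel.counters)


if __name__ == "__main__":
    subsystems = ["net", "tty", "bio", "cam"]
    print(run(GiantLockKernel(subsystems), subsystems))
    print(run(FineGrainedKernel(subsystems), subsystems))
```

Both variants end with the same counts; what differs is which threads can
block each other. In the first, a thread working on "net" can be held up by
one working on "cam"; in the second it cannot.
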
--
Steve Passe | powered by
smp@csn.net | Symmetric MultiProcessor FreeBSD

From owner-freebsd-smp Tue Jan 12 11:51:42 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id LAA02049
        for freebsd-smp-outgoing; Tue, 12 Jan 1999 11:51:42 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from isbalham.ist.co.uk (isbalham.ist.co.uk [192.31.26.1])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id LAA02027;
        Tue, 12 Jan 1999 11:51:28 -0800 (PST)
        (envelope-from rb@gid.co.uk)
Received: from gid.co.uk (uucp@localhost)
        by isbalham.ist.co.uk (8.8.7/8.8.7) with UUCP id TAA07388;
        Tue, 12 Jan 1999 19:50:30 GMT
        (envelope-from rb@gid.co.uk)
Received: from [194.32.164.2] by seagoon.gid.co.uk;
        Tue, 12 Jan 1999 19:42:37 GMT
X-Sender: rb@194.32.164.1
Message-Id:
In-Reply-To:
References:
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Date: Tue, 12 Jan 1999 19:42:35 +0000
To: greg@uswest.net
From: Bob Bishop
Subject: Re: Fatal Trap
Cc: freebsd-scsi@FreeBSD.ORG, freebsd-smp@FreeBSD.ORG
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

At 8:28 am -0600 12/1/99, Greg Rowe wrote:
>They are 2940UW's Revision C. I had 2940UW revision E's in, but I replaced
>them
>thinking that was the issue. I can build the system with a Generic kernel,
>single CPU, and do the tar load with no problems, so it appears to be an SMP
>issue ??? A dmesg output follows:

Some 2940s (but not the UW, IIRC) misbehave with some MP motherboards.
http://www.adaptec.com/support might be worth a quick look.

--
Bob Bishop (0118) 977 4017 international code +44 118
rb@gid.co.uk fax (0118) 989 4254 between 0800 and 1800 UK

From owner-freebsd-smp Tue Jan 12 15:58:32 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id PAA07917
        for freebsd-smp-outgoing; Tue, 12 Jan 1999 15:58:32 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from gw1.tesys.com (gw1.tesys.com [207.5.58.2])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id PAA07912
        for ; Tue, 12 Jan 1999 15:58:31 -0800 (PST)
        (envelope-from kamal@tesys.com)
Received: from kamal.tesys.com (kamal.tesys.com [207.5.58.14])
        by gw1.tesys.com (8.9.1/8.7.3) with SMTP id PAA21070
        for ; Tue, 12 Jan 1999 15:58:31 -0800 (PST)
Message-Id: <199901122358.PAA21070@gw1.tesys.com>
X-Sender: kamal@gw1.tesys.com
X-Mailer: QUALCOMM Windows Eudora Pro Version 4.0
Date: Tue, 12 Jan 1999 15:57:37 -0800
To: smp@FreeBSD.ORG
From: Kamal Singh
Subject: Pentium II
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

Hello,

Can you please educate me on how to make multiple processors work under
FreeBSD 3.0?

Thanks in advance..

Kamal Singh

From owner-freebsd-smp Wed Jan 13 04:15:38 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id EAA12475
        for freebsd-smp-outgoing; Wed, 13 Jan 1999 04:15:38 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from ns1.seidata.com (ns1.seidata.com [208.10.211.2])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id EAA12464
        for ; Wed, 13 Jan 1999 04:15:37 -0800 (PST)
        (envelope-from mike@seidata.com)
From: mike@seidata.com
Received: from localhost (mike@localhost)
        by ns1.seidata.com (8.8.8/8.8.5) with ESMTP id HAA12643;
        Wed, 13 Jan 1999 07:14:59 -0500 (EST)
Date: Wed, 13 Jan 1999 07:14:58 -0500 (EST)
To: Kamal Singh
cc: smp@FreeBSD.ORG
Subject: Re: Pentium II
In-Reply-To: <199901122358.PAA21070@gw1.tesys.com>
Message-ID:
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

On Tue, 12 Jan 1999, Kamal Singh wrote:

> Can you please educate me how to make multiple processor work under FreeBSD
> 3.0?

Check out /sys/i386/conf/SMP-GENERIC. From there, the options are pretty
straightforward.

--
Mike Hoskins
System/Network Administrator
SEI Data Network Services, Inc.
http://www.seidata.com

From owner-freebsd-smp Wed Jan 13 10:59:08 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id KAA00478
        for freebsd-smp-outgoing; Wed, 13 Jan 1999 10:59:08 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from charon.npc.net (charon.finall.com [199.15.61.3])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id KAA00403
        for ; Wed, 13 Jan 1999 10:58:54 -0800 (PST)
        (envelope-from mjung@npc.net)
Received: from exchange.finall.com (exchange-gw.finall.com [10.0.158.37])
        by charon.npc.net (8.9.1/8.8.8) with SMTP id MAA06801
        for ; Wed, 13 Jan 1999 12:07:06 -0500 (EST)
        (envelope-from mjung@npc.net)
Received: by exchange.finall.com with SMTP
        (Microsoft Exchange Server Internet Mail Connector Version 4.0.996.62)
        id <01BE3EEB.DCF06010@exchange.finall.com>;
        Wed, 13 Jan 1999 11:57:15 -0500
Message-ID:
From: "Jung, Michael"
To: "'freebsd-smp@freebsd.org'"
Subject: Kernel Trap
Date: Wed, 13 Jan 1999 11:57:14 -0500
X-Mailer: Microsoft Exchange Server Internet Mail Connector Version 4.0.996.62
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

This is the third time this has occurred to me. I also get trap 12. I
remember someone noting that the 2940 might be the culprit, but it never dies
when running without SMP compiled into the kernel. I didn't find anything
while searching Adaptec's site either. Any ideas? What else is necessary to
help track this down?

TIA

--mikej

This is hand copied from the screen:

Fatal trap 21: FPU bounds check fault while in kernel mode
mp_lock=01000006; cpuid = 1; lapic.id = 01000000
instruction pointer     = 0x8:0xf0186037
stack pointer           = 0x10:0xf6a411ec
frame pointer           = 0x10:0xf6a41208
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 2085 (cc1)
interrupt mask          = cam <- SMP: XXX
trap number             = 21
panic: FPU bounds check fault
mp_lock = 01000006; cpuid = 1; lapic.id = 01000000
boot() called on cpu#1

Other info:

===============================================================================

MPTable, version 2.0.15

-------------------------------------------------------------------------------

MP Floating Pointer Structure:

  location:                     BIOS
  physical address:             0x000f0c80
  signature:                    '_MP_'
  length:                       16 bytes
  version:                      1.1
  checksum:                     0xf4
  mode:                         Virtual Wire

-------------------------------------------------------------------------------

MP Config Table Header:

  physical address:             0x000f0c94
  signature:                    'PCMP'
  base table length:            292
  version:                      1.1
  checksum:                     0xa5
  OEM ID:                       'OEM00000'
  Product ID:                   'PROD00000000'
  OEM table pointer:            0x00000000
  OEM table size:               0
  entry count:                  28
  local APIC address:           0xfee00000
  extended table length:        0
  extended table checksum:      0

-------------------------------------------------------------------------------

MP Config Base Table Entries:

--
Processors:     APIC ID  Version  State        Family  Model  Step  Flags
                      0  0x11     BSP, usable       5      2     1  0x07bf
                      1  0x11     AP, usable        5      2     1  0x07bf
--
Bus:            Bus ID  Type
                     0  ISA
                     1  PCI
--
I/O APICs:      APIC ID  Version  State   Address
                      2  0x11     usable  0xfec00000
--
I/O Ints:       Type    Polarity   Trigger   Bus ID  IRQ   APIC ID  PIN#
                ExtINT  conforms   conforms       0  0           2     0
                INT     conforms   conforms       0  1           2     1
                INT     conforms   conforms       0  0           2     2
                INT     conforms   conforms       0  3           2     3
                INT     conforms   conforms       0  4           2     4
                INT     conforms   conforms       0  5           2     5
                INT     conforms   conforms       0  6           2     6
                INT     conforms   conforms       0  7           2     7
                INT     conforms   conforms       0  8           2     8
                INT     conforms   conforms       0  9           2     9
                INT     conforms   conforms       0  10          2    10
                INT     conforms   conforms       0  11          2    11
                INT     conforms   conforms       0  12          2    12
                INT     conforms   conforms       0  13          2    13
                INT     conforms   conforms       0  14          2    14
                INT     conforms   conforms       0  15          2    15
                INT     active-lo  level          1  20:A        2    16
                INT     active-lo  level          1  19:A        2    17
                INT     active-lo  level          1  18:A        2    18
                INT     active-lo  level          1  17:A        2    19
                SMI     conforms   conforms       0  0           2    23
--
Local Ints:     Type    Polarity   Trigger   Bus ID  IRQ   APIC ID  PIN#
                ExtINT  active-hi  edge           0  0         255     0
                NMI     active-hi  edge           0  0         255     1

-------------------------------------------------------------------------------

# SMP kernel config file options:

# Required:
options         SMP             # Symmetric MultiProcessor Kernel
options         APIC_IO         # Symmetric (APIC) I/O

# Optional (built-in defaults will work in most cases):
#options        NCPU=2          # number of CPUs
#options        NBUS=2          # number of busses
#options        NAPIC=1         # number of IO APICs
#options        NINTR=24        # number of INTs

===============================================================================

Copyright (c) 1992-1998 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 3.0-RELEASE #29: Tue Dec 29 14:26:39 GMT 1998
    root@quake.npc.net:/usr/src/sys/compile/quake
Timecounter "i8254" frequency 1193182 Hz cost 2772 ns
CPU: Pentium/P55C (586-class CPU)
  Origin = "GenuineIntel" Id = 0x544 Stepping=4
  Features=0x8003bf
real memory = 83886080 (81920K bytes)
avail memory = 78319616 (76484K bytes)
Programming 24 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00030010, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00030010, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Probing for devices on PCI bus 0:
chip0: rev 0x03 on pci0.0.0
chip1: rev 0x01 on pci0.7.0
ide_pci0: rev 0x00 on pci0.7.1
vx0: <3COM 3C595 Fast Etherlink III PCI> rev 0x00 int a irq 19 on pci0.17.0
utp/tx[*utp*]: disable 'auto select' with DOS util! address 00:a0:24:93:17:3f
ahc0: rev 0x00 int a irq 17 on pci0.19.0
ahc0: aic7890/91 Wide Channel A, SCSI Id=7, 16/255 SCBs
vga0: rev 0xd3 int a irq 16 on pci0.20.0
Probing for devices on the ISA bus:
sc0 at 0x60-0x6f irq 1 on motherboard
sc0: VGA color <16 virtual consoles, flags=0x0>
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
lpt0 at 0x378-0x37f irq 7 on isa
lpt0: Interrupt-driven port
lp0: TCP/IP capable interface
psm0 not found at 0x60
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 not found at 0x1f0
wt0 not found at 0x300
aha0 not found at 0x134
npx0 on motherboard
npx0: INT 16 interface
sb0 at 0x220 irq 5 drq 1 on isa
snd0: SB16 Error: Invalid DMA channel 1/-1
sbxvi0 not found
sbmidi0 at 0x330 on isa
snd0:
joy0 at 0x201 on isa
joy0: joystick
spigot0 not found at 0xad6
Intel Pentium F00F detected, installing workaround
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: routing 8254 via pin 2
ccd0-11: Concatenated disk drivers
Waiting 8 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
da0 at ahc0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI2 device
da0: 10.0MB/s transfers (10.0MHz, offset 15), Tagged Queueing Enabled
da0: 8669MB (17755792 512 byte sectors: 255H 63S/T 1105C)
da4 at ahc0 bus 0 target 1 lun 0
da4: Fixed Direct Access SCSI2 device
da4: 10.0MB/s transfers (10.0MHz, offset 15), Tagged Queueing Enabled
da4: 8669MB (17755792 512 byte sectors: 255H 63S/T 1105C)
da5 at ahc0 bus 0 target 2 lun 0
da5: Fixed Direct Access SCSI2 device
da5: 10.0MB/s transfers (10.0MHz, offset 15), Tagged Queueing Enabled
da5: 8669MB (17755792 512 byte sectors: 255H 63S/T 1105C)
da6 at ahc0 bus 0 target 3 lun 0
da6: Fixed Direct Access SCSI2 device
da6: 10.0MB/s transfers (10.0MHz, offset 15), Tagged Queueing Enabled
da6: 8669MB (17755792 512 byte sectors: 255H 63S/T 1105C)
da1 at ahc0 bus 0 target 4 lun 0
da1: Fixed Direct Access SCSI2 device
da1: 10.0MB/s transfers (10.0MHz, offset 15), Tagged Queueing Enabled
da1: 8669MB (17755792 512 byte sectors: 255H 63S/T 1105C)
da2 at ahc0 bus 0 target 5 lun 0
da2: Fixed Direct Access SCSI2 device
da2: 10.0MB/s transfers (10.0MHz, offset 15), Tagged Queueing Enabled
da2: 8669MB (17755792 512 byte sectors: 255H 63S/T 1105C)
da3 at ahc0 bus 0 target 6 lun 0
da3: Fixed Direct Access SCSI2 device
da3: 10.0MB/s transfers (10.0MHz, offset 15), Tagged Queueing Enabled
da3: 8669MB (17755792 512 byte sectors: 255H 63S/T 1105C)
changing root device to da0s1a

Michael Jung (MJ548)            (502) 315-2457 Voice
National Processing Company     (502) 315-2815 Facsimile
1231 Durrett Lane               mjung@npc.net
Mailstop: 75SYSP
Louisville, KY 40285

From owner-freebsd-smp Wed Jan 13 15:29:40 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id PAA20144
        for freebsd-smp-outgoing; Wed, 13 Jan 1999 15:29:40 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from sv.compuland.com.br
        (sv.compuland.com.br [200.255.96.21])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id PAA20132
        for ; Wed, 13 Jan 1999 15:29:35 -0800 (PST)
        (envelope-from helio@compuland.com.br)
Received: from fuzzy (fuzzy.compuland.com.br [200.255.96.29])
        by sv.compuland.com.br (8.8.8/8.8.8) with SMTP id VAA22545
        for ; Wed, 13 Jan 1999 21:26:49 -0200 (EDT)
Message-Id: <199901132326.VAA22545@sv.compuland.com.br>
From: "Helio Coelho Jr - CompuLand ISP - Petrópolis"
To: freebsd-smp@FreeBSD.ORG
Date: Wed, 13 Jan 1999 21:25:57 +0300
MIME-Version: 1.0
Content-type: text/plain; charset=US-ASCII
Content-transfer-encoding: 7BIT
Subject: Soyo Motherboard
Reply-to: helio@compuland.com.br
X-PM-Encryptor: QDPGP, 4
X-mailer: Pegasus Mail for Win32 (v3.01d)
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi:

I've installed 3.0Release in a Soyo SY-D6IBA with two 400Mhz Pentium2
processors... It's very fast, but...

Both when I issue a 'build world' and during a custom kernel compilation, the
machine hung with:

fatal trap 12: kernel page fault while in kernel mode
mp_lock = 01000002; cpu id = 1 ; lapic.id = 01000000
.
.
.
stopped at _generic_bcopy+0x1a: repe movsl

I didn't find any reference to this motherboard, neither in the mailing
list nor in the SMP pages... Is that board rogue hardware ? :)
Has anyone used that board ? (with success ? :)) )

Best Regards,

Helio.

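The "mp_lock = ...; cpuid = ...; lapic.id = ..." line that appears in each trap
report in this thread can be unpacked mechanically. The Python sketch below is
a guess at the layout, not something stated in any of the messages: it assumes
the giant-lock word keeps the holder's CPU number in the top byte and a
nesting count in the low 24 bits, and that the local APIC ID sits in the top
byte of lapic.id. The values posted here (01000002 and 01000006, both with
cpuid 1 and lapic.id 01000000) are consistent with that reading.
parse_mp_lock_line and the two mask constants are hypothetical names.

```python
import re

# Assumed field layout of the giant-lock word (not confirmed by the thread).
CPU_FIELD = 0xFF000000     # top byte: CPU holding the lock
COUNT_FIELD = 0x00FFFFFF   # low 24 bits: nesting count

# Tolerates the spacing variants seen in the posted reports:
# "cpuid=1", "cpuid = 1" and "cpu id = 1 ".
LINE_RE = re.compile(
    r"mp_lock\s*=\s*([0-9a-fA-F]+)\s*;"
    r"\s*cpu\s*id\s*=\s*(\d+)\s*;"
    r"\s*lapic\.id\s*=\s*([0-9a-fA-F]+)"
)


def parse_mp_lock_line(line):
    """Split one trap-report status line into its numeric fields."""
    match = LINE_RE.search(line)
    if match is None:
        raise ValueError("not an mp_lock status line: %r" % line)
    mp_lock = int(match.group(1), 16)
    lapic_id = int(match.group(3), 16)
    return {
        "mp_lock": mp_lock,
        "cpuid": int(match.group(2)),
        "lapic_id": lapic_id,
        "holder_cpu": (mp_lock & CPU_FIELD) >> 24,
        "count": mp_lock & COUNT_FIELD,
        "lapic_cpu": lapic_id >> 24,
    }


if __name__ == "__main__":
    print(parse_mp_lock_line(
        "mp_lock = 01000002; cpu id = 1 ; lapic.id = 01000000"))
```

Under these assumptions every report in the thread shows the faulting CPU
itself holding the lock, with a nesting count of 2 or 6.
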
The mptable:

======================================================================

MPTable, version 2.0.15

 looking for EBDA pointer @ 0x040e, NOT found
 searching CMOS 'top of mem' @ 0x0009fc00 (639K)
 searching BIOS @ 0x000f0000

 MP FPS found in BIOS @ physical addr: 0x000f58c0

- -------------------------------------------------------------------------------

MP Floating Pointer Structure:

  location:                     BIOS
  physical address:             0x000f58c0
  signature:                    '_MP_'
  length:                       16 bytes
  version:                      1.1
  checksum:                     0x80
  mode:                         Virtual Wire

- -------------------------------------------------------------------------------

MP Config Table Header:

  physical address:             0x000f1400
  signature:                    'PCMP'
  base table length:            292
  version:                      1.1
  checksum:                     0x43
  OEM ID:                       'OEM00000'
  Product ID:                   'PROD00000000'
  OEM table pointer:            0x00000000
  OEM table size:               0
  entry count:                  28
  local APIC address:           0xfee00000
  extended table length:        0
  extended table checksum:      0

- -------------------------------------------------------------------------------

MP Config Base Table Entries:

- --
Processors:     APIC ID  Version  State        Family  Model  Step  Flags
                0        0x11     BSP, usable  6       5      0     0xfbff
                1        0x11     AP, usable   6       5      0     0xfbff
- --
Bus:            Bus ID  Type
                0       PCI
                1       PCI
                2       ISA
- --
I/O APICs:      APIC ID  Version  State   Address
                2        0x11     usable  0xfec00000
- --
I/O Ints:       Type    Polarity   Trigger    Bus ID  IRQ   APIC ID  PIN#
                ExtINT  conforms   conforms   2       0     2        0
                INT     conforms   conforms   2       1     2        1
                INT     conforms   conforms   2       0     2        2
                INT     conforms   conforms   2       3     2        3
                INT     conforms   conforms   2       4     2        4
                INT     conforms   conforms   2       5     2        5
                INT     conforms   conforms   2       6     2        6
                INT     conforms   conforms   2       7     2        7
                INT     active-hi  edge       2       8     2        8
                INT     conforms   conforms   2       9     2        9
                INT     conforms   conforms   2       10    2        10
                INT     conforms   conforms   2       11    2        11
                INT     conforms   conforms   2       12    2        12
                INT     conforms   conforms   2       13    2        13
                INT     conforms   conforms   2       14    2        14
                INT     conforms   conforms   2       15    2        15
                INT     active-lo  level      0       7:A   2        19
                INT     active-lo  level      0       11:A  2        16
                INT     active-lo  level      0       18:A  2        17
                SMI     conforms   conforms   2       0     2        23
- --
Local Ints:     Type    Polarity   Trigger    Bus ID  IRQ   APIC ID  PIN#
                ExtINT  conforms   conforms   0       0:A   255      0
                NMI     conforms   conforms   0       0:A   255      1

- -------------------------------------------------------------------------------

# SMP kernel config file options:

# Required:
options         SMP             # Symmetric MultiProcessor Kernel
options         APIC_IO         # Symmetric (APIC) I/O

# Optional (built-in defaults will work in most cases):
#options        NCPU=2          # number of CPUs
#options        NBUS=3          # number of busses
#options        NAPIC=1         # number of IO APICs
#options        NINTR=24        # number of INTs

- -------------------------------------------------------------------------------

dmesg output:

Copyright (c) 1992-1998 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 3.0-RELEASE #1: Wed Jan 13 18:40:17 EDT 1999
    root@fuzzy2.compuland.com.br:/usr/src/sys/compile/SVKERNEL
Timecounter "i8254" frequency 1193182 Hz cost 3191 ns
CPU: Pentium II (quarter-micron) (686-class CPU)
  Origin = "GenuineIntel" Id = 0x650 Stepping=0
  Features=0x183fbff>
real memory = 268435456 (262144K bytes)
avail memory = 258052096 (252004K bytes)
Programming 24 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Probing for devices on PCI bus 0:
chip0: rev 0x02 on pci0.0.0
chip1: rev 0x02 on pci0.1.0
chip2: rev 0x02 on pci0.7.0
ide_pci0: rev 0x01 on pci0.7.1
chip3: rev 0x01 int d irq 11 on pci0.7.2
chip4: rev 0x02 on pci0.7.3
ahc0: rev 0x00 int a irq 16 on pci0.11.0
ahc0: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs
ed1: rev 0x00 int a irq 17 on pci0.18.0
ed1: address 00:00:b4:98:cc:17, type NE2000 (16 bit)
Probing for devices on PCI bus 1:
Probing for devices on the ISA bus:
sc0 at 0x60-0x6f irq 1 on motherboard
sc0: VGA color <16 virtual consoles, flags=0x0>
ed0 not found at 0x280
fe0 not found at 0x300
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
lpt0 at 0x378-0x37f irq 7 on isa
lpt0: Interrupt-driven port
lp0: TCP/IP capable interface
psm0 not found at 0x60
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 at 0x1f0-0x1f7 irq 14 on isa
wdc0: unit 1 (atapi): <36X CD-ROM/VER 2.F0>, removable, accel, dma, iordy
wcd0: 687/4125Kb/sec, 128Kb cache, audio play, 255 volume levels, ejectable tray
wcd0: 120mm data disc loaded, unlocked
wt0 not found at 0x300
mcd0 not found at 0x300
matcdc0 not found at 0x230
scd0 not found at 0x230
ie0: unknown board_id: f000
ie0 not found at 0x300
ep0 not found at 0x300
ex0 not found
le0 not found at 0x300
lnc0 not found at 0x280
ze0 not found at 0x300
zp0 not found at 0x300
cs0 not found at 0x300
adv0 not found at 0x330
bt0 not found at 0x134
aha0 not found at 0x134
npx0 on motherboard
npx0: INT 16 interface
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: routing 8254 via pin 2
IP packet filtering initialized, divert disabled, rule-based forwarding enabled, default to accept, logging limited to 100 packets/entry
Waiting 15 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
changing root device to da0s1a
da0 at ahc0 bus 0 target 1 lun 0
da0: Fixed Direct Access SCSI2 device
da0: 10.0MB/s transfers (10.0MHz, offset 15), Tagged Queueing Enabled
da0: 2014MB (4124736 512 byte sectors: 255H 63S/T 256C)
WARNING: / was not properly dismounted
(da0:ahc0:0:1:0): tagged openings now 15
.
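A minimal sketch of how the PCI rows of the "I/O Ints" table above line up with the IRQ numbers that dmesg reports under APIC_IO. It assumes the MP specification's encoding of the PCI "source bus IRQ" field (INTx in bits 0-1, device number in bits 2-6), which mptable prints as "device:INTx". The names PCI_ROUTES, decode_pci_source_irq and apic_pin are hypothetical and exist only for this illustration; this is not how the kernel itself is written.

```python
# PCI bus 0 rows copied from the "I/O Ints" table in the mptable output:
# (PCI device number, INTx line) -> I/O APIC pin
PCI_ROUTES = {
    (7, "A"): 19,
    (11, "A"): 16,
    (18, "A"): 17,
}


def decode_pci_source_irq(source_irq):
    """Split an MP-table PCI source-bus-IRQ byte into (device, INTx).

    Assumed layout (MP specification): bits 0-1 select INTA..INTD,
    bits 2-6 hold the PCI device number.
    """
    device = (source_irq >> 2) & 0x1F
    line = "ABCD"[source_irq & 0x03]
    return device, line


def apic_pin(device, line):
    """Return the I/O APIC pin for a PCI device/INTx pair, or None if
    the table has no entry for it."""
    return PCI_ROUTES.get((device, line))


if __name__ == "__main__":
    # Device 11, INTA (the ahc0 controller at pci0.11.0).
    dev, line = decode_pci_source_irq((11 << 2) | 0)
    print(dev, line, apic_pin(dev, line))
    # Device 7, INTD (chip3 at pci0.7.2) has no row in the table.
    print(apic_pin(7, "D"))
```

Under these assumptions, device 11 INTA maps to pin 16 and device 18 INTA to pin 17, which agrees with "ahc0 ... int a irq 16 on pci0.11.0" and "ed1 ... int a irq 17 on pci0.18.0" in the dmesg output. Device 7 INTD has no table entry, which would be consistent with chip3 staying on irq 11, although the post does not say how that fallback is chosen.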
-----BEGIN PGP SIGNATURE-----
Version: PGP 5.5.5 - QDPGP 1.71a
Comment: http://community.wow.net/grt/qdpgp.html

iQA/AwUBNp0rhDy+pq3odvEWEQIN/wCfcaL5/FjwAdmSfoJyvDaS83S2yWoAoKIH
PPM963zgAzJyvLaB2hhmGgNO
=Lb4q
-----END PGP SIGNATURE-----

PGP Public key: http://www.compuland.com.br/helio/pgppublic.txt

From owner-freebsd-smp Thu Jan 14 13:39:04 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id NAA12704
        for freebsd-smp-outgoing; Thu, 14 Jan 1999 13:39:04 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from psv.oss.uswest.net (psv.oss.uswest.net [204.147.85.6])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id NAA12680;
        Thu, 14 Jan 1999 13:38:59 -0800 (PST)
        (envelope-from greg@psv.oss.uswest.net)
Received: (from greg@localhost)
        by psv.oss.uswest.net (8.8.8/8.8.5) id PAA10389;
        Thu, 14 Jan 1999 15:37:50 -0600 (CST)
Message-ID:
X-Mailer: XFMail 1.3 [p0] on FreeBSD
X-Priority: 3 (Normal)
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 8bit
MIME-Version: 1.0
Date: Thu, 14 Jan 1999 15:37:50 -0600 (CST)
Reply-To: greg@uswest.net
Organization: US WEST !NTERACT
From: Greg Rowe
To: freebsd-smp@FreeBSD.ORG, freebsd-scsi@FreeBSD.ORG
Subject: Fatal Trap - Update
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

I've done some more testing on this problem and have at least narrowed it down somewhat. The failure happens repeatedly by just un-tarring a tar file. No matter what the tar file contains, it seems to always drop into the debugger at approximately the same point in the file ??? I've done this around 30 times with different tar files and it always crashes within a file or two of the previous crash. It seems to only be able to handle X amount of IO from tar, but multiple copies of bonnie run fine.
The problem doesn't seem to be hardware related (at least failing hardware) as it happens on another system with the same mainboard and CPUs. The problem does not happen in non-SMP mode. The problem also does not exist in SMP mode on 3.0-RELEASE (straight CD install with SMP-GENERIC kernel), so the problem was introduced in -CURRENT. (I'm running yesterday's -CURRENT now.) Adaptec SCSI controller revision, number of drives, type of drives, and number of controllers do not affect the problem; I've tried it all ways. Also, the page fault will sometimes report cpuid=0 and sometimes cpuid=1. The problem doesn't exist on a quad Xeon system running the same -CURRENT.

So, I guess the question of the day is "what do a Tyan Tiger 100 mainboard, 3.0 CURRENT, SMP, and tar have in common?" Any ideas what I should try next?

Thanks,
Greg

-----FW: Fatal Trap-----

Date: Mon, 11 Jan 1999 16:48:53 -0600 (CST)
From: Greg Rowe
To: freebsd-scsi@FreeBSD.ORG, freebsd-smp@FreeBSD.ORG
Subject: Fatal Trap

I have a Tyan mb dual 400mhz system I'm building that's getting repeatable crashes under disk load. This didn't occur until I compiled an SMP kernel. I'm running -current as of last Friday and the system has 512mb of memory and 4 drives on Adaptec controllers (1 - 4gig on one controller and 3 - 9gig on the other). Untarring a file will, interestingly enough, crash the system at approximately the same place in the tar image every time, although it's crashed while building ports also.
The system drops to the debugger with the following:

Fatal trap 12: page fault while in kernel mode
mp_lock = 01000002; cpuid=1; lapic.id = 01000000 (I've also seen it for cpuid0)
fault virtual address   = 0x0
fault code              = supervisor write, page not present
instruction pointer     = 0x8:0xf0216357
stack pointer           = 0x10:0xff618ce8
frame pointer           = 0x10:0xff618d0c
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 243 (tar)
interrupt mask          = net tty bio cam <-SMP:XXX
kernel: type 12 trap, code 0
Stopped at generic_bzero+0xf: repe stosl %es:(%edi)

Any ideas as to what to look for ?

Thanks.
Greg

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-smp" in the body of the message

From owner-freebsd-smp Fri Jan 15 12:26:58 1999
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id MAA06207
        for freebsd-smp-outgoing; Fri, 15 Jan 1999 12:26:58 -0800 (PST)
        (envelope-from owner-freebsd-smp@FreeBSD.ORG)
Received: from psv.oss.uswest.net (psv.oss.uswest.net [204.147.85.6])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id MAA06189;
        Fri, 15 Jan 1999 12:26:55 -0800 (PST)
        (envelope-from greg@psv.oss.uswest.net)
Received: (from greg@localhost)
        by psv.oss.uswest.net (8.8.8/8.8.5) id OAA13011;
        Fri, 15 Jan 1999 14:26:47 -0600 (CST)
Message-ID:
X-Mailer: XFMail 1.3 [p0] on FreeBSD
X-Priority: 3 (Normal)
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 8bit
MIME-Version: 1.0
Date: Fri, 15 Jan 1999 14:26:46 -0600 (CST)
Reply-To: greg@uswest.net
Organization: US WEST !NTERACT
From: Greg Rowe
To: freebsd-smp@FreeBSD.ORG, freebsd-scsi@FreeBSD.ORG
Subject: More - Fatal Trap - Update
Sender: owner-freebsd-smp@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

I found today that cpio will also cause the crash in the exact same way. It's interesting that bonnie won't though since that's what we usually use to test for IO problems.
Greg
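A minimal sketch of how the "fault code" line in the trap report quoted in this thread can be read. It assumes the standard i386 page-fault error-code bits (bit 0: protection violation vs. page not present, bit 1: write vs. read, bit 2: user vs. supervisor); the constant names follow FreeBSD's PGEX_* convention, while describe_fault is a hypothetical helper written only for this illustration.

```python
# i386 page-fault error-code bits (low three bits only).
PGEX_P = 0x01  # set: protection violation; clear: page not present
PGEX_W = 0x02  # set: write access;         clear: read access
PGEX_U = 0x04  # set: fault in user mode;   clear: supervisor mode


def describe_fault(code):
    """Render a page-fault error code in the wording of the trap report."""
    mode = "user" if code & PGEX_U else "supervisor"
    access = "write" if code & PGEX_W else "read"
    cause = "protection violation" if code & PGEX_P else "page not present"
    return f"{mode} {access}, {cause}"


if __name__ == "__main__":
    # Only the write bit set: a kernel-mode write to an unmapped page.
    print(describe_fault(0x02))  # supervisor write, page not present
```

Read this way, the report describes a kernel-mode write to an unmapped page. Together with a fault virtual address of 0x0 and a stop inside generic_bzero, that would be consistent with bzero() being handed a NULL destination pointer, though nothing in the thread confirms the cause.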