Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 22 Mar 2007 10:30:17 -0400
From:      Phillip Neumann <pneumann@gmail.com>
To:        FreeBSD Stable <freebsd-stable@freebsd.org>
Subject:   Amd64 Unstable Areca
Message-ID:  <460292F9.3050802@gmail.com>

next in thread | raw e-mail | index | archive | help
This is a multi-part message in MIME format.
--------------000001060205060509090907
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit

Dear FreeBSD-stable...

My amd64 box is not very stable.
In its hardware list, you can see there is an areca 1210 card, wich 
suffer the errata of 6.2-release (high load crash)

Last week or so, i saw a commit where the areca bugs were fixed, so i 
updated the system.

I can still see the mashine crashing under load

attached are dmesg -a, and a simple 'bt' of kgdb.

sometimes (under load) i see this message:
Interrupt storm detected on "swi2:"; throttling interrupt source

i get the same behaviour with sched_bsd or sched_ule


Is this info useful to determine where the problem is?
If so, where is it?  :-)
Has this something to do with the fs? or the scheduler?


thanks!


killfill.



--------------000001060205060509090907
Content-Type: text/plain;
 name="kgdb.txt"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline;
 filename="kgdb.txt"

[root@worm /usr/obj/usr/src/sys/WORM]# kgdb kernel.debug /var/crash/vmcore.1 
[GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"]
GNU gdb 6.1.1 [FreeBSD]
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "amd64-marcel-freebsd".

Unread portion of the kernel message buffer:
panic: handle_workitem_remove: lost inodedep
cpuid = 0
Uptime: 2h19m9s
Dumping 2047 MB (2 chunks)
  chunk 0: 1MB (156 pages) ... ok
  chunk 1: 2047MB (524016 pages) 2031 2015 1999 1983 1967 1951 1935 1919 1903 1887 1871 1855 1839 1823 1807 1791 1775 1759 1743 1727 1711 1695 1679 1663 1647 1631 1615 1599 1583 1567 1551 1535 1519 1503 1487 1471 1455 1439 1423 1407 1391 1375 1359 1343 1327 1311 1295 1279 1263 1247 1231 1215 1199 1183 1167 1151 1135 1119 1103 1087 1071 1055 1039 1023 1007 991 975 959 943 927 911 895 879 863 847 831 815 799 783 767 751 735 719 703 687 671 655 639 623 607 591 575 559 543 527 511 495 479 463 447 431 415 399 383 367 351 335 319 303 287 271 255 239 223 207 191 175 159 143 127 111 95 79 63 47 31 15

#0  doadump () at pcpu.h:172
172             __asm __volatile("movq %%gs:0,%0" : "=r" (td));
(kgdb) bt
#0  doadump () at pcpu.h:172
#1  0x0000000000000004 in ?? ()
#2  0xffffffff804085e7 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:409
#3  0xffffffff80408c81 in panic (fmt=0xffffff007a9cd980 "&#65533;VTx")
    at /usr/src/sys/kern/kern_shutdown.c:565
#4  0xffffffff80589fc6 in handle_workitem_remove (dirrem=0xffffff001a6c7b40, xp=0x0)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:3599
#5  0xffffffff8058a3dc in process_worklist_item (mp=0xffffff0031482630, flags=0)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:962
#6  0xffffffff8058fd3d in softdep_process_worklist (mp=0xffffff0031482630, full=0)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:851
#7  0xffffffff80590031 in softdep_flush () at /usr/src/sys/ufs/ffs/ffs_softdep.c:762
#8  0xffffffff803ed647 in fork_exit (callout=0xffffffff8058fee0 <softdep_flush>, arg=0x0, 
    frame=0xffffffffb49abc50) at /usr/src/sys/kern/kern_fork.c:821
#9  0xffffffff80615b0e in fork_trampoline () at /usr/src/sys/amd64/amd64/exception.S:394
#10 0x0000000000000000 in ?? ()
#11 0x0000000000000000 in ?? ()
#12 0x0000000000000001 in ?? ()
#13 0x0000000000000000 in ?? ()
#14 0x0000000000000000 in ?? ()
#15 0x0000000000000000 in ?? ()
#16 0x0000000000000000 in ?? ()
#17 0x0000000000000000 in ?? ()
#18 0x0000000000000000 in ?? ()
#19 0x0000000000000000 in ?? ()
#20 0x0000000000000000 in ?? ()
#21 0x0000000000000000 in ?? ()
#22 0x0000000000000000 in ?? ()
#23 0x0000000000000000 in ?? ()
#24 0x0000000000000000 in ?? ()
#25 0x0000000000000000 in ?? ()
#26 0x0000000000000000 in ?? ()
#27 0x0000000000000000 in ?? ()
#28 0x0000000000000000 in ?? ()
#29 0x0000000000000000 in ?? ()
#30 0x0000000000000000 in ?? ()
#31 0x0000000000000000 in ?? ()
#32 0x0000000000000000 in ?? ()
---Type <return> to continue, or q <return> to quit--- 
#33 0x0000000000000000 in ?? ()
#34 0x0000000000000000 in ?? ()
#35 0x0000000000000000 in ?? ()
#36 0x0000000000000000 in ?? ()
#37 0x0000000000000000 in ?? ()
#38 0x0000000000000000 in ?? ()
#39 0x0000000000000000 in ?? ()
#40 0x0000000000000000 in ?? ()
#41 0x0000000000000000 in ?? ()
#42 0x0000000000b7d000 in ?? ()
#43 0x0000000000000000 in ?? ()
#44 0xffffff002d080600 in ?? ()
#45 0x0000000000000000 in ?? ()
#46 0xffffff00785456b0 in ?? ()
#47 0xffffff007a9f2000 in ?? ()
#48 0xffffffffb49ab878 in ?? ()
#49 0xffffff007a9cd980 in ?? ()
#50 0xffffffff8041ed56 in sched_switch (td=0x0, newtd=0x0, flags=1)
    at /usr/src/sys/kern/sched_4bsd.c:973
#51 0x0000000000000000 in ?? ()
#52 0x0000000000000000 in ?? ()
#53 0x0000000000000000 in ?? ()
#54 0x0000000000000000 in ?? ()
#55 0x0000000000000000 in ?? ()
#56 0x0000000000000000 in ?? ()
#57 0x0000000000000000 in ?? ()
#58 0x0000000000000000 in ?? ()
#59 0x0000000000000000 in ?? ()
#60 0x0000000000000000 in ?? ()
#61 0x0000000000000000 in ?? ()
#62 0x0000000000000000 in ?? ()
#63 0x0000000000000000 in ?? ()
#64 0x0000000000000000 in ?? ()
#65 0x0000000000000000 in ?? ()
#66 0x0000000000000000 in ?? ()
#67 0x0000000000000000 in ?? ()
#68 0x0000000000000000 in ?? ()
#69 0x0000000000000000 in ?? ()
---Type <return> to continue, or q <return> to quit---
#70 0x0000000000000000 in ?? ()
#71 0x0000000000000000 in ?? ()
#72 0x0000000000000000 in ?? ()
#73 0x0000000000000000 in ?? ()
#74 0x0000000000000000 in ?? ()
#75 0x0000000000000000 in ?? ()
#76 0x0000000000000000 in ?? ()
#77 0x0000000000000000 in ?? ()
#78 0x0000000000000000 in ?? ()
#79 0x0000000000000000 in ?? ()
#80 0x0000000000000000 in ?? ()
#81 0x0000000000000000 in ?? ()
#82 0x0000000000000000 in ?? ()
#83 0x0000000000000000 in ?? ()
#84 0x0000000000000000 in ?? ()
#85 0x0000000000000000 in ?? ()
#86 0x0000000000000000 in ?? ()
#87 0x0000000000000000 in ?? ()
#88 0x0000000000000000 in ?? ()
#89 0x0000000000000000 in ?? ()
#90 0x0000000000000000 in ?? ()
#91 0x0000000000000000 in ?? ()
#92 0x0000000000000000 in ?? ()
#93 0x0000000000000000 in ?? ()
#94 0x0000000000000000 in ?? ()
#95 0x0000000000000000 in ?? ()
#96 0x0000000000000000 in ?? ()
#97 0x0000000000000000 in ?? ()
#98 0x0000000000000000 in ?? ()
#99 0x0000000000000000 in ?? ()
#100 0x0000000000000000 in ?? ()
#101 0x0000000000000000 in ?? ()
#102 0x0000000000000000 in ?? ()
#103 0x0000000000000000 in ?? ()
#104 0x0000000000000000 in ?? ()
#105 0x0000000000000000 in ?? ()
#106 0x0000000000000000 in ?? ()
#107 0x0000000000000000 in ?? ()
---Type <return> to continue, or q <return> to quit---
#108 0x0000000000000000 in ?? ()
#109 0x0000000000000000 in ?? ()
#110 0x0000000000000000 in ?? ()
#111 0x0000000000000000 in ?? ()
#112 0x0000000000000000 in ?? ()
#113 0x0000000000000000 in ?? ()
#114 0x0000000000000000 in ?? ()
#115 0x0000000000000000 in ?? ()
#116 0x0000000000000000 in ?? ()
#117 0x0000000000000000 in ?? ()
#118 0x0000000000000000 in ?? ()
#119 0x0000000000000000 in ?? ()
#120 0x0000000000000000 in ?? ()
#121 0x0000000000000000 in ?? ()
#122 0x0000000000000000 in ?? ()
Cannot access memory at address 0xffffffffb49ac000

--------------000001060205060509090907
Content-Type: text/plain;
 name="dmesg.txt"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline;
 filename="dmesg.txt"

Copyright (c) 1992-2007 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 6.2-STABLE #1: Mon Mar 19 20:24:34 CLT 2007
    root@worm.sofsis.cl:/usr/obj/usr/src/sys/WORM
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Athlon(tm) 64 X2 Dual Core Processor 4600+ (2411.12-MHz K8-class CPU)
  Origin = "AuthenticAMD"  Id = 0x40fb2  Stepping = 2
  Features=0x178bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT>
  Features2=0x2001<SSE3,CX16>
  AMD Features=0xea500800<SYSCALL,NX,MMX+,FFXSR,RDTSCP,LM,3DNow!+,3DNow!>
  AMD Features2=0x1f<LAHF,CMP,SVM,ExtAPIC,CR8>
  Cores per package: 2
real memory  = 2147418112 (2047 MB)
avail memory = 2044805120 (1950 MB)
ACPI APIC Table: <Nvidia AWRDACPI>
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
ioapic0: Changing APIC ID to 2
ioapic0 <Version 1.1> irqs 0-23 on motherboard
kbd1 at kbdmux0
ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
acpi0: <Nvidia AWRDACPI> on motherboard
acpi_bus_number: can't get _ADR
acpi_bus_number: can't get _ADR
acpi0: Power Button (fixed)
acpi_bus_number: can't get _ADR
acpi_bus_number: can't get _ADR
acpi_bus_number: can't get _ADR
acpi_bus_number: can't get _ADR
acpi_bus_number: can't get _ADR
acpi_bus_number: can't get _ADR
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0
cpu0: <ACPI CPU> on acpi0
cpu1: <ACPI CPU> on acpi0
acpi_button0: <Power Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pci0: <memory> at device 0.0 (no driver attached)
isab0: <PCI-ISA bridge> at device 1.0 on pci0
isa0: <ISA bus> on isab0
pci0: <serial bus, SMBus> at device 1.1 (no driver attached)
ohci0: <OHCI (generic) USB controller> mem 0xfe02f000-0xfe02ffff irq 21 at device 2.0 on pci0
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 10 ports with 10 removable, self powered
ehci0: <NVIDIA nForce4 USB 2.0 controller> mem 0xfeb00000-0xfeb000ff irq 22 at device 2.1 on pci0
ehci0: [GIANT-LOCKED]
usb1: EHCI version 1.0
usb1: companion controller, 4 ports each: usb0
usb1: <NVIDIA nForce4 USB 2.0 controller> on ehci0
usb1: USB revision 2.0
uhub1: nVidia EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub1: 10 ports with 10 removable, self powered
pci0: <multimedia, audio> at device 4.0 (no driver attached)
atapci0: <nVidia nForce CK804 UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xe800-0xe80f at device 6.0 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
atapci1: <nVidia nForce CK804 SATA300 controller> port 0x9f0-0x9f7,0xbf0-0xbf3,0x970-0x977,0xb70-0xb73,0xd400-0xd40f mem 0xfe02c000-0xfe02cfff irq 21 at device 7.0 on pci0
ata2: <ATA channel 0> on atapci1
ata3: <ATA channel 1> on atapci1
atapci2: <nVidia nForce CK804 SATA300 controller> port 0x9e0-0x9e7,0xbe0-0xbe3,0x960-0x967,0xb60-0xb63,0xc000-0xc00f mem 0xfe02b000-0xfe02bfff irq 22 at device 8.0 on pci0
ata4: <ATA channel 0> on atapci2
ata5: <ATA channel 1> on atapci2
pcib1: <ACPI PCI-PCI bridge> at device 9.0 on pci0
pci1: <ACPI PCI bus> on pcib1
fwohci0: <VIA Fire II (VT6306)> port 0xac00-0xac7f mem 0xfc7ff000-0xfc7ff7ff irq 16 at device 6.0 on pci1
fwohci0: OHCI version 1.10 (ROM=1)
fwohci0: No. of Isochronous channels is 4.
fwohci0: EUI64 00:01:29:20:00:05:92:fd
fwohci0: Phy 1394a available S400, 2 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: <IEEE1394(FireWire) bus> on fwohci0
fwe0: <Ethernet over FireWire> on firewire0
if_fwe0: Fake Ethernet address: 02:01:29:05:92:fd
fwe0: Ethernet address: 02:01:29:05:92:fd
fwe0: if_start running deferred for Giant
sbp0: <SBP-2/SCSI over FireWire> on firewire0
fwohci0: Initiate bus reset
fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
pci1: <display, VGA> at device 8.0 (no driver attached)
nve0: <NVIDIA nForce MCP9 Networking Adapter> port 0xbc00-0xbc07 mem 0xfe02a000-0xfe02afff irq 23 at device 10.0 on pci0
nve0: Ethernet address 00:01:29:d6:b3:1c
miibus0: <MII bus> on nve0
ukphy0: <Generic IEEE 802.3u media interface> on miibus0
ukphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
nve0: Ethernet address: 00:01:29:d6:b3:1c
pcib2: <ACPI PCI-PCI bridge> at device 11.0 on pci0
pci2: <ACPI PCI bus> on pcib2
pcib3: <ACPI PCI-PCI bridge> at device 12.0 on pci0
pci3: <ACPI PCI bus> on pcib3
pcib4: <ACPI PCI-PCI bridge> at device 13.0 on pci0
pci4: <ACPI PCI bus> on pcib4
pcib5: <ACPI PCI-PCI bridge> at device 14.0 on pci0
pci5: <ACPI PCI bus> on pcib5
pcib6: <PCI-PCI bridge> at device 0.0 on pci5
pci6: <PCI bus> on pcib6
arcmsr0: <Areca SATA Host Adapter RAID Controller 
> mem 0xfd8ff000-0xfd8fffff,0xfcc00000-0xfcffffff irq 16 at device 14.0 on pci6
ARECA RAID ADAPTER0: Driver Version 1.20.00.14 2007-2-05 
ARECA RAID ADAPTER0: FIRMWARE VERSION V1.42 2006-10-13 
pcib7: <PCI-PCI bridge> at device 0.2 on pci5
pci7: <PCI bus> on pcib7
acpi_tz0: <Thermal Zone> on acpi0
acpi_tz0: _CRT value is absurd, ignored (1134.7C)
acpi_tz0: _PSV value is absurd, ignored (1132.7C)
acpi_tz0: _ACx value is absurd, ignored (1132.7C)
speaker0: <PC speaker> port 0x61 on acpi0
fdc0: <floppy drive controller> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <Generic IRDA-compatible device> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: <Standard parallel printer port> port 0x378-0x37f irq 7 on acpi0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xc97ff,0xca000-0xcafff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 1.000 msec
pass4 at arcmsr0 bus 0 target 16 lun 0
pass4: <Areca RAID controller R001> Fixed Processor SCSI-0 device 
da0 at arcmsr0 bus 0 target 0 lun 0
da0: <Areca Raid5_Sistema R001> Fixed Direct Access SCSI-5 device 
da0: 166.666MB/s transfers (83.333MHz, offset 32, 16bit), Tagged Queueing Enabled
da0: 30517MB (62499840 512 byte sectors: 255H 63S/T 3890C)
da1 at arcmsr0 bus 0 target 0 lun 1
da1: <Areca Raid5_Data R001> Fixed Direct Access SCSI-5 device 
da1: 166.666MB/s transfers (83.333MHz, offset 32, 16bit), Tagged Queueing Enabled
da1: 114440MB (234373632 512 byte sectors: 255H 63S/T 14589C)
da2 at arcmsr0 bus 0 target 0 lun 2
da2: <Areca Raid5_Jaulas R001> Fixed Direct Access SCSI-5 device 
da2: 166.666MB/s transfers (83.333MHz, offset 32, 16bit), Tagged Queueing Enabled
da2: 143050MB (292967424 512 byte sectors: 255H 63S/T 18236C)
da3 at arcmsr0 bus 0 target 0 lun 3
da3: <Areca Raid0_18G R001> Fixed Direct Access SCSI-5 device 
da3: 166.666MB/s transfers (83.333MHz, offset 32, 16bit), Tagged Queueing Enabled
da3: 17166MB (35155968 512 byte sectors: 255H 63S/T 2188C)
acpi_tz0: _CRT value is absurd, ignored (1134.7C)
acpi_tz0: _PSV value is absurd, ignored (1132.7C)
acpi_tz0: _ACx value is absurd, ignored (1132.7C)
acpi_tz0: _CRT value is absurd, ignored (1134.7C)
acpi_tz0: _PSV value is absurd, ignored (1132.7C)
acpi_tz0: _ACx value is absurd, ignored (1132.7C)
SMP: AP CPU #1 Launched!
Trying to mount root from ufs:/dev/da0s1a


--------------000001060205060509090907--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?460292F9.3050802>