From: Martin Blapp
Date: Sat, 31 Jul 1999 15:57:26 +0200 (CEST)
To: freebsd-current@freebsd.org
Subject: network got stuck during high nfs-load

Hi driver gurus,

Today I had a heavy NFS load on my CURRENT-SMP machine while compiling
world. Suddenly all NFS clients stopped working. I wasn't able to ping
the server from inside my /29 subnet, nor could I ping the other hosts
from the server itself.

I changed cables, plugged them in and out, did some ifconfig up/down,
deleted routes and added them again ... nothing.

But the card was still transmitting from time to time (about once a
second), at 100% load, many frames containing only zero bytes, coming
in waves:

(tcpdump from the same subnet; MAC address 0:a0:cc:66:32:97 is the NFS
server)

--------------------------------------------------------------------------

13:53:41.673733 0:a0:cc:66:32:97 > 0:a0:cc:66:32:97 null test/C len=43
                 0000 0000 0000 0000 0000 0000 0000 0000
                 0000 0000 0000 0000 0000 0000 0000 0000
                 0000 0000 0000 0000 0000 00
13:53:41.773706 0:a0:cc:66:32:97 > 0:a0:cc:66:32:97 null test/C len=43
                 0000 0000 0000 0000 0000 0000 0000 0000
                 0000 0000 0000 0000 0000 0000 0000 0000
                 0000 0000 0000 0000 0000 00

It looks like an endless loop somewhere in the de0 driver ...

I was able to ping the machine from itself via loopback and via the
local IP. There were no stuck processes at all; the make buildworld
was still running.
Also, the load was normal; I couldn't see any unusual behaviour ...

I panicked the server and made a crash dump. If anybody is interested
in this strange behaviour, just write back to me ...

After a reboot everything worked again, but that's not a good
solution ;)

Martin

PS: cvsup is from yesterday (30.7.1999) at 22:00 CET, and I have
included a backtrace and a dmesg from my server ...

backtrace from debug.kernel
---------------------------

#0  boot (howto=260) at ../../kern/kern_shutdown.c:291
#1  0xc01538c9 in panic (fmt=0xc0277c14 "from debugger")
    at ../../kern/kern_shutdown.c:505
#2  0xc012df19 in db_panic (addr=-1071398025, have_addr=0, count=-1,
    modif=0xff80dddc "") at ../../ddb/db_command.c:434
#3  0xc012deb7 in db_command (last_cmdp=0xc02a6f30, cmd_table=0xc02a6d90,
    aux_cmd_tablep=0xc02c572c) at ../../ddb/db_command.c:334
#4  0xc012df7e in db_command_loop () at ../../ddb/db_command.c:456
#5  0xc012ffbb in db_trap (type=3, code=0) at ../../ddb/db_trap.c:71
#6  0xc023c0c2 in kdb_trap (type=3, code=0, regs=0xff80ded0)
    at ../../i386/i386/db_interface.c:157
#7  0xc02524a0 in trap (frame={tf_fs = 24, tf_es = 16, tf_ds = 16, tf_edi = 0,
      tf_esi = -1070741376, tf_ebp = -8331496, tf_isp = -8331524,
      tf_ebx = -1070741376, tf_edx = -1070689184, tf_ecx = 16777217,
      tf_eax = 38, tf_trapno = 3, tf_err = 0, tf_eip = -1071398025, tf_cs = 8,
      tf_eflags = 582, tf_esp = -1071030749, tf_ss = -1071045718})
    at ../../i386/i386/trap.c:534
#8  0xc023c377 in Debugger (msg=0xc02923aa "manual escape to debugger")
    at machine/cpufunc.h:64
#9  0xc0236d7c in scgetc (sc=0xc02c3ca0, flags=2)
    at ../../dev/syscons/syscons.c:3813
#10 0xc0232529 in sckbdevent (thiskbd=0xc02db0a0, event=0, arg=0xc02c3ca0)
    at ../../dev/syscons/syscons.c:688
#11 0xc022bd77 in atkbd_intr (kbd=0xc02db0a0, arg=0x0)
    at ../../dev/kbd/atkbd.c:535
#12 0xc02692d7 in atkbd_isa_intr (arg=0xc05a2100)
    at ../../isa/atkbd_isa.c:125

dmesg-output
------------

Copyright (c) 1992-1999 The FreeBSD Project.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California. All rights reserved.
FreeBSD 4.0-CURRENT #1: Fri Jul 30 21:31:06 CEST 1999
    root@fuchur.lan.attic.ch:/usr/src/sys/compile/FUCHUR
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium/P55C (586-class CPU)
  Origin = "GenuineIntel" Id = 0x543 Stepping = 3
  Features=0x8003bf
real memory = 67108864 (65536K bytes)
avail memory = 61652992 (60208K bytes)
Programming 24 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00030010, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00030010, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc0340000.
Intel Pentium detected, installing workaround for F00F bug
Probing for PnP devices:
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
pci0: on pcib0
isab0: at device 7.0 on pci0
ide_pci0: at device 7.1 on pci0
de0: irq 17 at device 9.0 on pci0
de0: 21140A [10-100Mb/s] pass 2.0
de0: address 00:a0:cc:66:32:97
vga-pci0: irq 18 at device 10.0 on pci0
ahc0: irq 19 at device 12.0 on pci0
ahc0: Using left over BIOS settings
ahc0: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs
isa0: on motherboard
fdc0: at port 0x3f0-0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
wdc0 at port 0x1f0-0x1f7 irq 14 on isa0
wdc0: unit 0 (atapi): , removable, accel, ovlap, dma, iordis
wcd0: drive speed 5512KB/sec, 256KB cache
wcd0: supported read types: CD-R, CD-RW, CD-DA
wcd0: Audio: play, 255 volume levels
wcd0: Mechanism: ejectable tray
wcd0: Medium: CD-ROM 120mm data disc loaded, unlocked
wdc0: unit 1 (atapi): , removable, intr, iordis
wfd0: medium type unknown (no disk)
wfd0: buggy Zip drive, 64-block transfer limit set
atkbdc0: at port 0x60-0x6f on isa0
atkbd0: irq 1 on atkbdc0
vga0: at port 0x3b0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: VGA <16 virtual consoles, flags=0x200>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
ppc0 at port 0x378-0x37f irq 7 flags 0x40 on isa0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/16 bytes threshold
ppb0: IEEE1284 device found /NIBBLE
Probing for PnP devices on ppbus0:
ppbus0: HP ENHANCED PCL5,PJL,POSTSCRIPT
plip0: on ppbus 0
lpt0: on ppbus 0
lpt0: Interrupt-driven port
ppi0: on ppbus 0
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: routing 8254 via pin 2
IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to accept, logging limited to 100 packets/entry
DUMMYNET initialized (990504)
BRIDGE 981214, have 6 interfaces
-- index 1 de0 type 6 phy 0 addrl 6 addr 00.a0.cc.66.32.97
IP Filter: initialized. Default = pass all, Logging = enabled
Waiting 15 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
de0: enabling 10baseT port