From owner-freebsd-questions Fri Sep 13 02:20:16 1996 Return-Path: owner-questions Received: (from root@localhost) by freefall.freebsd.org (8.7.5/8.7.3) id CAA07809 for questions-outgoing; Fri, 13 Sep 1996 02:20:16 -0700 (PDT) Received: from admin.cyberenet.net (root@admin.cyberenet.net [204.213.252.6]) by freefall.freebsd.org (8.7.5/8.7.3) with SMTP id CAA07797 for ; Fri, 13 Sep 1996 02:20:13 -0700 (PDT) Received: from twwells.com by admin.cyberenet.net with smtp (Smail3.1.29.1 #7) id m0v1UPy-000O3sC; Fri, 13 Sep 96 05:20 EDT Received: by twwells.com (Smail3.1.29.1 #8) id m0v1UO4-00011uC; Fri, 13 Sep 96 05:18 EDT Message-Id: From: bill@twwells.com (T. William Wells) Subject: gratuitous system hangs -- 2.1.5 To: freebsd-questions@freebsd.org Date: Fri, 13 Sep 1996 05:18:12 -0400 (EDT) X-Mailer: ELM [version 2.4 PL25] Content-Type: text Sender: owner-questions@freebsd.org X-Loop: FreeBSD.org Precedence: bulk I posted the bulk of this message before but got no answers.... In addition to the original post, which I've included at the end of this new text, I have this (slightly edited) vmstat. Yes, the vmstat kept running through the crash; I got this output by having been telnetted in and capturing the vmstat on another machine. procs memory page disks faults cpu r b w avm fre flt re pi po fr sr s0 s1 in sy cs us sy id 3 0 0 421488 5144 619 17 1 0 616 0 4 13 434 1299 189 13 25 62 2 1 0 397596 5048 532 3 1 0 535 0 4 12 481 1062 189 9 15 76 2 0 0 379872 5736 439 3 0 0 464 0 2 10 480 932 195 8 15 77 4 1 0 401056 4832 1213 19 3 0 1146 0 3 18 544 1985 262 14 31 55 **presumably, this is where things started going wrong.... 19 36 0 610428 1664 1182 17 6 0 1039 0 19 45 564 1650 296 22 38 41 60 37 3 764756 220 1374 0 14 47 1076 6787 45 56 769 2394 518 11 45 43 16 45 21 704324 2228 629 0 17 36 636 4358 48 49 590 1601 433 15 39 46 13 34 17 664476 5336 34 1 3 51 101 4375 55 47 515 10320 210 7 38 56 14 36 0 544712 8192 48 1 3 18 53 7460 22 1 374 413 127 2 22 75 14 36 0 516176 7940 19 0 1 0 10 0 2 1 359 456 137 1 6 93 14 36 0 376420 7916 11 0 2 0 15 0 5 2 356 439 137 3 8 89 14 36 0 363124 8120 8 1 1 0 25 0 3 2 358 544 131 6 8 86 14 36 0 322944 8476 22 0 0 0 42 0 0 1 316 261 80 2 3 95 14 36 0 318748 9440 61 0 1 0 107 0 10 12 347 345 103 2 8 90 15 37 0 307268 10072 51 0 4 0 88 0 4 2 337 325 104 2 5 93 15 37 0 296872 10028 3 0 1 0 2 0 0 1 326 307 76 3 5 92 15 37 0 311468 9052 48 0 10 0 37 0 4 9 355 428 117 4 7 89 17 38 1 313476 9384 24 0 2 0 44 0 1 1 326 335 87 3 4 93 15 37 0 303860 9496 59 0 7 0 67 0 7 2 338 345 93 3 6 91 16 38 0 300732 9408 9 0 2 0 5 0 6 12 363 421 115 4 6 90 16 38 0 286192 9800 23 0 1 0 43 0 0 0 297 278 73 3 5 92 16 38 0 273460 10236 8 0 1 0 31 0 0 2 291 182 47 1 2 97 One other, possibly irrelevant, datum. There seems to be an unusual number of processes in "lock_write" state, says the results of a ps taken via ddb during the crash. ------------ I've been having some unexplained system hangs. These occured in 2.1 and continue in 2.1.5. The symptoms are these: processes that are running keep running. Processes cannot start. Processes appear to be unable to exit. For example, if I'm running a shell I can (always? I can't say for sure) run internal shell commands but as soon as I try to run anything external, it hangs. I run a radius server on that machine; it keeps responding to requests. There are no console messages. There are no abnormal log entries. The disk is still active. The behavior of the system does change with time -- the radius server stops responding eventually; when telnetting in, instead of a connected-to message and a hang, it just doesn't connect. The only remedy is the reset switch.... I've included the boot messages and the config. Any ideas? FreeBSD 2.1.5-RELEASE #0: Tue Aug 27 21:14:03 EDT 1996 root@ux1.cyberenet.net:/usr/src/sys/compile/UX1 CPU: 133-MHz Pentium 735\90 or 815\100 (Pentium-class CPU) Origin = "GenuineIntel" Id = 0x52b Stepping=11 Features=0x1bf real memory = 67108864 (65536K bytes) avail memory = 63217664 (61736K bytes) Probing for devices on PCI bus 0: chip0 rev 2 on pci0:0 chip1 rev 2 on pci0:7:0 chip2 rev 2 on pci0:7:1 de0 rev 17 int a irq 5 on pci0:13 de0: DC21041 [10Mb/s] pass 1.1 Ethernet address 00:00:c0:3b:ae:c7 Probing for devices on the ISA bus: sc0 at 0x60-0x6f irq 1 on motherboard sc0: VGA color <16 virtual consoles, flags=0x0> sio0 not found at 0x3f8 sio1 at 0x2f8-0x2ff irq 3 on isa sio1: type 16550A lpt0 not found at 0xffffffff fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa fdc0: NEC 72065B fd0: 1.44MB 3.5in wdc0 not found at 0x1f0 wdc1 not found at 0x170 aha0 at 0x330-0x333 irq 11 drq 5 on isa aha0 waiting for scsi devices to settle (aha0:0:0): "FUJITSU M2694ES-512 8134" type 0 fixed SCSI 1 sd0(aha0:0:0): Direct-Access 1033MB (2117025 512 byte sectors) (aha0:1:0): "QUANTUM XP34301 1071" type 0 fixed SCSI 2 sd1(aha0:1:0): Direct-Access 4106MB (8410200 512 byte sectors) npx0 on motherboard npx0: INT 16 interface changing root device to sd0a WARNING: / was not properly dismounted. de0: enabling Thinwire/BNC port machine "i386" cpu "I486_CPU" cpu "I586_CPU" ident "UX1" maxusers 64 options INET #InterNETworking options FFS #Berkeley Fast Filesystem options "CD9660" #ISO 9660 Filesystem options PROCFS #Process filesystem options "COMPAT_43" #Compatible with BSD 4.3 options "FAT_CURSOR" options "SCSI_DELAY=15" #Be pessimistic about Joe SCSI device options BOUNCE_BUFFERS #include support for DMA bounce buffers options DODUMP options DDB options "NMBCLUSTERS=4096" # lots of mbufs! options "INITIAL_CBLOCKS=200" # start with more clists options SYSVSHM options SYSVSEM options SYSVMSG config kernel root on wd0 controller isa0 controller pci0 controller fdc0 at isa? port "IO_FD1" bio irq 6 drq 2 vector fdintr disk fd0 at fdc0 drive 0 disk fd1 at fdc0 drive 1 controller wdc0 at isa? port "IO_WD1" bio irq 14 vector wdintr disk wd0 at wdc0 drive 0 disk wd1 at wdc0 drive 1 controller wdc1 at isa? port "IO_WD2" bio irq 15 vector wdintr disk wd2 at wdc1 drive 0 disk wd3 at wdc1 drive 1 options ATAPI #Enable ATAPI support for IDE bus device wcd0 #IDE CD-ROM controller ahc0 controller aha0 at isa? port "IO_AHA0" bio irq ? drq 5 vector ahaintr controller scbus0 device sd0 device st0 device cd0 #Only need one of these, the code dynamically grows # syscons is the default console driver, resembling an SCO console device sc0 at isa? port "IO_KBD" tty irq 1 vector scintr device npx0 at isa? port "IO_NPX" irq 13 vector npxintr device sio0 at isa? port "IO_COM1" tty irq 4 vector siointr device sio1 at isa? port "IO_COM2" tty irq 3 vector siointr device lpt0 at isa? port? tty irq 7 vector lptintr device de0 pseudo-device loop pseudo-device ether pseudo-device log pseudo-device pty 64 pseudo-device gzip # Exec gzipped a.out's