Date:      Tue, 07 Sep 1999 19:16:44 +0100
From:      Alan Judge <Alan.Judge@indigo.ie>
To:        stable@freebsd.org
Subject:   newly introduced repeatable SMP hang in STABLE/3.3-RC
Message-ID:  <19990907181823.97FC914C44@hub.freebsd.org>

[Given the nearness of 3.3, I've put what I can quickly in a message
 here.  If opening a PR with more info would help, please let me
 know.]

I've just updated my SMP test machine to STABLE as of today and can
now repeatably hang the box.  Looks like it's stuck at some spl level,
as it will respond to pings, but all user processes are stuck.  More
info below.

The same hang appears in STABLE as of 27th August, so both snapshots
hang on an SMP platform.  STABLE as of 23rd July didn't hang in the
same way.

I can now repeatably hang a dual-CPU box (Dell 2300) running STABLE as
of this afternoon.  I suspect the hang is device/networking related,
as various user activities provoke it; doing make buildworld in an NFS
directory works a charm :-)

I think I can repeat this problem at will.  I can get into DDB and
have a core dump with matching kernel and debugging symbols, but I'm
not sure what information is useful when tracking SMP problems, and
I'm not familiar with the 386 MP architecture.  I'll happily try
anything people want to suggest.  Nothing in the FAQ/Handbook covers
what information is useful to gather for SMP problems.

The traceback in DDB and the core dump show the machine stuck in
MPgetlock(), but nothing useful before that.  Perhaps I can get at a
useful stack or frame elsewhere?  Anyway, I've attached what I can below.
--
Alan

quango# uname -a
FreeBSD quango.indigo.ie 3.3-RC FreeBSD 3.3-RC #0: Tue Sep  7 15:15:32 IST 1999     judgea@quango.indigo.ie:/f5/misc/FreeBSD/stable/src/sys/compile/QUANGO.SMP  i386
quango# gdb -k *.1
GNU gdb 4.18
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
IdlePTD 3198976
initial pcb at 2818bc
panicstr: from debugger
panic messages:
---
panic: from debugger
mp_lock = 0005d768; cpuid = 0; lapic.id = 01000000
boot() called on cpu#0

syncing disks... 30 29 14 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 done

dumping to dev 20401, offset 524288
dump 256 255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 
---
#0  boot (howto=256) at ../../kern/kern_shutdown.c:285
285                     dumppcb.pcb_cr3 = rcr3();
(kgdb) bt
#0  boot (howto=256) at ../../kern/kern_shutdown.c:285
#1  0xc015d199 in panic (fmt=0xc0248c24 "from debugger")
    at ../../kern/kern_shutdown.c:446
#2  0xc012d2d9 in db_panic (addr=-1071530509, have_addr=0, count=-1, 
    modif=0xcbbd7da4 "") at ../../ddb/db_command.c:432
#3  0xc012d279 in db_command (last_cmdp=0xc026953c, cmd_table=0xc026939c, 
    aux_cmd_tablep=0xc027efcc) at ../../ddb/db_command.c:332
#4  0xc012d33e in db_command_loop () at ../../ddb/db_command.c:454
#5  0xc012f7b7 in db_trap (type=3, code=0) at ../../ddb/db_trap.c:71
#6  0xc021bb95 in kdb_trap (type=3, code=0, regs=0xcbbd7e98)
    at ../../i386/i386/db_interface.c:157
#7  0xc022e108 in trap (frame={tf_es = -1071448048, tf_ds = -1070989296, 
      tf_edi = -1071589272, tf_esi = 134, tf_ebp = -876773668, 
      tf_isp = -876773696, tf_ebx = 0, tf_edx = -1071265453, tf_ecx = 1, 
      tf_eax = 38, tf_trapno = 3, tf_err = 0, tf_eip = -1071530509, tf_cs = 8, 
      tf_eflags = 582, tf_esp = -1071265469, tf_ss = -1071277614})
    at ../../i386/i386/trap.c:548
#8  0xc021bdf3 in Debugger (msg=0xc02599d2 "manual escape to debugger")
    at ../../i386/i386/db_interface.c:317
#9  0xc02163bc in scgetc (kbd=0xc0294c98, flags=2)
    at ../../dev/syscons/syscons.c:3947
#10 0xc0210e40 in sckbdevent (thiskbd=0xc0294c98, event=0, arg=0x0)
    at ../../dev/syscons/syscons.c:844
#11 0xc020d733 in atkbd_intr (kbd=0xc0294c98, arg=0x0)
    at ../../dev/kbd/atkbd.c:564
#12 0xc0233124 in atkbd_isa_intr (unit=0) at ../../i386/isa/atkbd_isa.c:84
#13 0xc0229747 in MPgetlock ()
#14 0x28171b75 in ?? ()
#15 0x2815a272 in ?? ()
#16 0x2815a049 in ?? ()
#17 0x281502b8 in ?? ()
#18 0x280e7960 in ?? ()
#19 0x8050021 in ?? ()
#20 0x804e0fd in ?? ()
#21 0x8051d94 in ?? ()
#22 0x8057c75 in ?? ()
#23 0x804bb11 in ?? ()
(kgdb) print/x mp_lock
$1 = 0x5d768
(kgdb) frame 13
#13 0xc0229747 in MPgetlock ()
(kgdb) i frame
Stack level 13, frame at 0xbfbfd09c:
 eip = 0xc0229747 in MPgetlock; saved eip 0x28171b75
 (FRAMELESS), called by frame at 0xbfbfd0b4, caller of frame at 0xcbbd7f4c
 Arglist at 0xbfbfd09c, args: 
 Locals at 0xbfbfd09c, Previous frame's sp is 0x0
 Saved registers:
  ebp at 0xbfbfd09c, eip at 0xbfbfd0a0
(kgdb) i reg
eax            0x0      0
ecx            0x0      0
edx            0x0      0
ebx            0x281bf494       672920724
esp            0xcbbd7db4       0xcbbd7db4
ebp            0xbfbfd09c       0xbfbfd09c
esi            0xbfbfd17c       -1077948036
edi            0x20     32
eip            0xc021d6ec       0xc021d6ec
eflags         0x0      0
cs             0x0      0
ss             0x0      0
ds             0x0      0
es             0x0      0
fs             0x27     39
gs             0x27     39
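
A note on reading the numbers above: kgdb prints the trap frame fields
as signed decimals, so reinterpreting them as unsigned 32-bit values
makes them comparable with the hex addresses in the backtrace.  The
Python sketch below does that, and also splits mp_lock and lapic.id
into fields.  The field layout used here (owner CPU in the top 8 bits
of mp_lock, nesting count in the low 24 bits, APIC ID in bits 24-27 of
lapic.id) is an assumption that is not stated in this message and
should be checked against the mplock source for the tree in question.
The helper names are hypothetical.

```python
# Sketch only: the masks below are ASSUMED, not taken from this message.
CPU_FIELD = 0xFF000000    # assumed: owner CPU id in the top 8 bits
COUNT_FIELD = 0x00FFFFFF  # assumed: nesting count in the low 24 bits
FREE_LOCK = 0xFFFFFFFF    # assumed: value of an unowned lock


def to_u32(value):
    """Reinterpret a signed 32-bit integer as unsigned."""
    return value & 0xFFFFFFFF


def split_lock(word):
    """Return (owner, count) for a lock word, or None if it is free."""
    if word == FREE_LOCK:
        return None
    return (word & CPU_FIELD) >> 24, word & COUNT_FIELD


def lapic_id(reg):
    """Extract the APIC ID, assuming it sits in bits 24-27 of the register."""
    return (reg >> 24) & 0x0F


if __name__ == "__main__":
    # Values copied from the dump above.
    print("tf_eip   =", hex(to_u32(-1071530509)))
    print("mp_lock  =", split_lock(0x0005D768))
    print("lapic.id =", lapic_id(0x01000000))
```

With the values from the dump, tf_eip comes out as 0xc021bdf3, which
is the same address as frame #8 (Debugger).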

