Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 4 Aug 2009 23:45:11 +0200
From:      Attilio Rao <attilio@freebsd.org>
To:        Stefan Bethke <stb@lassitu.de>
Cc:        freebsd-stable <freebsd-stable@freebsd.org>, FreeBSD Current <freebsd-current@freebsd.org>, Giovanni Trematerra <giovanni.trematerra@gmail.com>, Dan Naumov <dan.naumov@gmail.com>, barbara <barbara.xxx1975@libero.it>, "Bjoern A. Zeeb" <bz@freebsd.org>, Robert Watson <rwatson@freebsd.org>, "C. C. Tang" <hiyorin@gmail.com>
Subject:   Re: spinlock held too long on reboot
Message-ID:  <3bbf2fe10908041445j2dc4b480kd767f03c62b782c8@mail.gmail.com>
In-Reply-To: <3bbf2fe10907281943m2392a9f9w7c69303e6c3b91d0@mail.gmail.com>
References:  <746CE32B-BCF8-460A-982D-25341554E8FD@lassitu.de> <3bbf2fe10905221234k12c45932gb1e197143cd74b5d@mail.gmail.com> <FCCBA84A-1B9E-4C63-BD42-452565BE9292@lassitu.de> <20090522230333.X72053@maildrop.int.zabbadoz.net> <3bbf2fe10905221846q7fd1fe9cue744de61f9e12612@mail.gmail.com> <226F1AFF-45D8-4E4C-BE7F-D2EDC35EC8F6@lassitu.de> <3bbf2fe10907281943m2392a9f9w7c69303e6c3b91d0@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
2009/7/29 Attilio Rao <attilio@freebsd.org>:
> 2009/5/23 Stefan Bethke <stb@lassitu.de>:
>> I wrote:
>>
>>> Syncing disks, vnodes remaining...0 done
>>> All buffers synced.
>>> GEOM_MIRROR: Device diesel_root: provider mirror/diesel_root destroyed.
>>> Uptime: 6m32s
>>> GEOM_MIRROR: Device diesel_root destroyed.
>>> Rebooting...
>>> cpu_reset: Stopping other CPUs
>>> spin lock 0xffffffff8078c900 (sched lock 1) held by 0xffffff00014d4ab0
>>> (tid 100002) too long
>>> panic: spin lock held too long
>>> cpuid = 0
>>> KDB: enter: panic
>>> [thread pid 77 tid 100090 ]
>>> Stopped at      kdb_enter+0x3d: movq    $0,0x48bbd0(%rip)
>>> db> bt
>>> Tracing pid 77 tid 100090 td 0xffffff000457bab0
>>> kdb_enter() at kdb_enter+0x3d
>>> panic() at panic+0x17b
>>> _mtx_lock_spin_failed() at _mtx_lock_spin_failed+0x39
>>> _mtx_lock_spin() at _mtx_lock_spin+0x9e
>>> _mtx_lock_spin_flags() at _mtx_lock_spin_flags+0x72
>>> sched_balance_group() at sched_balance_group+0xc5
>>> sched_balance_group() at sched_balance_group+0x1f8
>>> sched_balance() at sched_balance+0xa2
>>> sched_clock() at sched_clock+0xf6
>>> statclock() at statclock+0xbd
>>> lapic_handle_timer() at lapic_handle_timer+0x197
>>> Xtimerint() at Xtimerint+0x8c
>>> --- interrupt, rip = 0xffffffff80541cc4, rsp = 0xffffff80771dba90, rbp =
>>> 0xffffff80771dbab0 ---
>>> DELAY() at DELAY+0x64
>>> cpu_reset() at cpu_reset+0xdd
>>> boot() at boot+0x2e6
>>> reboot() at reboot+0x42
>>> syscall() at syscall+0x1a5
>>> Xfast_syscall() at Xfast_syscall+0xd0
>>> --- syscall (55, FreeBSD ELF64, reboot), rip = 0x800788eec, rsp =
>>> 0x7fffffffeca8, rbp = 0 ---
>>
>>
>> I've only seen this once.  If I should encounter it again, is there
>> something you'd like me to look at?
>
> [ Sorry, trying to add anyone who alredy reported such a problem even
> if I know many of you experienced it on -STABLE]

If you are experiencing this problem, you would like to test this port
from rink@ on 7.2 of the new version of the patch:
http://people.freebsd.org/~rink/tmp/ipi_7stable.diff

while the -CURRENT version that probabilly is going to be committed
soon is here:
http://www.freebsd.org/~attilio/stop_nmi2.diff

Thanks,
Attilio


-- 
Peace can only be achieved by understanding - A. Einstein



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?3bbf2fe10908041445j2dc4b480kd767f03c62b782c8>