Date:      Thu, 18 Aug 2011 02:35:37 +0200
From:      Attilio Rao <attilio@freebsd.org>
To:        Hiroki Sato <hrs@freebsd.org>
Cc:        kostikbel@gmail.com, freebsd-stable@freebsd.org, avg@freebsd.org
Subject:   Re: panic: spin lock held too long (RELENG_8 from today)
Message-ID:  <CAJ-FndDtxKAKr_xZrCYDY0K=UAm=sUuUtHkZRZjBgrZYsus1pQ@mail.gmail.com>
In-Reply-To: <20110818.091600.831954331552558249.hrs@allbsd.org>
References:  <20110818.023832.373949045518579359.hrs@allbsd.org> <CAJ-FndCDOW0_B2MV0LZEo-tpEa9+7oAnJ7iHvKQsM4j4B0DLqg@mail.gmail.com> <20110818.043332.27079545013461535.hrs@allbsd.org> <20110818.091600.831954331552558249.hrs@allbsd.org>

2011/8/18 Hiroki Sato <hrs@freebsd.org>:
> Hiroki Sato <hrs@freebsd.org> wrote
>  in <20110818.043332.27079545013461535.hrs@allbsd.org>:
>
> hr> Attilio Rao <attilio@freebsd.org> wrote
> hr>   in <CAJ-FndCDOW0_B2MV0LZEo-tpEa9+7oAnJ7iHvKQsM4j4B0DLqg@mail.gmail.com>:
> hr>
> hr> at> 2011/8/17 Hiroki Sato <hrs@freebsd.org>:
> hr> at> > Hi,
> hr> at> >
> hr> at> > Mike Tancsa <mike@sentex.net> wrote
> hr> at> >  in <4E15A08C.6090407@sentex.net>:
> hr> at> >
> hr> at> > mi> On 7/7/2011 7:32 AM, Mike Tancsa wrote:
> hr> at> > mi> > On 7/7/2011 4:20 AM, Kostik Belousov wrote:
> hr> at> > mi> >>
> hr> at> > mi> >> BTW, we had a similar panic, "spinlock held too long", the spinlock
> hr> at> > mi> >> is the sched lock N, on busy 8-core box recently upgraded to the
> hr> at> > mi> >> stable/8. Unfortunately, machine hung dumping core, so the stack trace
> hr> at> > mi> >> for the owner thread was not available.
> hr> at> > mi> >>
> hr> at> > mi> >> I was unable to make any conclusion from the data that was present.
> hr> at> > mi> >> If the situation is reproducible, you could try to revert r221937. This
> hr> at> > mi> >> is pure speculation, though.
> hr> at> > mi> >
> hr> at> > mi> > Another crash just now after 5hrs uptime. I will try and revert r221937
> hr> at> > mi> > unless there is any extra debugging you want me to add to the kernel
> hr> at> > mi> > instead?
> hr> at> >
> hr> at> >  I am also suffering from a reproducible panic on an 8-STABLE box, an
> hr> at> >  NFS server with heavy I/O load.  I could not get a kernel dump
> hr> at> >  because this panic locked up the machine just after it occurred, but
> hr> at> >  according to the stack trace it was the same as posted one.
> hr> at> >  Switching to an 8.2R kernel can prevent this panic.
> hr> at> >
> hr> at> >  Any progress on the investigation?
> hr> at>
> hr> at> Hiroki,
> hr> at> how easily can you reproduce it?
> hr>
> hr>  It takes 5-10 hours.  I installed another kernel for debugging just
> hr>  now, so I think I will be able to collect more detailed information in
> hr>  a couple of days.
> hr>
> hr> at> It would be important to have a DDB textdump with this information:
> hr> at> - bt
> hr> at> - ps
> hr> at> - show allpcpu
> hr> at> - alltrace
> hr> at>
> hr> at> Alternatively, a coredump which has the stop cpu patch which Andriy can provide.
> hr>
> hr>  Okay, I will post them once I can get another panic.  Thanks!
>
>  I got the panic with a crash dump this time.  The result of bt, ps,
>  allpcpu, and traces can be found at the following URL:
>
>  http://people.allbsd.org/~hrs/FreeBSD/pool-panic_20110818-1.txt

I'm not sure I understand: is a corefile also available?
If so, where could I get it (along with the relevant sources and kernel.debug)?

Thanks,
Attilio


-- 
Peace can only be achieved by understanding - A. Einstein


