From owner-freebsd-current@FreeBSD.ORG Fri Nov 21 11:41:19 2003 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 5428F16A4CE; Fri, 21 Nov 2003 11:41:19 -0800 (PST) Received: from obsecurity.dyndns.org (adsl-63-207-60-234.dsl.lsan03.pacbell.net [63.207.60.234]) by mx1.FreeBSD.org (Postfix) with ESMTP id C9BB143FDD; Fri, 21 Nov 2003 11:41:17 -0800 (PST) (envelope-from kris@obsecurity.org) Received: by obsecurity.dyndns.org (Postfix, from userid 1000) id 1AE0666C55; Fri, 21 Nov 2003 11:41:17 -0800 (PST) Date: Fri, 21 Nov 2003 11:41:17 -0800 From: Kris Kennaway To: John Baldwin Message-ID: <20031121194116.GA3913@xor.obsecurity.org> References: <20031120190342.GA87876@xor.obsecurity.org> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="+QahgC5+KEYLbs62" Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.1i cc: re@FreeBSD.org cc: current@FreeBSD.org cc: Kris Kennaway Subject: Re: spin lock sched lock held by 0xc25a8640 for > 5 seconds X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 21 Nov 2003 19:41:19 -0000 --+QahgC5+KEYLbs62 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Thu, Nov 20, 2003 at 02:31:23PM -0500, John Baldwin wrote: >=20 > On 20-Nov-2003 Kris Kennaway wrote: > > I updated bento last night, and it panicked after a few hours with: > >=20 > > Fatal trap 12: page fault while in kernel mode > > cpuid =3D 0; apic id =3D 01 > > fault virtual address =3D 0xe5 > > fault code =3D supervisor read, page not present > > instruction pointer =3D 0x8:0xc052e219 > > stack pointer =3D 0x10:0xe00d2c3c > > frame pointer =3D 0x10:0xe00d2c64 > > code segment =3D base 0x0, limit 0xfffff, type 0x1b > > =3D DPL 0, pres 1, def32 1, gran 1 > > processor eflags =3D resume, IOPL =3D 0 > > current process =3D 40 (irq29: sym0) > > spin lock sched lock held by 0xc25a8640 for > 5 seconds > > panic: spin lock held too long > > cpuid =3D 1; > > Debugger("panic") > >=20 > > Unfortunately it hangs there instead of breaking to DDB. > >=20 > > The instruction pointer is in: > >=20 > > ... > > c052e000 t propagate_priority > > c052e360 T init_turnstiles > > ... > >=20 > > I'm currently rebuilding without WITNESS_SKIPSPIN in case this catches = it. >=20 > Argh, I didn't catch that you trap 12'd before the sched_lock hang. > Do you have INVARIANTS on? Also, do you possibly have a kernel.debug > around that you could try to figure out what line that IP corresponds to? kernel trap 12 with interrupts disabled Fatal trap 12: page fault while in kernel mode cpuid =3D 0; apic id =3D 01 fault virtual address =3D 0xe5 fault code =3D supervisor read, page not present instruction pointer =3D 0x8:0xc052e219 stack pointer =3D 0x10:0xe00cdc3c frame pointer =3D 0x10:0xe00cdc64 code segment =3D base 0x0, limit 0xfffff, type 0x1b =3D DPL 0, pres 1, def32 1, gran 1 processor eflags =3D resume, IOPL =3D 0 current process =3D 39 (irq28: sym1) Another one just happened. It again locked up and failed to enter DDB. Kris --+QahgC5+KEYLbs62 Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.3 (FreeBSD) iD8DBQE/vmpcWry0BWjoQKURAqR6AKDaW1MRt8XzchFPM6T6lWkgJfZV/ACg0zCZ QjxI3oTPh0eph94i6fEgzKo= =BwM1 -----END PGP SIGNATURE----- --+QahgC5+KEYLbs62--