Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 11 Jun 2006 20:49:47 +0200
From:      Roland Smith <rsmith@xs4all.nl>
To:        Brad Waite <freebsd@wcubed.net>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: 6.1-stable hangs and LORs
Message-ID:  <20060611184947.GA11365@slackbox.xs4all.nl>
In-Reply-To: <448C5C41.10302@wcubed.net>
References:  <448C5C41.10302@wcubed.net>

next in thread | previous in thread | raw e-mail | index | archive | help

--VbJkn9YxBvnuCH5J
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Sun, Jun 11, 2006 at 12:09:05PM -0600, Brad Waite wrote:
> Hi guys,
>=20
> I'm going to take another stab at getting some help.
>=20
> For the last 6 months my FBSD gateway has been locking up every few=20
> days, usually about once a week.  No panic, no reboots, just a hard lock=
=20
> with no response on the console or over the net.
>=20
> I've replaced literally every piece of hardware with the exception of=20
> the case and power supply.  No change.

One of my machines had random lockup problems, an then the power supply
died. The problems were gone after it was replaced. You might want to
swap out the power supply. And test the RAM.

> I've upgraded from 5.3- to 6.0- to 6.1-STABLE.  No change.
>=20
> I've researched as much as I know how and still come up with hardly=20
> anything.  I have turned on BREAK_TO_DEBUGGER, WITNESS and INVARIANTS=20
> and the only indication I've gotten is a lock order reversal that's=20
> *similar* to http://sources.zabbadoz.net/freebsd/lor/017.html.  The line=
=20
> numbers in pf.c don't match up with LOR 017, but that's about all I can=
=20
> tell.
>=20
> I'm reasonably certain the issue is with pf, since I have 3 other=20
> non-gateway servers humming along with no problems.  The hardware is=20
> nearly identical - their RAID cards are different, but I've tried=20
> running my gateway on just a single SCSI drive and had the same lockup=20
> issue.  Of course, the issue could be somewhere else, but I'm at a loss=
=20
> as to how to find it.

Could you try swapping two machines? That would be the ultimate check if
it's hardware related.
=20
> I'm running my console over serial so I can log anything that's=20
> necessary.  I've been able to break to the debugger, but to be honest, I=
=20
> don't know what to look for.  I've seen several posts on the lists about=
=20
>  posting the output of debug commands, but I figured it to be in poor=20
> taste to just dump my output here before someone asked.

Posting a stack backtrace (bt) might be a good start.

> I'm getting a lot of heat from the boss since our VoIP phones don't work=
=20
> when the gateway locks up.

Sometimes POTS isn't so bad after all. ;-)

Roland
--=20
R.F.Smith                                   http://www.xs4all.nl/~rsmith/
[plain text _non-HTML_ PGP/GnuPG encrypted/signed email much appreciated]
pgp: 1A2B 477F 9970 BA3C 2914  B7CE 1277 EFB0 C321 A725 (KeyID: C321A725)

--VbJkn9YxBvnuCH5J
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.3 (FreeBSD)

iD8DBQFEjGXLEnfvsMMhpyURAj4hAJ9GtEBzWSrnOKJD6NY1i5b/nu818QCfQFIB
s66Y+v2RNhzLZ2uF/aSzlVE=
=3TOd
-----END PGP SIGNATURE-----

--VbJkn9YxBvnuCH5J--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20060611184947.GA11365>