Date:      Sun, 23 Oct 2011 08:10:38 +0200
From:      Pawel Jakub Dawidek <pjd@FreeBSD.org>
To:        Lawrence Stewart <lstewart@freebsd.org>
Cc:        freebsd-net@freebsd.org, freebsd-current@freebsd.org, Andre Oppermann <andre@freebsd.org>
Subject:   Re: 9.0-RC1 panic in tcp_input: negative winow.
Message-ID:  <20111023061038.GE1697@garage.freebsd.pl>
In-Reply-To: <4EA36F53.9050907@freebsd.org>
References:  <20111022084931.GD1697@garage.freebsd.pl> <4EA36F53.9050907@freebsd.org>

On Sun, Oct 23, 2011 at 12:35:15PM +1100, Lawrence Stewart wrote:
> On 10/22/11 19:49, Pawel Jakub Dawidek wrote:
> > The panic message says:
> >
> > 	panic: tcp_input negative window: tp 0xfffffe007763e000 rcv_nxt 3718269252 rcv_adv 3718268291
> >
> > I only have picture of the backtrace:
> >
> > 	http://people.freebsd.org/~pjd/misc/panic_negative_window.jpg
> >
>
> ewww that is not good. Can you give us any more information about the
> machine and what it's doing? Is it terminating TCP connections from the
> internet at large or only local LAN (i.e. is there likely to be packet
> loss happening)? Are you doing TSO or LRO? Do you have any non-default
> tuning in place?

It is my local file server. It serves NFS and AFP over the LAN and also
downloads files from the internet. The panic is triggered after a few hours.
I changed the KASSERT() into a printf() that also prints the 'win' variable,
and this is what got logged during the night:

05:16:24 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1107827269 rcv_adv 1107826256 win=242
05:16:29 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1107833451 rcv_adv 1107832977 win=880
05:16:41 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1107849563 rcv_adv 1107848860 win=639
05:20:02 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108108230 rcv_adv 1108107331 win=567
05:24:30 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108433302 rcv_adv 1108432272 win=974
05:24:46 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108450385 rcv_adv 1108450060 win=751
05:26:44 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108574818 rcv_adv 1108573851 win=71
05:28:03 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108654103 rcv_adv 1108653166 win=0
05:28:43 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108692396 rcv_adv 1108691451 win=0
05:30:06 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1109067578 rcv_adv 1109067335 win=663
05:30:06 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108781258 rcv_adv 1108780372 win=235
05:37:03 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1109180403 rcv_adv 1109179411 win=0
05:41:03 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1109428265 rcv_adv 1109427375 win=170
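
For readers who want to see how far rcv_nxt ran past rcv_adv in each
report, here is a minimal Python sketch. It is not the kernel code: it
assumes the assertion compares rcv_adv against rcv_nxt in 32-bit TCP
sequence space, which this message does not quote. The helper names
seq_diff() and seq_geq() are hypothetical; the sample values are copied
from the panic message and the first three log lines above.

```python
MASK32 = 0xFFFFFFFF


def seq_diff(a, b):
    """Signed distance a - b in 32-bit sequence space (handles wraparound)."""
    d = (a - b) & MASK32
    return d - (1 << 32) if d & 0x80000000 else d


def seq_geq(a, b):
    """True if sequence number a is at or after b (assumed form of the check)."""
    return seq_diff(a, b) >= 0


# (rcv_nxt, rcv_adv) pairs taken from the reports in this message.
SAMPLES = [
    (3718269252, 3718268291),  # original panic
    (1107827269, 1107826256),  # 05:16:24
    (1107833451, 1107832977),  # 05:16:29
    (1107849563, 1107848860),  # 05:16:41
]

for rcv_nxt, rcv_adv in SAMPLES:
    left = seq_diff(rcv_adv, rcv_nxt)
    status = "ok" if seq_geq(rcv_adv, rcv_nxt) else "negative window"
    print(f"rcv_nxt={rcv_nxt} rcv_adv={rcv_adv} rcv_adv-rcv_nxt={left} ({status})")
```

In every sample the difference is negative, meaning the receiver accepted
data beyond the right edge of the window it had last advertised.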

And the system seems to be fine.

I'm happy to test patches, but one round would take 24h.

My suggestion would be that if we can't fix it before 9.0,
we should turn this assertion off, as the system seems to be able to
recover.

-- 
Pawel Jakub Dawidek                       http://www.wheelsystems.com
FreeBSD committer                         http://www.FreeBSD.org
Am I Evil? Yes, I Am!                     http://yomoli.com
