Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 23 Jun 2010 17:40:48 -0400
From:      Lowell Gilbert <freebsd-questions-local@be-well.ilk.org>
To:        Martin Minkus <martin.minkus@punz.co.nz>
Cc:        freebsd-questions <freebsd-questions@freebsd.org>
Subject:   Re: sshd / tcp packet corruption ?
Message-ID:  <44hbkt4ecf.fsf@be-well.ilk.org>
In-Reply-To: <H00000ac002fca76.1277265652.silver.pulse.local@MHS> (Martin Minkus's message of "Wed, 23 Jun 2010 16:00:52 %2B1200")
References:  <H00000ac002fca76.1277265652.silver.pulse.local@MHS>

next in thread | previous in thread | raw e-mail | index | archive | help
Martin Minkus <martin.minkus@punz.co.nz> writes:

> It seems this issue I reported below may actually be related to some
> kind of TCP packet corruption ?

Possible.  Or memory errors.  Hard to say much at this point, when you
don't even know which side is actually causing the errors.

> Still same box. I=92ve noticed my SSH connections into the box will die
> randomly, with errors.
>
>=20=20
>
> Sshd logs the following on the box itself:
>
>=20=20
>
> Jun 18 11:15:32 kinetic sshd[1406]: Received disconnect from
> 10.64.10.251: 2: Invalid packet header.  This probably indicates a
> problem with key exchange or encryption.=20
>

You might find more useful information by getting verbose messages from
the other end.=20=20

I don't have time to check this in detail, but if I recall correctly,
that message means that the other side closed the connection based on an
apparent invalid header type in a packet that 'kinetic' received.
Random corruption isn't likely in that case, because the error is always
in the same place in the packet.  Check the 'netstat -i' numbers to see
if the drivers are picking up any packet errors.

It's hard to debug network problems in ssh, though, because (obviously)
you can't tell in general whether packet data is corrupt.  If you can
set up a test case with, say, UDP echo, that would be easier to see the
damage to the packets if they are, in fact, being corrupted.=20=20

Unfortunately, I'm so used to having sophisticated test equipment in the
lab to look at these kinds of problems that I'm probably missing what
would be obvious to someone who deals with problems "in the field."
Hope I've been somewhat helpful anyway.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?44hbkt4ecf.fsf>