Date:      Mon, 20 Sep 2010 20:31:33 +0100
From:      Pete French <pete@twisted.org.uk>
To:        freebsd-bugs@FreeBSD.org, jh@FreeBSD.org
Subject:   Re: bin/150727: diff on UTF-8 text files thinks they are binary - regression from 7.X
Message-ID:  <E1Oxm5N-000D1p-C3@toybox.twisted.org.uk>
In-Reply-To: <201009201332.o8KDWmlo074276@freefall.freebsd.org>

> I couldn't reproduce this with simple UTF-8 files:

I just looked through my example files in detail, and it turns out the
problem is not with UTF-8 after all, but with NUL characters (zero
bytes) which are also in the file. This is what trips up 'diff' - and
though it is a change from 7.X I am not sure that it is really a bug.

Sorry for the noise - the code I used to verify that the file was
a valid UTF-8 file accepts the zero bytes quite happily and says
that it is a text file.
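
For illustration, here is a minimal Python sketch of how such a check can
pass a file that contains zero bytes. It is not the code I actually used;
the function names and the sample data are made up for this example. A NUL
byte is a valid UTF-8 encoding of U+0000, so a pure UTF-8 validity test
accepts it, and a separate test for zero bytes is needed to catch the case
that makes a file look binary.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8 (NUL bytes are allowed)."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def looks_like_text(data: bytes) -> bool:
    """Hypothetical stricter check: valid UTF-8 and no zero bytes at all."""
    return is_valid_utf8(data) and b"\x00" not in data


# Made-up sample: UTF-8 text with an embedded zero byte.
sample = "caf\u00e9\n".encode("utf-8") + b"\x00" + b"more\n"

print(is_valid_utf8(sample))    # validity check alone accepts the NUL
print(looks_like_text(sample))  # the stricter check rejects it
```

Running the stricter check over the files before diffing them would have
shown the zero bytes straight away.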

 


