Date:      Sun, 25 Jan 2015 20:21:43 +0100
From:      Baptiste Daroussin <bapt@FreeBSD.org>
To:        Slawa Olhovchenkov <slw@zxy.spb.ru>
Cc:        arch@FreeBSD.org, Jordan Hubbard <jkh@ixsystems.com>
Subject:   Re: [RFC] Set the default locale to en_US.UTF-8
Message-ID:  <20150125192143.GR81001@ivaldir.etoilebsd.net>
In-Reply-To: <20150125191733.GS3698@zxy.spb.ru>
References:  <20150124143357.GI81001@ivaldir.etoilebsd.net> <20150125143243.GB76051@zxy.spb.ru> <7B1D8345-248B-4C44-9568-079BA29614C2@ixsystems.com> <20150125155000.GD76051@zxy.spb.ru> <20150125185951.GC23253@server.rulingia.com> <20150125191733.GS3698@zxy.spb.ru>


On Sun, Jan 25, 2015 at 10:17:33PM +0300, Slawa Olhovchenkov wrote:
> On Mon, Jan 26, 2015 at 05:59:51AM +1100, Peter Jeremy wrote:
>
> > On 2015-Jan-25 18:50:00 +0300, Slawa Olhovchenkov <slw@zxy.spb.ru> wrote:
> > >On Sun, Jan 25, 2015 at 06:58:13AM -0800, Jordan Hubbard wrote:
> > >> > On Jan 25, 2015, at 6:32 AM, Slawa Olhovchenkov <slw@zxy.spb.ru> wrote:
> > >> >
> > >> > NO! Please, NOT!
> > >> > Not all bytestrings are valid in UTF-8; as a result, sed, grep, vi,
> > >> > ed, etc. fail unpredictably.
> >
> > I switched to en_AU.UTF-8 about 5 years ago with relatively little pain
> > (though I had very little non-ASCII text).
> >
> > The downside of UTF-8 is that random non-ASCII bytestrings are unlikely to
> > be valid UTF-8 and will therefore get rejected.  About the only time I get
> > bitten by this is that my random password generator:
> >   dd if=/dev/random bs=32 count=1 | tr -cd '!-~'
> > will die with a "tr: Illegal byte sequence" and needs a "LC_ALL=C" to
> > placate it.
>
> Yes, now I remember -- another case is tr.
>
> > >I have used ru_RU.KOI8-R for years. Now I am trying ru_RU.UTF8 and have
> > >hit some issues (on 10-STABLE). 9.x and OS may have different versions of
> > >software and don't touch this.
> >
> > Once you've started using any 8-bit locale, switching to UTF-8 (or any
> > other 8-bit locale) will be a PITA because you need to re-encode everything.
> > And, since it's very difficult to run with multiple locales, you need to
> > do a complete sweep when you change locales.  If you are running into
> > specific issues with incorrect handling of ru_RU.UTF8, that is a bug and
> > you need to report it.
>
> No, I don't have incorrect handling of ru_RU.UTF8 (for correct UTF-8
> files); I have trouble processing non-UTF-8 files (like the tr
> example).

That is why my proposal also sets LC_COLLATE=C, which fixes the tr case, for
example.
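
For reference, a minimal sketch of the quoted password generator with the
locale override applied. It uses the LC_ALL=C form from the quoted message;
whether LC_COLLATE=C alone is enough for a given tr is not checked here.
gen_password is a hypothetical helper name used only for illustration.

```shell
#!/bin/sh
# Sketch only: gen_password is a hypothetical name, not an existing tool.
gen_password() {
    # Read 32 random bytes, then keep only printable ASCII ('!' through '~').
    # LC_ALL=C applies to tr alone, so tr treats its input as raw bytes
    # instead of trying to decode it as UTF-8.
    dd if=/dev/random bs=32 count=1 2>/dev/null | LC_ALL=C tr -cd '!-~'
}

gen_password
echo
```

The override is scoped to the single tr invocation, so the rest of the
session keeps its UTF-8 locale. The output length varies from run to run,
since every byte outside the '!'-'~' range is dropped.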

Bapt
