Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 12 Nov 2013 21:17:37 +0100
From:      Christian Weisgerber <naddy@mips.inka.de>
To:        "Julian H. Stacey" <jhs@berklix.com>
Cc:        hackers@freebsd.org, FreeBSD-gnats-submit@freebsd.org, "Bernhard Riedel \(Work\)" <bernhard@sdg.de>, Astrid Jekat <astrid@jekat.com>
Subject:   Re: patch for /usr/src/usr.bin/fmt/ (not 8 bit clean) for German & French
Message-ID:  <20131112201737.GA52200@lorvorc.mips.inka.de>
In-Reply-To: <201311120110.rAC1A1jc066753@fire.js.berklix.net>
References:  <201311120110.rAC1A1jc066753@fire.js.berklix.net>

next in thread | previous in thread | raw e-mail | index | archive | help
Julian H. Stacey:

> 	I don't know about ISO 8859-1 and UTF-8, (I dislike & avoid
> 	national char set stuff as much as possible), but I want

That is your problem right there.

> 	to be able to edit files that simultaneously contain eg all
> 	of English German & French etc, so setting some var to eg
> 	just German would be inappropriate.  8 bit clean would be ideal,
> 	next best would be my patches I suppose.

You MUST define a character set for this.  "8-bit clean" is meaningless
for a tool that deals with runs of characters.  Without a defined
character set, you have no idea what those bytes mean.  Is 0x90 a
printable character?  Is it a control character?  Is it part of a
multibyte character?

And setting, for example, LC_CTYPE=de_DE.ISO8859-1 does in no way
limit you to German.  For LC_CTYPE purposes, the language/country
part of the locale specification isn't used.

This is definitely a PEBKAC.

-- 
Christian "naddy" Weisgerber                          naddy@mips.inka.de



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20131112201737.GA52200>