Date: Mon, 5 Nov 2012 20:34:34 +0100
From: Polytropon
To: grarpamp
Cc: freebsd-questions@freebsd.org
Subject: Re: Character set conversion, locales, UTF-8, etc

On Mon, 5 Nov 2012 14:05:17 -0500, grarpamp wrote:
> >> As an aside, why does FreeBSD seem to default to the above locale
> >> instead of say, en_US.UTF-8 ?
> >
> > FreeBSD's file system does not default to any locale, as far as I
> > know. The system is "agnostic" to what the characters in the file
> > name mean or what symbol they should represent.
> 
> Sure the fs is just binary, then viewed and written through
> the mask of the selected language layer I think.

Yes, that seems to be the case.

> > There isn't much you can do on file system level except renaming
> > the files: write a program that reads the file names according
> > to the preferred interpretation and write new names for them,
> 
> I'll read more on language to see if I can reverse that and
> recover them or just replace with X's.

For X it's important to have the required language variables
set and the fonts containing the characters which are represented.

> I was looking mostly for a tool that would show me what a
> filename or data looks like in hex, octal, and different
> selected encodings. Doing it by hand is slow. I'll check
> ports again.

The system already comes with such a tool: od (octal, decimal, hex,
ASCII dump). For example:

    % ls -w müslifraß.txt | od -h
    0000000 fc6d 6c73 6669 6172 2edf 7874 0a74

This is on en_US.ISO8859-1 (German special characters will be
displayed properly as single bytes both in X and in console mode).

-- 
Polytropon
Magdeburg, Germany
Happy FreeBSD user since 4.0
Andra moi ennepe, Mousa, ...
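
Addendum to the od example above: od covers hex and octal, but not the
"different selected encodings" part of the question. Note also that
od -h prints 16-bit words, so on a little-endian machine each byte pair
appears swapped (fc6d is the bytes 6d fc, i.e. "m" followed by u-umlaut
in ISO 8859-1). The following is only a minimal sketch, assuming Python
is installed (it is not part of the base system). The function name
dump_name and the default list of encodings are made up for this
illustration; the sample bytes are the file name from the od dump,
without the trailing newline.

```python
# Hypothetical helper, not part of FreeBSD or any port: show the raw
# bytes of a file name and how several encodings would interpret them.

def dump_name(raw, encodings=("ascii", "iso-8859-1", "utf-8")):
    """Return report lines (hex, octal, one per encoding) for bytes `raw`."""
    lines = [
        # One byte at a time, so no word swapping as with od -h.
        "hex:   " + " ".join(f"{b:02x}" for b in raw),
        "octal: " + " ".join(f"{b:03o}" for b in raw),
    ]
    for enc in encodings:
        try:
            text = raw.decode(enc)
        except UnicodeDecodeError as err:
            # The bytes are not valid in this encoding; say so instead
            # of guessing at a replacement character.
            text = f"<not valid {enc}: {err.reason}>"
        lines.append(f"{enc}: {text}")
    return lines


if __name__ == "__main__":
    # Bytes taken from the od dump above (file name only, no newline).
    sample = b"m\xfcslifra\xdf.txt"
    print("\n".join(dump_name(sample)))
```

To run it over real files, os.listdir(b".") returns the names as raw
bytes, which can be passed to dump_name() unchanged. Whether the decoded
text then displays correctly still depends on the terminal's locale and
fonts.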