Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 09 Nov 2011 02:51:31 +0100
From:      "Michael Ross" <gmx@ross.cx>
To:        freebsd-questions@freebsd.org, "Conrad J. Sabatier" <conrads@cox.net>
Subject:   Re: "Unprintable" 8-bit characters
Message-ID:  <op.v4nor5dhg7njmm@michael-think>
In-Reply-To: <20111108184236.3a78ebf6@cox.net>
References:  <20111108184236.3a78ebf6@cox.net>

next in thread | previous in thread | raw e-mail | index | archive | help
Am 09.11.2011, 01:42 Uhr, schrieb Conrad J. Sabatier <conrads@cox.net>:

> Pardon me if this may seem like a stupid question, but this is
> something that's been bugging me for a long time, and none of my
> research has turned up anything useful yet.
>
> I've been trying to understand what the deal is with regards to the
> displaying of the "extended" 8-bit character set, i.e., 8-bit characte=
rs
> with the MSB set.
>
> More specifically, I'm trying to figure out how to get the "ls" comman=
d
> to properly display filenames containing characters in this extended
> set.  I have some MP3 files, for instance, whose names contain certain=

> European characters, such as the lowercase "u" with umlaut (code 0xfc
> in the Latin set, according to gucharmap), that I just can't get ls to=

> display properly.  These characters seem to be considered by ls as
> "unprintable", and the best I've been able to produce in the ls
> output is backslash interpretations of the characters using either the=

> -B or -b options, otherwise the default "?" is displayed in their plac=
e.

Unsure if I understand you correctly.
("extended" 8-bit character set with MSB? utf-16?)
I'm confused by this charset stuff in general.

Assuming you want \0xfc displayed as "=FC",

> cat test.py && python test.py && ls -l

#!/usr/local/bin/python
# -*- coding: utf-8 -*-

f=3Dopen('\xfc','w')
f.close()
total 2

-rw-r--r--  1 michael  wheel  29  9 Nov 02:43 test.py
-rw-r--r--  1 michael  wheel   0  9 Nov 02:44 =FC


here is what works for me:

in my login class in /etc/login.conf:

         :charset=3DISO-8859-1:\
         :lang=3Dde_DE.ISO8859-1:\

``cap_mkdb /etc/login.conf'' after changes


in /etc/rc.conf:

	scrnmap=3D"iso-8859-1_to_cp437"
	font8x8=3D"cp850-8x8"
	font8x14=3D"cp850-8x14"
	font8x16=3D"cp850-8x16"


and in /etc/ttys, console type is set to ``cons25l1''


Regards,

Michael



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?op.v4nor5dhg7njmm>