Date:      Thu, 15 Apr 1999 03:54:46 +0900
From:      Motomichi Matsuzaki <mzaki@e-mail.ne.jp>
To:        dcs@newsguy.com
Cc:        logix@foobar.franken.de, freebsd-bugs@FreeBSD.ORG, freebsd-current@FreeBSD.ORG
Subject:   Re: kern/5038: FreeBSD can't read MS Joliet CDs.
Message-ID:  <19990415035446A.mzaki@e-mail.ne.jp>
In-Reply-To: <3714D32A.DF778838@newsguy.com>
References:  <199904140620.XAA24224@freefall.freebsd.org> <37149716.582342ED@newsguy.com> <19990414164548.A14876@foobar.franken.de> <19990415021736I.mzaki@e-mail.ne.jp> <3714D32A.DF778838@newsguy.com>


From: "Daniel C. Sobral" <dcs@newsguy.com>
Subject: Re: kern/5038: FreeBSD can't read MS Joliet CDs.
Date: Thu, 15 Apr 1999 02:40:58 +0900

dcs> > 2. Only Page 00 Unicode is shown (for Joliet CDs).
dcs> ...
dcs> >    Note.
dcs> >    Byung's patch passes Unicode transparently.
dcs> >    So CJK filenames are shown if terminals do Unicodes well.
dcs> 
dcs> Do you mean that your patch shows only page 00, but Byung's patch
dcs> passes Unicode directly?

My patch:
  * characters in page 00 are converted to ASCII (+ISO-8859-1)
  * characters in other pages are replaced by '?'

Byung's patch:
  * characters in page 00 are converted to ASCII (+ISO-8859-1)
  * characters in other pages are passed as is

For example, suppose there is a file on a Joliet CD whose filename is
encoded as [ 0041 0042 0043 2525 3030 3535 ] in Unicode.

A kernel with my patch applied presents it to userland via readdir(2) as
  (41)(42)(43)(3F)(3F)(3F) ... "ABC???"

A kernel with Byung's patch applied presents it as
  (41)(42)(43)(25)(25)(30)(30)(35)(35) ... "ABC%%0055"
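
The two behaviours above can be modelled in a few lines. This is only a
userland sketch of the mapping as described in this message, not code from
either patch; the function name joliet_to_bytes and its passthrough flag
are made up for illustration.

```python
def joliet_to_bytes(units, passthrough=False):
    """Map 16-bit Unicode code units to the bytes readdir(2) would return.

    passthrough=False models the '?' replacement behaviour,
    passthrough=True models passing non-page-00 characters as is.
    (Hypothetical helper, not taken from either patch.)
    """
    out = bytearray()
    for u in units:
        hi, lo = u >> 8, u & 0xFF
        if hi == 0:
            out.append(lo)          # page 00: low byte is ASCII / ISO-8859-1
        elif passthrough:
            out += bytes((hi, lo))  # other pages: both raw bytes, high byte first
        else:
            out.append(ord("?"))    # other pages: replaced by '?'
    return bytes(out)


name = [0x0041, 0x0042, 0x0043, 0x2525, 0x3030, 0x3535]
print(joliet_to_bytes(name))                    # b'ABC???'
print(joliet_to_bytes(name, passthrough=True))  # b'ABC%%0055'
```

In the pass-through case each character outside page 00 becomes two
bytes, which a terminal then interprets in whatever encoding the locale
uses.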


Byung reported,
> With this patch, I was able to see long file names and 
> non-english characters(such as korean&chinese) so i believe it 
> does support unicode as long as your locale is set correctly.

But with my locale 'ja_JP.EUC' (the most common one on *BSD boxes in Japan),
these strings make no sense.

To see filenames which contain Japanese characters,
the kernel should convert them to EUC-JP.
Another possible answer is encoding them as UTF-8;
however, I don't know of a UTF-8 terminal that handles Japanese.
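
To make the two alternatives concrete, here is a small userland sketch
using Python's standard codecs. It only illustrates what the resulting
byte strings look like; it says nothing about how a kernel would carry
the conversion tables. The helper name ucs2_to_locale and the sample
characters U+65E5 U+672C are assumptions chosen for the example.

```python
def ucs2_to_locale(units, encoding):
    """Re-encode 16-bit Unicode code units into a locale encoding.

    Characters the target encoding cannot represent become '?'.
    (Hypothetical helper for illustration only.)
    """
    text = "".join(chr(u) for u in units)
    return text.encode(encoding, errors="replace")


nihon = [0x65E5, 0x672C]                  # two Japanese characters
euc = ucs2_to_locale(nihon, "euc_jp")     # 2 bytes per character
utf8 = ucs2_to_locale(nihon, "utf-8")     # 3 bytes per character
print(len(euc), len(utf8))
```

Either result is meaningful only to a terminal that expects that
encoding, which is why the choice depends on the user's locale.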


-- 
Motomichi Matsuzaki <mzaki@e-mail.ne.jp>
Dept. of Biological Science, Faculty of Sciences, Univ. of Tokyo, Japan

