Date:      Mon, 22 Sep 1997 20:50:01 -0700 (PDT)
From:      "Daniel O'Callaghan" <danny@FreeBSD.ORG>
To:        freebsd-bugs
Subject:   Re: docs/4555: Typo in utf2(4) man page
Message-ID:  <199709230350.UAA12850@hub.freebsd.org>

The following reply was made to PR docs/4555; it has been noted by GNATS.

From: "Daniel O'Callaghan" <danny@FreeBSD.ORG>
To: freebsd-gnats-submit@freebsd.org, tejblum@arc.hq.cti.ru
Cc:
Subject: Re: docs/4555: Typo in utf2(4) man page
Date: Tue, 23 Sep 1997 13:43:58 +1100

 Hello, you wrote:
 
 >      utf2(4) claims that runes 0x0080-0x03ff (00000bbb.bbbbbbbb) encoded with 2
 >                                          ^         ^^^
 >      bytes. Of course, it really means 0x0080-0x07ff.
 > 
 > How-To-Repeat
 > Fix
 > 
 >      --- /usr/src/lib/libc/locale/utf2.4     Tue Sep 16 18:20:11 1997
 >      +++ utf2.4      Tue Sep 16 21:45:31 1997
 >      @@ -60,7 +60,7 @@
 >       encoding is represented by the following table:
 >       .Bd -literal
 >       [0x0000 - 0x007f] [00000000.0bbbbbbb] -> 0bbbbbbb
 >      -[0x0080 - 0x03ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb
 >      +[0x0080 - 0x07ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb
 >       [0x0400 - 0xffff] [bbbbbbbb.bbbbbbbb] -> 1110bbbb, 10bbbbbb, 10bbbbbb
 
 Look closely and you'll see that the ranges are 0x0000-0x007f,
 0x0080-0x03ff and 0x0400-0xffff, making three contiguous ranges.  Your
 patch would suggest an overlap, as the middle range would end at 0x07ff,
 after the third range starts at 0x0400.
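
 The contiguity argument above can be checked mechanically. The following
 is only a minimal sketch in Python: the helper name check_ranges and the
 two lists ORIGINAL and PATCHED are hypothetical, made up for illustration,
 and are not part of utf2(4) or libc. The range bounds are copied from the
 table quoted in the PR. The check looks only at the ranges as listed; it
 says nothing about how many bits each byte pattern can carry.

```python
# Hypothetical helper (not part of libc): compare neighbouring ranges
# in a table of inclusive (low, high) pairs.
def check_ranges(ranges):
    """Return a list of (kind, lo, hi) issues between neighbouring ranges.

    kind is "overlap" or "gap"; lo..hi is the inclusive affected span.
    """
    issues = []
    ordered = sorted(ranges)
    for (a_lo, a_hi), (b_lo, b_hi) in zip(ordered, ordered[1:]):
        if b_lo <= a_hi:
            # The next range starts before the previous one ends.
            issues.append(("overlap", b_lo, min(a_hi, b_hi)))
        elif b_lo > a_hi + 1:
            # Some values fall between the two ranges.
            issues.append(("gap", a_hi + 1, b_lo - 1))
    return issues


# Ranges as they appear in the man page before the proposed patch.
ORIGINAL = [(0x0000, 0x007F), (0x0080, 0x03FF), (0x0400, 0xFFFF)]
# Ranges after the patch, which changes only the middle upper bound.
PATCHED = [(0x0000, 0x007F), (0x0080, 0x07FF), (0x0400, 0xFFFF)]

for name, table in (("original", ORIGINAL), ("patched", PATCHED)):
    problems = check_ranges(table)
    if not problems:
        print(f"{name}: contiguous")
    for kind, lo, hi in problems:
        print(f"{name}: {kind} at 0x{lo:04x}-0x{hi:04x}")
```

 Under these assumptions the original table is reported as contiguous and
 the patched one as overlapping at 0x0400-0x07ff, which is the overlap
 described above.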
 
 Danny


