Date:      Fri, 30 Oct 1998 19:01:52 -0500 (EST)
From:      "Woodchuck"  <djv@bedford.net>
To:        grog@lemis.com (Greg Lehey)
Cc:        gestura@nyherji.is, questions@FreeBSD.ORG
Subject:   Re: russian language mixes in sendmail
Message-ID:  <199810310001.TAA28067@castor.chuck>
In-Reply-To: <19981028093609.D20920@freebie.lemis.com> from Greg Lehey at "Oct 28, 98 09:36:09 am"

Greg Lehey wrote:
> On Tuesday, 27 October 1998 at 17:50:16 +0000, gestura@nyherji.is wrote:
> > Greetings,
> >
> > sorry guys, I mistyped in the version for the sendmail I'm running,
> > it should have been version 8.9.1a, not 8.6.1a :)
> >
> > but the problem still persists,
> > I'm allowing 8bit characterset, and all Icelandic spec. letters come
> > through in good order
> 
> Can you get somebody to send you and me an original message in
> Russian; then forward me what you get, and I'll compare them.
> 
> Greg
> --


The first six letters of the Russian Cyrillic alphabet,
encoded in KOI8:

Lowercase: абвгде
Latin eqv: abvgde
Uppercase: АБВГДЕ

Hexdump of those lines:

[listread@castor listread]$ hexdump -C goo 
00000000  6c 6f 77 65 72 63 61 73  65 3a 20 c1 c2 d7 c7 c4  |lowercase: .....|
00000010  c5 0a 4c 61 74 69 6e 20  65 71 76 3a 20 61 62 76  |..Latin eqv: abv|
00000020  67 64 65 0a 55 70 70 65  72 63 61 73 65 3a 20 e1  |gde.Uppercase: .|
00000030  e2 f7 e7 e4 e5 0a                                 |......|
00000036
[listread@castor listread]$ 
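For anyone who wants to check the bytes without a Cyrillic font installed,
here is a minimal sketch in Python. It assumes the "KOI8" in question is
what Python calls koi8_r, and it uses latin_1 to stand in for the font a
Western European client would normally apply. The variable names are mine.

```python
# Bytes copied from the hexdump above (offsets 0x0b-0x10 and 0x2f-0x34).
lower = bytes([0xC1, 0xC2, 0xD7, 0xC7, 0xC4, 0xC5])
upper = bytes([0xE1, 0xE2, 0xF7, 0xE7, 0xE4, 0xE5])

# Interpreted as KOI8-R, the bytes are Cyrillic letters.
lower_koi8 = lower.decode("koi8_r")
upper_koi8 = upper.decode("koi8_r")

# Interpreted as Latin-1, the very same bytes are accented Latin letters.
lower_latin1 = lower.decode("latin_1")
upper_latin1 = upper.decode("latin_1")

print(lower_koi8, upper_koi8)
print(lower_latin1, upper_latin1)
```

The bytes on the wire are identical in both cases; only the table used to
turn them into glyphs differs.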


If you are seeing Icelandic characters, your client is drawing the
bytes 128-255 with a Latin-1 font, and with that font you'll never see
Cyrillic. It sounds like a problem of selecting a Cyrillic font and
enabling it in the client. This can be daunting.

Also, there are two brain damaged encodings used (surprise) only
by Microsoft. These appear as gibberish when viewed with a KOI8
font. TeX originally used this CP m$ crap because of MS-DOS
chauvinism in the original PeeCee implementations of TeX. Only
recently have KOI8 capable TeX implementations appeared.
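A rough sketch of what "gibberish" means here, in Python. The message
above does not name the two encodings; I am assuming they are the code
pages Python knows as cp866 (DOS) and cp1251 (Windows). The helper name
view_as_koi8 is made up for the illustration.

```python
def view_as_koi8(text: str, wire_encoding: str) -> str:
    """Encode text in wire_encoding, then read the bytes back as KOI8-R."""
    raw = text.encode(wire_encoding)
    # errors="replace" keeps the demo from raising on any unmapped byte.
    return raw.decode("koi8_r", errors="replace")

sample = "абвгде"

seen_from_cp1251 = view_as_koi8(sample, "cp1251")
seen_from_cp866 = view_as_koi8(sample, "cp866")
seen_from_koi8 = view_as_koi8(sample, "koi8_r")

print(seen_from_cp1251)  # Cyrillic letters, but the wrong ones
print(seen_from_cp866)   # not readable text at all
print(seen_from_koi8)    # round-trips cleanly
```

Only the last case comes back as the original text; the other two show
why the sender and the reader have to agree on the charset.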

KOI8 is the (Sov. then Russian) gov't standard. All the Cyrillic is
in the range 128-255; the range 0-127 is the standard ASCII glyphs.
Since there are 66 Cyrillic glyphs (33 letters, upper and lower case),
the charsets are treated like Hebrew or Arabic, as "things in
themselves", rather than Latin + a few weird letters.
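The layout described above can be checked with a few lines of Python.
This is only a sketch and assumes Python's koi8_r codec is the KOI8
variant meant here; the variable names are mine.

```python
# The 33 letters of the modern Russian alphabet, lower case.
letters = "абвгдеёжзийклмнопрстуфхцчшщъыьэюя"
glyphs = letters + letters.upper()
glyph_count = len(glyphs)

# Every Cyrillic letter should encode to one byte in the upper half.
encoded = glyphs.encode("koi8_r")
all_high = all(b >= 128 for b in encoded)
one_byte_each = len(encoded) == glyph_count

# The lower half should be plain ASCII, unchanged.
low_half = bytes(range(128))
ascii_kept = low_half.decode("koi8_r") == low_half.decode("ascii")

print(glyph_count, all_high, one_byte_each, ascii_kept)
```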

Dave


-- 
           Strangers know your loved ones' phone numbers!!!!
