Date: Mon, 21 Feb 2005 18:22:17 -0800 From: Kent Stewart <kstewart@owt.com> To: Gary Kline <kline@tao.thought.org> Cc: freebsd-questions@freebsd.org Subject: Re: how do i translate non-ascii chars??? Message-ID: <200502211822.17784.kstewart@owt.com> In-Reply-To: <20050221230054.GA58364@thought.org> References: <20050221065149.GA77396@thought.org> <200502202354.49454.kstewart@owt.com> <20050221230054.GA58364@thought.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On Monday 21 February 2005 03:00 pm, Gary Kline wrote: > On Sun, Feb 20, 2005 at 11:54:49PM -0800, Kent Stewart wrote: > > On Sunday 20 February 2005 10:51 pm, Gary Kline wrote: > > > l > > > Guys, > > > > > > I've got sseveral HTML files with O-aigu and O-grave and > > > others (these files were composed on a Mac. Rather than > > > display as ['] (apostrophes) or backticks, they are rendered > > > in full 8859-1. > > > > > > How to I translate these > 128 range characters? I'm > > > wedged. > > > > When people do this, they are supposed to use the á, > > à, and etc. Then, their browser does the correct display on > > their OS. > > I didn't explain myself very well, sorry. Befow is a line > from od -c on the index.html file. I'm not sure how this > will be rendered in the different mailers, but in mutt with > nvi, the "B" is surrounded by two iso8859-1 characters. > In mozilla, same way. It is meant to be `B'. I've got over > 28 files with what should be apostrophes and bcktcks rendered > this way. putchar() outputs these characters in 8859 form > --as characters. printf("0%o", ch); gives me their octal > values. But trying to catch them with getchar() and it > fails. gcc says that '\0325' is a dounle-wide. > > Maybe "od -c" is seeing this file as 16-bit characters... > > > r h y m i n g =D4 B =D5 w o r d Did you try -oc or -hc. I have one file that I translate from a Mac .doc=20 into html and his bullets translate into what vi sees as the Yen=20 symbol. I was always able to cut and paste into vi and then supply the=20 proper character. The problem is that the apostrophe is probably best=20 handled as "´". There isn't a comparable name for the grave. I=20 did the bullet translate with something like ":.,$s/=A5/\•/",=20 knowing that it would not appear as a bullet on some computers. I=20 followed the 80/20 rule and translated it for the PC's. I have always used the table in Castro's book as the source. She created=20 a table that works between OSes. Some things such as your grave do not=20 translate from one system to another. I do not use those characters in=20 my html. The visual value is only as good as your worst mistake.=20 Something that doesn't display properly will be viewed as an error by=20 the viewer. Kent > > gary =2D-=20 Kent Stewart Richland, WA http://users.owt.com/kstewart/index.html
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200502211822.17784.kstewart>