Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 18 Feb 2006 15:50:04 -0500
From:      Gerard Seibert <gerard@seibercom.net>
To:        freebsd-questions@freebsd.org
Subject:   Re: Removing BOM from UTF-8
Message-ID:  <200602181550.13252.gerard@seibercom.net>
In-Reply-To: <1140294574.8388.8.camel@localhost>
References:  <20060218111849.15E6.GERARD@seibercom.net> <200602181435.07066.gerard@seibercom.net> <1140294574.8388.8.camel@localhost>

next in thread | previous in thread | raw e-mail | index | archive | help
--nextPart7635913.jlODiEy3dC
Content-Type: text/plain;
  charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

enjamin A'Lee wrote:

> From:
> "Benjamin A'Lee" <lists@benalee.co.uk>
>   To:
> freebsd-questions@freebsd.org
>   Date:
> Today 03:29:34 pm
>   =C2=A0
>
> On Sat, 2006-02-18 at 14:34 -0500, Gerard Seibert wrote:
> > Maybe I am doing something wrong, but it does not appear to be
> > working correctly. I named the file nobom.sh and put it in the same
> > directory as the files I want to convert. I also set the program
> > permission to 0755.
> >
> > typing the p[program name does nothing; I have to precede it with
> > 'perl'. Even then, it does not appear to work correctly. In the
> > following example, the file is parsed, but not converted.
>
> Sorry; try changing the first line to #!/usr/local/bin/perl
>
> > perl nobom.sh testfile
> >
> > Am I doing something incorrectly here?
>
> Try:
>
> cat testfile | nobom.sh
>
> Though the way you describe appears to work here:
>
> $ cat bom-testfile | hd
> 00000000 =C2=A0ef bb bf 23 20 42 4f 4d =C2=A020 74 65 73 74 20 66 69 =C2=
=A0|...# BOM
> test fi| 00000010 =C2=A06c 65 0a =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =
=C2=A0 =C2=A0 =C2=A0 =C2=A0
> =C2=A0|le.| 00000013
> $ bomkill.pl bom-testfile | hd
> 00000000 =C2=A023 20 42 4f 4d 20 74 65 =C2=A073 74 20 66 69 6c 65 0a =C2=
=A0|# BOM
> test file.| 00000010
>
>
> =C2=A0 =C2=A0 Ben

Something appears to be wrong here. First, the file will not run unless=20
I precede it with 'perl'. I have another perl script in the same=20
directory that runs just fine without any special prefixes. Also, the=20
script does not seem to remove the BOM entity.

This is the script as I have it entered:

#!/usr/local/bin/perl
use warnings;
use diagnostics -verbose;
@file=3D<>;
$file[0] =3D~ s/^\xEF\xBB\xBF//;
print(@file);

I have the file permissions set to 0755. Is there anything else that=20
could be causing this to fail?

This is the first line of the file I am attempting to fix (well one of=20
them).

=C3=AF=C2=BB=C2=BFSubject:

That is what appears when I use pico to view the file.

=2D-=20
Gerard Seibert
gerard@seibercom.net

PGP: http://www.seibercom.net/sig/gerard.asc

--nextPart7635913.jlODiEy3dC
Content-Type: application/pgp-signature

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2 (FreeBSD)

iD8DBQBD94iFchM2dIO+3uMRAhyaAKCQbknYMo5eckbi4U/8kvTS/dAg1QCfe96i
tLot4w2vyF53LChQhJVg/Yw=
=Wz3m
-----END PGP SIGNATURE-----

--nextPart7635913.jlODiEy3dC--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200602181550.13252.gerard>