Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 15 Jun 2010 20:01:58 +0300
From:      Gleb Kurtsou <gleb.kurtsou@gmail.com>
To:        Gabor Kovesdan <gabor@FreeBSD.org>
Cc:        Edwin Groothuis <edwin@mavetju.org>, FreeBSD Current <freebsd-current@freebsd.org>, Xin LI <d@delphij.net>, i18n@FreeBSD.org
Subject:   Re: [CFT] BSDL iconv in base system
Message-ID:  <20100615170158.GA2131@tops>
In-Reply-To: <4C16C5B5.1070308@FreeBSD.org>
References:  <4C16C5B5.1070308@FreeBSD.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On (15/06/2010 02:13), Gabor Kovesdan wrote:
> Hello Folks,
> 
> during the last summer, Google generously founded my Summer of Code 
> project, which was providing a BSD-licensed iconv implementation for 
> FreeBSD. I'm proud to announce that the work has been completed and a 
> patch is available to add it to the base system.
> 
> The results of this work are:
> - The Citrus implementation has been ported from NetBSD.
> - Some utilities have been added. There is a conversion table generator, 
> which can compare conversion tables to reference data generated by GNU 
> libiconv. This helps ensuring conversion compatibility.
> - UTF-16 surrogate support and some endianness issues have been fixed.
> - The rather chaotic Makefiles to build metadata have been refactored 
> and cleaned up, now it is easy to read and it is also easier to add 
> support for new encodings.
> - A bunch of new encodings and encoding aliases have been added.
> - Support for 1->2, 1->3 and 1->4 mappings, which is needed for 
> transliterating with flying accents as GNU does, like "u.
> - Lots of warnings have been fixed, the major part of the code is now 
> WARNS=6 clean.
> - New section 1 and section 5 manual pages have been added.
> - Some GNU-specific calls have been implemented: iconvlist(), 
> iconvctl(), iconv_canonicalize(), iconv_open_into()
> - Support for GNU's //IGNORE suffix has been added.
> - The "-" argument for stdin is now recognized in iconv(1) as per POSIX.
> - The Big5 conversion module has been fixed.
> - The iconv.h header files is supposed to be compatible with the GNU 
> version, i.e. sources should build with base iconv.h and GNU libiconv. 
> I've just did a very quick test and it seems ports can safely link to 
> GNU libiconv, there's no conflict.
> - Various cleanups and style(9) fixes.
> - A bachelor thesis written in Hungarian language: 
> http://www.kovesdan.org/files/bsc_iconv.pdf
> 
> The rather big patch (42,5M) is available here: 
> http://www.kovesdan.org/patches/iconv_base_integrate.diff
> 
> Any comments, suggestions or bugreports are very welcome.

Awesome! Thanks for working on it.

Are there any plans to resurrect/finish multibyte collation support
GSoC'2008 project:
http://wiki.freebsd.org/KonradJankowski/Collation

And are you aware of any plans on adding utf8-aware regex? I think
NetBSD has already imported one:
http://blog.netbsd.org/tnf/entry/efficient_wide_character_regular_expressions

Thanks,
Gleb.

> -- 
> Gabor Kovesdan
> FreeBSD Volunteer
> 
> EMAIL:gabor@FreeBSD.org  .:|:.gabor@kovesdan.org
> WEB:http://people.FreeBSD.org/~gabor  .:|:.http://kovesdan.org
> 
> _______________________________________________
> freebsd-current@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-current
> To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org"



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20100615170158.GA2131>