From owner-freebsd-stable@FreeBSD.ORG Wed Mar 12 15:45:41 2014 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 6462DA93 for ; Wed, 12 Mar 2014 15:45:41 +0000 (UTC) Received: from nm1-vm6.bullet.mail.ir2.yahoo.com (nm1-vm6.bullet.mail.ir2.yahoo.com [212.82.96.77]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id B4A81C66 for ; Wed, 12 Mar 2014 15:45:40 +0000 (UTC) Received: from [212.82.98.57] by nm1.bullet.mail.ir2.yahoo.com with NNFMP; 12 Mar 2014 15:45:32 -0000 Received: from [46.228.39.87] by tm10.bullet.mail.ir2.yahoo.com with NNFMP; 12 Mar 2014 15:45:32 -0000 Received: from [127.0.0.1] by smtp124.mail.ir2.yahoo.com with NNFMP; 12 Mar 2014 15:45:32 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1394639132; bh=2wIb923XPcsqVoUxLfnz9n2g//FezoXpWPHXSrqVwRY=; h=X-Yahoo-Newman-Id:X-Yahoo-Newman-Property:X-YMail-OSG:X-Yahoo-SMTP:X-Rocket-Received:Message-ID:Date:From:User-Agent:MIME-Version:To:CC:Subject:References:In-Reply-To:Content-Type:Content-Transfer-Encoding; b=ABS8iSTz9Td7Y6FwjwdFhuVMfyEBWRUNdMTRSPYNAy2hpcr3jTRIYJhstCrdJYquX2R7JxG9Jw9j2pOIrEKvoJ6CrQKwmwrg6AfvqVYSu3lE5btmPxXkjpvMwXzvM3VblSmpc+wppqboTBd7IDrMd7VbfEjIcFe9qgJHiSkS650= X-Yahoo-Newman-Id: 234190.73929.bm@smtp124.mail.ir2.yahoo.com X-Yahoo-Newman-Property: ymail-3 X-YMail-OSG: gdINAoIVM1mDQcL5dlcdHzQesD4ioKhCeYvTdgbHTd3tBjT JbBXFac2VyzXNhB4Qz6eiBmnRY27aqxofJEAJ4PX1nZEVesqAQxFtjRFEL62 FA7jBzhS_6hOqi9B.wlaS4MiQNVp4z9W3fup2WJ1_L42omgv_i9_udpaDOfp HTssIK25cJYoRRTmjgQOjVW7kjqaasOZNN0ZUzMYbnCWN_Df6_5w32TfU9YY 5VQ.Lc8ctPEnGVPc1u9cSKIhnaqiu6QYHFYDAlN.yYB4AXi5_sF3Ek8DLR31 oLNIa08hn7_wPsV67NMQAmFIzw9r3i87nV_NrL5v_qKzmx6cZI2plcj3_R68 98ges8CC20YqLGKqkt.2zgpt_ddtd.jV93BS8H1NcxOhnN5ev7Au7o8R9JNB bKg8Z5lE2fVZxt6LLuj08qV.JDGkHGfRP.WXfjKlIwfQtpg8Fgq_WPJRNHW8 hUUW4ns.OT2AUIG13RCjZ9CULf0wHvPGTF2Ngs7dg9FPgfHVFghA8FTW3vZa SU.PSruL4.g-- X-Yahoo-SMTP: .O5qiqOswBCAHusliVRDDr_SG.Tb X-Rocket-Received: from [192.168.1.67] (rmg70swe@213.64.218.92 with plain [188.125.69.59]) by smtp124.mail.ir2.yahoo.com with SMTP; 12 Mar 2014 15:45:32 +0000 UTC Message-ID: <53208119.6060009@yahoo.com> Date: Wed, 12 Mar 2014 16:45:29 +0100 From: Rolf Nielsen User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:24.0) Gecko/20100101 Thunderbird/24.3.0 MIME-Version: 1.0 To: stable@freebsd.org Subject: Re: UTF-8 Sorting References: <5320297F.1080400@ze.tum.de> <53207451.3010305@yahoo.com> <53207613.2090801@ze.tum.de> In-Reply-To: <53207613.2090801@ze.tum.de> Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 8bit Cc: Gerhard Schmidt X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.17 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 12 Mar 2014 15:45:41 -0000 Gerhard Schmidt skrev 2014-03-12 15:58: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > On 12.03.2014 15:50, Rolf Nielsen wrote: >> >> >> Gerhard Schmidt skrev 2014-03-12 10:31: >>> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 >>> >>> Hi, >>> >>> I've a problem with FreeBSD, UTF-8 and Sorting. >>> >>> e.g. there is a file with the following content >>> >>> Meier Müller Öger Ofner Schmidt >>> >>> I have set my Terminal to ISO-8859-1 Encoding and call sort on >>> this file I get the following output. >>> >>> Meier Müller Ofner Öger Schmidt >>> >>> Which is correctly sorted. >>> >>> When i change my Terminal to UTF-8 encoding and convert the file >>> to UTF-8 and call sort again I get the following output. >>> >>> Meier Müller Ofner Schmidt Öger >>> >>> which is wrong. >>> >>> The problem seams to be that the LC_COLLATE file in the >>> de_DE.UTF-8 locale is linked to ../la_LN.US-ASCII/LC_COLLATE (as >>> are all LC_COLLATE Files in any UTF-8 locale). >>> >>> After some Research i found a Mail from Kuba Lida in December >>> 2008 (yeah that's 5 Years ago) stating the same Problem and got >>> no response. >>> >>> Why isn't there a UTF-8 LC_COLLATE file for any language. Kuba >>> Lida believed there was a Problem with multibyte collate files in >>> FreeBSD. Is this true and are there plans to fix this problem. >>> >>> The same test under Linux works without problem. >>> >>> Regards Estartu >>> >>> - -- - >> >> Hi, >> >> Hmm, to me the result that you claim is wrong looks perfectly >> correct, however, it may of course differ between languages. In >> Swedish Ö is a separate letter, located last in the alphabet (from >> A to Z we have the exact same alphabet as English, and then come Å, >> Ä and Ö, in that order). > > Yeah, Sweedisch sorts these characters after Z but in German Ö equals > Oe in Names and O in all other cases. There have to be collation > tables for different languages as there are different one for dieffent > languages in ISO encoding. I know that the direfrence in Name and Not > name will not be implementable but the default whould be much of an > improvement. > > The same difference is between German German (de_DE) and Austrian > German (de_AT). > > Regards > Estartu I see. Well, different countries, different customs. :) (I should have included the list in my previous reply, but I hit the wrong button. I apologise for that). Regards, Rolf