Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 5 Aug 2009 07:44:31 +0000 (UTC)
From:      Edwin Groothuis <edwin@FreeBSD.org>
To:        src-committers@freebsd.org, svn-src-user@freebsd.org
Subject:   svn commit: r196067 - user/edwin/locale/tools
Message-ID:  <200908050744.n757iVWv047701@svn.freebsd.org>

next in thread | raw e-mail | index | archive | help
Author: edwin
Date: Wed Aug  5 07:44:30 2009
New Revision: 196067
URL: http://svn.freebsd.org/changeset/base/196067

Log:
  show what a certain unicode name can be

Added:
  user/edwin/locale/tools/whatis.pl   (contents, props changed)

Added: user/edwin/locale/tools/whatis.pl
==============================================================================
--- /dev/null	00:00:00 1970	(empty, because file is newly added)
+++ user/edwin/locale/tools/whatis.pl	Wed Aug  5 07:44:30 2009	(r196067)
@@ -0,0 +1,29 @@
+#!/bin/sh
+
+UNIDATA=/home/edwin/unicode/UNIDATA/5.2.0/UnicodeData.txt
+CHARMAPS=/home/edwin/svn/edwin/locale/tools/charmaps
+UTF8=~/unicode/cldr/1.7.1/posix/UTF-8.cm
+
+if [ -z "$1" ]; then
+	echo "Usage: $0 <unicode string>"
+	exit
+fi
+
+UCS=$*
+UCS_=$(echo $* | sed -e 's/ /./g')
+echo UCS: ${UCS}
+
+echo UTF-8.cm:
+grep "${UCS_}" ${UTF8} | sed -e 's/   */	/g'
+
+echo UNIDATA:
+grep "${UCS_}" ${UNIDATA}
+L=$(grep "${UCS_}" ${UNIDATA})
+
+echo UCC:
+grep "${UCS_}" ${UNIDATA} | awk -F\; '{ print $1 }'
+
+
+echo CHARMAPS:
+grep ${UCS_} ${CHARMAPS}/* | sed -e "s|${CHARMAPS}/||g"
+grep ${UCC} ${CHARMAPS}/* | sed -e "s|${CHARMAPS}/||g"



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200908050744.n757iVWv047701>