Date:      Fri,  8 Jun 2007 17:43:17 +0800 (CST)
From:      Gea-Suan Lin <gslin@gslin.org>
To:        FreeBSD-gnats-submit@FreeBSD.org
Cc:        gslin@gslin.org
Subject:   ports/113476: [NEW PORT] chinese/p5-Lingua-ZH-WordSegmenter: Simplified Chinese Word Segmentation
Message-ID:  <20070608094317.7CB975C1F@ccreader.NCTU.edu.tw>
Resent-Message-ID: <200706080950.l589o2XT073320@freefall.freebsd.org>


>Number:         113476
>Category:       ports
>Synopsis:       [NEW PORT] chinese/p5-Lingua-ZH-WordSegmenter: Simplified Chinese Word Segmentation
>Confidential:   no
>Severity:       non-critical
>Priority:       low
>Responsible:    freebsd-ports-bugs
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          change-request
>Submitter-Id:   current-users
>Arrival-Date:   Fri Jun 08 09:50:01 GMT 2007
>Closed-Date:
>Last-Modified:
>Originator:     Gea-Suan Lin
>Release:        FreeBSD 6.2-STABLE i386
>Organization:
>Environment:
System: FreeBSD ccreader.NCTU.edu.tw 6.2-STABLE FreeBSD 6.2-STABLE #1: Tue Jun  5 03:26:27 CST
>Description:
This is a Perl implementation of simplified Chinese word segmentation.

At each point, the segmenter searches for the longest word from both
the left and the right, and chooses the candidate with the higher
frequency product.
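
The idea above can be sketched as follows. This is only an illustrative
Python sketch of one reading of the description (whole-sentence forward
and backward longest matching, then comparing the product of word
frequencies); it is not the module's Perl code. The FREQ table, its
values, the UNKNOWN fallback and all function names are assumptions made
up for the example.

```python
from math import prod

# Hypothetical toy dictionary: word -> relative frequency (made-up values).
FREQ = {"研究": 0.004, "研究生": 0.001, "生命": 0.003, "命": 0.0005, "起源": 0.002}
MAX_LEN = max(len(w) for w in FREQ)
UNKNOWN = 1e-8  # assumed frequency for tokens missing from the dictionary


def forward(text):
    """Greedy longest match, scanning left to right."""
    words, i = [], 0
    while i < len(text):
        for n in range(min(MAX_LEN, len(text) - i), 0, -1):
            # Fall back to a single character when nothing longer matches.
            if n == 1 or text[i:i + n] in FREQ:
                words.append(text[i:i + n])
                i += n
                break
    return words


def backward(text):
    """Greedy longest match, scanning right to left."""
    words, j = [], len(text)
    while j > 0:
        for n in range(min(MAX_LEN, j), 0, -1):
            if n == 1 or text[j - n:j] in FREQ:
                words.append(text[j - n:j])
                j -= n
                break
    return words[::-1]  # restore reading order


def score(words):
    """Product of the word frequencies of one segmentation."""
    return prod(FREQ.get(w, UNKNOWN) for w in words)


def segment(text):
    """Return whichever segmentation has the higher frequency product."""
    f, b = forward(text), backward(text)
    return f if score(f) >= score(b) else b


print(segment("研究生命起源"))
```

With this toy table the forward pass yields 研究生 / 命 / 起源 and the
backward pass yields 研究 / 生命 / 起源; the backward result has the
larger product and is returned.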

The original program is the CPAN module Lingua::ZH::WordSegment
(http://search.cpan.org/~chenyr/). I made the following changes: 1) made
the interface object-oriented; 2) converted the internal strings to
UTF-8; 3) used Sogou's dictionary (http://www.sogou.com/labs/dl/w.html)
as the default dictionary.

WWW:	http://search.cpan.org/dist/Lingua-ZH-WordSegmenter/

Generated with FreeBSD Port Tools 0.77
>How-To-Repeat:
>Fix:

--- p5-Lingua-ZH-WordSegmenter-0.01.shar begins here ---
# This is a shell archive.  Save it in a file, remove anything before
# this line, and then unpack it by entering "sh file".  Note, it may
# create directories; files and directories will be owned by you and
# have default permissions.
#
# This archive contains:
#
#	p5-Lingua-ZH-WordSegmenter
#	p5-Lingua-ZH-WordSegmenter/pkg-descr
#	p5-Lingua-ZH-WordSegmenter/Makefile
#	p5-Lingua-ZH-WordSegmenter/pkg-plist
#	p5-Lingua-ZH-WordSegmenter/distinfo
#
echo c - p5-Lingua-ZH-WordSegmenter
mkdir -p p5-Lingua-ZH-WordSegmenter > /dev/null 2>&1
echo x - p5-Lingua-ZH-WordSegmenter/pkg-descr
sed 's/^X//' >p5-Lingua-ZH-WordSegmenter/pkg-descr << 'END-of-p5-Lingua-ZH-WordSegmenter/pkg-descr'
XThis is a Perl implementation of simplified Chinese word segmentation.
X
XAt each point, the segmenter searches for the longest word from both
Xthe left and the right, and chooses the candidate with the higher
Xfrequency product.
X
XThe original program is the CPAN module Lingua::ZH::WordSegment
X(http://search.cpan.org/~chenyr/). I made the following changes: 1) made
Xthe interface object-oriented; 2) converted the internal strings to
XUTF-8; 3) used Sogou's dictionary (http://www.sogou.com/labs/dl/w.html)
Xas the default dictionary.
X
XWWW:	http://search.cpan.org/dist/Lingua-ZH-WordSegmenter/
END-of-p5-Lingua-ZH-WordSegmenter/pkg-descr
echo x - p5-Lingua-ZH-WordSegmenter/Makefile
sed 's/^X//' >p5-Lingua-ZH-WordSegmenter/Makefile << 'END-of-p5-Lingua-ZH-WordSegmenter/Makefile'
X# New ports collection makefile for:	p5-Lingua-ZH-WordSegmenter
X# Date created:		2007-06-08
X# Whom:			Gea-Suan Lin <gslin@gslin.org>
X#
X# $FreeBSD$
X#
X
XPORTNAME=	Lingua-ZH-WordSegmenter
XPORTVERSION=	0.01
XCATEGORIES=	chinese perl5
XMASTER_SITES=	CPAN
XMASTER_SITE_SUBDIR=	Lingua
XPKGNAMEPREFIX=	p5-
X
XMAINTAINER=	gslin@gslin.org
XCOMMENT=	Simplified Chinese Word Segmentation
X
XPERL_CONFIGURE=	yes
X
XMAN3=		Lingua::ZH::WordSegmenter.3
X
X.include <bsd.port.mk>
END-of-p5-Lingua-ZH-WordSegmenter/Makefile
echo x - p5-Lingua-ZH-WordSegmenter/pkg-plist
sed 's/^X//' >p5-Lingua-ZH-WordSegmenter/pkg-plist << 'END-of-p5-Lingua-ZH-WordSegmenter/pkg-plist'
X@comment $FreeBSD$
X%%SITE_PERL%%/%%PERL_ARCH%%/auto/Lingua/ZH/WordSegmenter/.packlist
X%%SITE_PERL%%/Lingua/ZH/WordSegmenter.pm
X@dirrmtry %%SITE_PERL%%/Lingua/ZH
X@dirrmtry %%SITE_PERL%%/Lingua
X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Lingua/ZH/WordSegmenter
X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Lingua/ZH
X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Lingua
END-of-p5-Lingua-ZH-WordSegmenter/pkg-plist
echo x - p5-Lingua-ZH-WordSegmenter/distinfo
sed 's/^X//' >p5-Lingua-ZH-WordSegmenter/distinfo << 'END-of-p5-Lingua-ZH-WordSegmenter/distinfo'
XMD5 (Lingua-ZH-WordSegmenter-0.01.tar.gz) = 033dca8be176cd507c0b7f193ad372f1
XSHA256 (Lingua-ZH-WordSegmenter-0.01.tar.gz) = 8be1f370f3c65b933e0e0b8ca1d2d6267a5fd121d25903bdd388ed8be9d9a932
XSIZE (Lingua-ZH-WordSegmenter-0.01.tar.gz) = 1227001
END-of-p5-Lingua-ZH-WordSegmenter/distinfo
exit
--- p5-Lingua-ZH-WordSegmenter-0.01.shar ends here ---

>Release-Note:
>Audit-Trail:
>Unformatted:


