head	1.1;
access;
symbols
	RELEASE_8_3_0:1.1
	RELEASE_9_0_0:1.1
	RELEASE_7_4_0:1.1
	RELEASE_8_2_0:1.1
	RELEASE_6_EOL:1.1
	RELEASE_8_1_0:1.1
	RELEASE_7_3_0:1.1
	RELEASE_8_0_0:1.1
	RELEASE_7_2_0:1.1
	RELEASE_7_1_0:1.1
	RELEASE_6_4_0:1.1
	RELEASE_5_EOL:1.1
	RELEASE_7_0_0:1.1
	RELEASE_6_3_0:1.1;
locks; strict;
comment	@# @;


1.1
date	2007.07.02.02.08.49;	author clsung;	state Exp;
branches;
next	;


desc
@@


1.1
log
@Add p5-Lingua-ZH-WordSegmenter 0.01, simplified Chinese Word
Segmentation.

PR:		ports/113476
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
@
text
@This is a perl version of simplified Chinese word segmentation.

The algorithm for this segmenter is to search the longest word at each
point from both left and right directions, and choose the one with
higher frequency product.

The original program is from the CPAN module Lingua::ZH::WordSegment
(http://search.cpan.org/~chenyr/) I did the follwing changes: 1) make
the interface object oriented; 2) make the internal string into utf8;
3) using sogou's dictionary (http://www.sogou.com/labs/dl/w.html) as
the default dictionary.

WWW:	http://search.cpan.org/dist/Lingua-ZH-WordSegmenter/
@
