Posted By: Zhang Chiyuan
Date: 2008-02-01 16:21
Summary: rmmseg 0.0.1 Released
Project: RMMSeg: MMSeg implementation in Ruby
rmmseg version 0.0.1 has been released!
RMMSeg is an implementation of the MMSEG Chinese word segmentation
algorithm. It is based on two variants of the maximum matching
algorithm, and both are available:
* a simple algorithm that uses only forward maximum matching.
* a complex algorithm that uses three-word chunk maximum matching and 3
additional rules to resolve ambiguities.
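To give a feel for the simple algorithm, here is a minimal sketch of forward maximum matching. It is not RMMSeg's actual code or API: the toy dictionary, the DICT and MAX_WORD_LEN constants, and the simple_segment method are all made up for illustration.

```ruby
require 'set'

# Hypothetical toy dictionary, for illustration only.
# RMMSeg uses its own dictionary data, not this list.
DICT = Set.new(%w[研究 研究生 生命 命 起源]).freeze
MAX_WORD_LEN = DICT.map(&:length).max

# Forward maximum matching: at each position, take the longest
# dictionary word that starts there. If nothing matches, fall back
# to a single character so the scan always advances.
def simple_segment(text, dict = DICT, max_len = MAX_WORD_LEN)
  words = []
  pos = 0
  while pos < text.length
    len = [max_len, text.length - pos].min
    # Shrink the candidate until it is a known word or one character.
    len -= 1 while len > 1 && !dict.include?(text[pos, len])
    words << text[pos, len]
    pos += len
  end
  words
end

p simple_segment("研究生命起源")
```

With this toy dictionary the greedy scan takes 研究生 first and is then left with 命 and 起源. This is the kind of ambiguity the complex algorithm is meant to address by comparing three-word chunks instead of committing to the first longest match.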
For more information about the algorithm, please refer to the
following essays:
* http://technology.chtsai.org/mmseg/
* http://pluskid.lifegoo.com/?p=261
Changes:
### 0.0.1 / 2008-01-31
* Analyser integration with Ferret.
* rdoc added
* Lazily init the +Word+ objects inside the +Dictionary+.
* Handle English punctuation correctly.