Release Name: 0.1.0
Notes:
WP2TXT is a converter program that extracts text data (sentences, to be exact) from an XML database dump file of the Japanese Wikipedia. It is intended primarily for corpus linguists who study the Japanese language. WP2TXT makes it possible to build a large corpus of written Japanese that is easy to use with any text-processing software.
Changes:
0.1.0
-----
* Initial release