[ Source: unidic-mecab ]
Package: unidic-mecab (202302-1)
Experimental package
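Warning: This package is from the experimental distribution. That means it is likely to be unstable or buggy, and it may even cause data loss. Be sure to consult the changelog and any other available documentation before using it.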
Dictionary for Mecab (Corpus of Contemporary Written Japanese)
unidic-mecab is a dictionary for Mecab (a Japanese morphological analysis implementation), based on the Corpus of Contemporary Written Japanese (upstream publishes it as unidic-cwj).
* All entries are based on the definition of "SUW (short-unit word)" specified by NINJAL (The National Institute for Japanese Language and Linguistics), which provides word segmentation in uniformly sized units suited to linguistic research.
* It has a three-layered structure:
  - lemma
  - form
  - spelling
  This allows a clear distinction between two types of word variant: spelling variants and form variants.
* It is useful for speech-processing research, since accent and sound-shift information can be added to it.
This package is huge. You need more than 10GB of free space to download and install.
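The three-layered structure described above (lemma, form, spelling) can be pictured as a nested mapping. The following is only a minimal sketch: the `Entry` type, the helper names, and the sample rows are hypothetical illustrations, and they are neither read from the packaged dictionary nor a reflection of its actual file format.

```python
from collections import defaultdict
from typing import NamedTuple


class Entry(NamedTuple):
    """One hypothetical dictionary row: lemma > form > spelling."""
    lemma: str     # top layer: the abstract word
    form: str      # middle layer: a pronunciation-level variant
    spelling: str  # bottom layer: a written variant of that form


# Illustrative sample rows (assumed for this sketch, not dictionary output).
ENTRIES = [
    Entry("矢張り", "ヤハリ", "やはり"),
    Entry("矢張り", "ヤハリ", "矢張り"),
    Entry("矢張り", "ヤッパリ", "やっぱり"),
]


def build_layers(entries):
    """Group spellings under forms, and forms under lemmas."""
    layers = defaultdict(lambda: defaultdict(set))
    for e in entries:
        layers[e.lemma][e.form].add(e.spelling)
    return layers


def variant_type(a, b):
    """Classify how two entries relate within the three layers."""
    if a.lemma != b.lemma:
        return "different word"
    if a.form != b.form:
        return "form variant"
    if a.spelling != b.spelling:
        return "spelling variant"
    return "same entry"


if __name__ == "__main__":
    layers = build_layers(ENTRIES)
    for lemma, forms in layers.items():
        print(lemma)
        for form, spellings in forms.items():
            print("  ", form, sorted(spellings))
    print(variant_type(ENTRIES[0], ENTRIES[1]))
    print(variant_type(ENTRIES[0], ENTRIES[2]))
```

In this sketch, two entries that share a lemma and a form but differ in writing count as spelling variants, while entries that share only the lemma count as form variants.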
Other packages related to unidic-mecab
- rec: mecab (>= 0.96)
  Japanese morphological analysis system
- rec: mecab-utils (>= 0.96)
  Support programs of Mecab