11 unstable releases (4 breaking)
0.27.1 | Aug 25, 2023 |
---|---|
0.27.0 | Jul 10, 2023 |
0.26.0 | Jul 7, 2023 |
0.25.0 | May 21, 2023 |
0.1.2 | Feb 20, 2020 |
#1084 in Text processing
12,158 downloads per month
Used in 14 crates
(3 directly)
71KB
1.5K
SLoC
Lindera IPADIC NEologd Builder
IPADIC NEologd dictionary builder for Lindera. This project fork from kuromoji-rs.
Dictionary version
This repository contains mecab-ipadic-neologd.
Dictionary format
Refer to the manual for details on the IPADIC dictionary format and part-of-speech tags.
Index | Name (Japanese) | Name (English) | Notes |
---|---|---|---|
0 | 表層形 | Surface | |
1 | 左文脈ID | Left context ID | |
2 | 右文脈ID | Right context ID | |
3 | コスト | Cost | |
4 | 品詞 | Major POS classification | |
5 | 品詞細分類1 | Middle POS classification | |
6 | 品詞細分類2 | Small POS classification | |
7 | 品詞細分類3 | Fine POS classification | |
8 | 活用形 | Conjugation type | |
9 | 活用型 | Conjugation form | |
10 | 原形 | Base form | |
11 | 読み | Reading | |
12 | 発音 | Pronunciation |
User dictionary format (CSV)
Simple version
Index | Name (Japanese) | Name (English) | Notes |
---|---|---|---|
0 | 表層形 | surface | |
1 | 品詞 | Major POS classification | |
2 | 読み | Reading |
Detailed version
Index | Name (Japanese) | Name (English) | Notes |
---|---|---|---|
0 | 表層形 | Surface | |
1 | 左文脈ID | Left context ID | |
2 | 右文脈ID | Right context ID | |
3 | コスト | Cost | |
4 | 品詞 | POS | |
5 | 品詞細分類1 | POS subcategory 1 | |
6 | 品詞細分類2 | POS subcategory 2 | |
7 | 品詞細分類3 | POS subcategory 3 | |
8 | 活用形 | Conjugation type | |
9 | 活用型 | Conjugation form | |
10 | 原形 | Base form | |
11 | 読み | Reading | |
12 | 発音 | Pronunciation | |
13 | - | - | After 13, it can be freely expanded. |
How to use IPADIC dictionary
For more details about lindera
command, please refer to the following URL:
API reference
The API reference is available. Please see following URL:
Dependencies
~10MB
~238K SLoC