76 releases (40 breaking)

new 0.42.2 Apr 29, 2025
0.41.0 Apr 13, 2025
0.40.1 Mar 27, 2025
0.38.1 Nov 30, 2024
0.3.2 Feb 20, 2020

#2113 in Text processing

29,826 downloads per month
Used in 20 crates (via lindera)

MIT license

140KB
3K SLoC

Lindera IPADIC

Dictionary version

This repository contains mecab-ipadic.

Dictionary format

Refer to the manual for details on the IPADIC dictionary format and part-of-speech tags.

| Index | Name (Japanese) | Name (English) | Notes |
| --- | --- | --- | --- |
| 0 | 表層形 | Surface | |
| 1 | 左文脈ID | Left context ID | |
| 2 | 右文脈ID | Right context ID | |
| 3 | コスト | Cost | |
| 4 | 品詞 | Major POS classification | |
| 5 | 品詞細分類1 | Middle POS classification | |
| 6 | 品詞細分類2 | Small POS classification | |
| 7 | 品詞細分類3 | Fine POS classification | |
| 8 | 活用型 | Conjugation type | |
| 9 | 活用形 | Conjugation form | |
| 10 | 原形 | Base form | |
| 11 | 読み | Reading | |
| 12 | 発音 | Pronunciation | |
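
As an illustration of the column layout above, the following minimal sketch parses one dictionary CSV row into a typed record using only the Rust standard library. The names `DictEntry`, `parse_num`, and `parse_entry` are hypothetical and are not part of the lindera-ipadic API; the integer widths and the naive comma split (no quoted fields) are assumptions made for brevity.

```rust
use std::str::FromStr;

/// One row of an IPADIC-style dictionary CSV (13 columns, indices 0 to 12).
/// Hypothetical type for illustration; not provided by lindera-ipadic.
#[derive(Debug, PartialEq)]
pub struct DictEntry {
    pub surface: String,          // index 0
    pub left_context_id: u16,     // index 1 (integer width is an assumption)
    pub right_context_id: u16,    // index 2
    pub cost: i16,                // index 3
    pub pos: [String; 4],         // indices 4 to 7, major to fine classification
    pub conjugation_type: String, // index 8
    pub conjugation_form: String, // index 9
    pub base_form: String,        // index 10
    pub reading: String,          // index 11
    pub pronunciation: String,    // index 12
}

/// Parses a numeric column and reports which column failed.
fn parse_num<T: FromStr>(value: &str, name: &str) -> Result<T, String> {
    value
        .trim()
        .parse::<T>()
        .map_err(|_| format!("invalid {}: {:?}", name, value))
}

/// Splits a row on commas and maps each column by index.
/// Assumes no field contains a quoted comma.
pub fn parse_entry(line: &str) -> Result<DictEntry, String> {
    let f: Vec<&str> = line.trim_end().split(',').collect();
    if f.len() < 13 {
        return Err(format!("expected at least 13 fields, found {}", f.len()));
    }
    Ok(DictEntry {
        surface: f[0].to_string(),
        left_context_id: parse_num(f[1], "left context ID")?,
        right_context_id: parse_num(f[2], "right context ID")?,
        cost: parse_num(f[3], "cost")?,
        pos: [
            f[4].to_string(),
            f[5].to_string(),
            f[6].to_string(),
            f[7].to_string(),
        ],
        conjugation_type: f[8].to_string(),
        conjugation_form: f[9].to_string(),
        base_form: f[10].to_string(),
        reading: f[11].to_string(),
        pronunciation: f[12].to_string(),
    })
}
```

Calling `parse_entry` on a row such as `東京,1293,1293,3003,名詞,固有名詞,地域,一般,*,*,東京,トウキョウ,トーキョー` (illustrative values) yields a `DictEntry` whose `surface` is `東京` and whose `cost` is `3003`; a row with fewer than 13 fields returns an error.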

User dictionary format (CSV)

Simple version

| Index | Name (Japanese) | Name (English) | Notes |
| --- | --- | --- | --- |
| 0 | 表層形 | Surface | |
| 1 | 品詞 | Major POS classification | |
| 2 | 読み | Reading | |

Detailed version

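| Index | Name (Japanese) | Name (English) | Notes |
| --- | --- | --- | --- |
| 0 | 表層形 | Surface | |
| 1 | 左文脈ID | Left context ID | |
| 2 | 右文脈ID | Right context ID | |
| 3 | コスト | Cost | |
| 4 | 品詞 | POS | |
| 5 | 品詞細分類1 | POS subcategory 1 | |
| 6 | 品詞細分類2 | POS subcategory 2 | |
| 7 | 品詞細分類3 | POS subcategory 3 | |
| 8 | 活用型 | Conjugation type | |
| 9 | 活用形 | Conjugation form | |
| 10 | 原形 | Base form | |
| 11 | 読み | Reading | |
| 12 | 発音 | Pronunciation | |
| 13 | - | - | Fields from index 13 onward can be freely extended. |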
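
The two user dictionary layouts above can be told apart by their field count. The sketch below assumes that rule (3 fields for the simple version, 13 or more for the detailed version) purely for illustration; it does not describe how Lindera itself loads user dictionaries. `UserEntry`, `parse_field`, and `parse_user_entry` are hypothetical names, and the comma split again assumes no quoted fields.

```rust
/// A user dictionary row in either layout.
/// Hypothetical type for illustration; not provided by lindera-ipadic.
#[derive(Debug, PartialEq)]
pub enum UserEntry {
    /// Simple version: surface, POS, reading.
    Simple {
        surface: String,
        pos: String,
        reading: String,
    },
    /// Detailed version: surface, context IDs, cost, then every remaining
    /// column (index 4 onward, including any free-form extras past index 12).
    Detailed {
        surface: String,
        left_context_id: u16,
        right_context_id: u16,
        cost: i16,
        details: Vec<String>,
    },
}

/// Parses a numeric column, keeping the offending text in the error.
fn parse_field<T: std::str::FromStr>(value: &str) -> Result<T, String> {
    value
        .trim()
        .parse::<T>()
        .map_err(|_| format!("invalid number: {:?}", value))
}

/// Classifies a row by field count (an assumption of this sketch).
pub fn parse_user_entry(line: &str) -> Result<UserEntry, String> {
    let fields: Vec<&str> = line.trim_end().split(',').collect();
    match fields.len() {
        3 => Ok(UserEntry::Simple {
            surface: fields[0].to_string(),
            pos: fields[1].to_string(),
            reading: fields[2].to_string(),
        }),
        n if n >= 13 => Ok(UserEntry::Detailed {
            surface: fields[0].to_string(),
            left_context_id: parse_field(fields[1])?,
            right_context_id: parse_field(fields[2])?,
            cost: parse_field(fields[3])?,
            details: fields[4..].iter().map(|s| s.to_string()).collect(),
        }),
        n => Err(format!("expected 3 or at least 13 fields, found {}", n)),
    }
}
```

Keeping the trailing columns in a `Vec<String>` lets a detailed row carry extra fields past index 12 without changing the type.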

API reference

The API reference is available. Please see the following URL:

Dependencies

~12–23MB
~379K SLoC