5 unstable releases

0.4.0 Jul 12, 2021
0.3.0 Feb 4, 2021
0.2.2 May 12, 2020
0.2.1 May 12, 2020
0.2.0 May 12, 2020

184 downloads per month
Used in 6 crates (2 directly)

MIT/Apache

47KB
1K SLoC

Oh No! More Lemmas

ohnomore consists of two tools for incorporating TüBa-D/Z-style lemmas into language processing pipelines. The first tool, ohnomore-preproc, takes TüBa-D/Z lemmas and transforms them into lemmas that are better suited to machine learning pipelines. For example:

  • Alternative lemmatizations are removed.
  • Separable prefix markers are removed.
  • Separable prefixes are removed when they are separated.
  • The special reflexive lemma #refl is replaced by the lowercased form.
  • Lemmas of truncations are replaced by their forms.
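
The transformations listed above can be sketched in a few lines of Rust. This is only an illustration under stated assumptions, not the crate's actual API: the function simplify_lemma and its parameters are hypothetical, and the sketch assumes that alternative lemmatizations are separated by '|', that separable prefixes are marked with '#', and that truncations carry the part-of-speech tag TRUNC.

```rust
/// Hypothetical helper illustrating the preprocessing rules; not part of ohnomore.
///
/// `prefix_separated` tells the function whether the separable prefix of this
/// token occurs elsewhere in the sentence as a separate token.
fn simplify_lemma(form: &str, lemma: &str, pos: &str, prefix_separated: bool) -> String {
    // The special reflexive lemma is replaced by the lowercased form.
    if lemma == "#refl" {
        return form.to_lowercase();
    }

    // Lemmas of truncations are replaced by their forms (assumed tag: TRUNC).
    if pos == "TRUNC" {
        return form.to_string();
    }

    // Alternative lemmatizations are removed: keep the first alternative
    // (assumed separator: '|').
    let first = lemma.split('|').next().unwrap_or(lemma);

    // Separable prefixes (assumed marker: '#').
    match first.rsplit_once('#') {
        // The prefix is a separate token: drop the prefix and keep the base verb.
        Some((_, base)) if prefix_separated => base.to_string(),
        // The prefix is attached: drop only the marker.
        Some(_) => first.replace('#', ""),
        // No prefix marker at all.
        None => first.to_string(),
    }
}
```

With these assumptions, simplify_lemma("abgefahren", "ab#fahren", "VVPP", false) yields "abfahren", while the same lemma with prefix_separated set to true yields "fahren". Keeping the first alternative is a simplification chosen for the sketch; the real tool may select alternatives differently.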

The second tool, ohnomore, performs the opposite transformation (as far as is feasible).

Dependencies

~4.5MB
~87K SLoC
