28 stable releases (3 major)

4.0.5 Oct 29, 2024
4.0.4 Oct 27, 2024
3.0.10 Oct 18, 2024
2.0.0 Oct 8, 2024
1.0.0 Sep 10, 2024

#412 in Text processing

Download history 118/week @ 2024-09-05 46/week @ 2024-09-12 14/week @ 2024-09-19 14/week @ 2024-09-26 964/week @ 2024-10-03 873/week @ 2024-10-10 727/week @ 2024-10-17 314/week @ 2024-10-24 50/week @ 2024-10-31 2/week @ 2024-11-14 48/week @ 2024-11-28 212/week @ 2024-12-05

262 downloads per month

Apache-2.0

35KB
677 lines

UAST: Unicode Aware Saṃskṛta Transliteration

[!IMPORTANT] For implementation in more than 10 scripts, visit https://uast.dev for web-version or https://github.com/aneri0x4f/uast-cli for CLI version. This Rust implementation does the following conversions:

  • [d]: UAST-IO/IAST to देवनागरी
  • [i]: देवनागरी to IAST
  • [g]: देवनागरी to ગુજરાતી

UAST - A tool for transliterating and typing Saṃskṛta in the easiest computer, language, and human-friendly way.

Direct installation

To directly install with Cargo,

cargo install -v uast

This is the preferred method as it handles everything for you.

Manual installation

To install this program, you will need to install Rust.

After that, simply run:

cargo install -v --git https://github.com/dhruvildave/uast.rs

To build locally,

make

To remove,

make clean

If you use this repository, please cite the following paper:

@misc{uast_2022,
  doi = {10.48550/ARXIV.2203.14277},
  url = {https://arxiv.org/abs/2203.14277},
  author = {Dalwadi, Aneri and Dave, Dhruvil},
  keywords = {Human-Computer Interaction (cs.HC), FOS: Computer and information sciences, H.5.2},
  title = {UAST: Unicode Aware Sanskrit Transliteration},
  publisher = {arXiv},
  year = {2022},
  copyright = {Creative Commons Attribution 4.0 International}
}

poster

Devanāgarī is the writing system that is adapted by various languages like Sanskrit (IAST: saṃskṛta). International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme for romanisation of Sanskrit language. IAST makes use of diacritics to represent various characters. On a computer, these are represented using Unicode standard which differs from how the Sanskrit language behaves at a very fundamental level. This results in an issue that is encountered while designing typesetting software for devanāgarī and IAST. We hereby discuss the problems and provide a solution that solves the issue of incompatibilities between various transliteration and encoding schemes.

Web version URL: https://uast.dev

Click on the encoding name for available options.

No runtime deps