33 stable releases (5 major)

new 6.0.0 Feb 20, 2025
5.0.0 Jan 21, 2025
4.2.0 Jan 21, 2025
4.0.5 Oct 29, 2024
1.0.0 Sep 10, 2024

#382 in Text processing

Download history 299/week @ 2024-10-25 34/week @ 2024-11-01 2/week @ 2024-11-15 103/week @ 2024-11-29 360/week @ 2024-12-06 73/week @ 2024-12-13 15/week @ 2024-12-20 16/week @ 2024-12-27 146/week @ 2025-01-03 195/week @ 2025-01-10 190/week @ 2025-01-17 21/week @ 2025-01-24 4/week @ 2025-01-31 3/week @ 2025-02-07

220 downloads per month

Apache-2.0

41KB
847 lines

UAST: Unicode Aware Saṃskṛta Transliteration

[!IMPORTANT] For implementation in more than 10 scripts, visit https://uast.dev for web-version or https://github.com/aneri0x4f/uast-cli for CLI version. This Rust implementation does the following conversions:

  • [d]: UAST-IO/IAST to देवनागरी
  • [i]: देवनागरी to IAST
  • [g]: देवनागरी to ગુજરાતી
  • [s]: SLP1 to IAST

UAST - A tool for transliterating and typing Saṃskṛta in the easiest computer, language, and human-friendly way.

Direct installation

To directly install with Cargo,

cargo install -v uast

This is the preferred method as it handles everything for you.

Manual installation

To install this program, you will need to install Rust.

After that, simply run:

cargo install -v --git https://github.com/dhruvildave/uast.rs

To build locally,

make

To remove,

make clean

If you use this repository, please cite the following paper:

@misc{uast_2022,
  doi = {10.48550/ARXIV.2203.14277},
  url = {https://arxiv.org/abs/2203.14277},
  author = {Dalwadi, Aneri and Dave, Dhruvil},
  keywords = {Human-Computer Interaction (cs.HC), FOS: Computer and information sciences, H.5.2},
  title = {UAST: Unicode Aware Saṃskṛta Transliteration},
  publisher = {arXiv},
  year = {2022},
  copyright = {Creative Commons Attribution 4.0 International}
}

poster

Devanāgarī is the writing system that is adapted by various languages like Sanskrit (IAST: saṃskṛta). International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme for romanisation of Sanskrit language. IAST makes use of diacritics to represent various characters. On a computer, these are represented using Unicode standard which differs from how the Sanskrit language behaves at a very fundamental level. This results in an issue that is encountered while designing typesetting software for devanāgarī and IAST. We hereby discuss the problems and provide a solution that solves the issue of incompatibilities between various transliteration and encoding schemes.

Web version URL: https://uast.dev

Click on the encoding name for available options.

No runtime deps