1 unstable release

Uses old Rust 2015

0.1.0 Sep 25, 2016

#340 in Biology

AGPL-3.0

265KB
646 lines

acetylene-parser

A string parser for different chemical nomenclature.

functions

  • tokenize(string, type="formula") -> Substance

    Tokenizes a string describing a chemical, yielding a Substance with (optional) functional groups corresponding to (more) fundamental components.

    • "formula" type expects a simple "secondary school" element-symbol naming string.

    • "smiles" expects a chemical name utilizing the SMILES system.

    • TODO: "iupac" expects a chemical name utilizing the IUPAC system.

roadmap

  • implement SMILES parsing
  • improve struct based on SMILES findings
  • decide whether InChI implementation is worth it

references

SMILES

IUPAC

InChI

data

Dependencies

~3.5MB
~76K SLoC