1 unstable release

0.1.0	Jun 3, 2024

#778 in Images

Apache-2.0

52KB
1K SLoC

layoutparser-ort

A simplified port of LayoutParser for detecting layout elements on documents. Runs Detectron2 and YOLOX layout models from unstructured-inference in ONNX format through onnxruntime (bindings via ort). Check out the examples for a quick start!

License

layoutparser-ort mirrors its API from LayoutParser and includes preprocessing code derived from unstructured-inference, both licensed under the Apache License 2.0. Likewise, layoutparser-ort is licensed under the Apache License 2.0.

Appendix: Similar libraries

surya: OCR, layout analysis, reading order, line detection in 90+ languages
- SegFormer (transformers: SegFormer), Donut (transformers: Donut), CRAFT (pytorch)
- License: GPLv3.0 (code), cc-by-nc-sa-4.0 (models)
  - cc-by-nc-sa-4.0: noncommerical but author "waive[s] that for any organization under $5M USD in gross revenue in the most recent 12-month period."
unstructured-inference: hosted model inference code for layout parsing models
- Models: Detectron2 (LayoutParser-PubLayNet-PyTorch, LayoutParser-PubLayNet-ONNX), YOLOX (probably trained on DocLayNet, Quantized, ONNX), Table-Transformer (transformers: Table Transformer), Donut (transformers: Donut)
- License: Apache 2.0
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
- Models: Detectron2
- License: Apache 2.0
- Documentation: https://layout-parser.readthedocs.io/en/latest/api_doc/elements.html

Dependencies

~5–21MB
~247K SLoC