#pdf #pdf2txt #pdf2text #text

pdf-extract-temporary-mitigation-panic

A library to extract content from pdfs

1 unstable release

0.7.1 Feb 18, 2023

#119 in #pdf

34 downloads per month

MIT license

355KB
9K SLoC

pdf-extract

Build Status crates.io Documentation

A rust library to extract content from PDF files.

let bytes = std::fs::read("tests/docs/simple.pdf").unwrap();
let out = pdf_extract::extract_text_from_mem(&bytes).unwrap();
assert!(out.contains("This is a small demonstration"));

See also

Dependencies

~15MB
~170K SLoC