#pdf #pdf2txt #pdf2text #text

yanked pdf-extract-temporary-migitation-panic

A library to extract content from pdfs

0.7.1 Feb 18, 2023

#196 in #pdf

MIT license

355KB
9K SLoC

pdf-extract

Build Status crates.io Documentation

A rust library to extract content from PDF files.

let bytes = std::fs::read("tests/docs/simple.pdf").unwrap();
let out = pdf_extract::extract_text_from_mem(&bytes).unwrap();
assert!(out.contains("This is a small demonstration"));

See also

Dependencies

~15MB
~163K SLoC