36 releases

0.7.10 Sep 26, 2024
0.7.7 May 10, 2024
0.7.4 Jan 17, 2024
0.7.2 Sep 8, 2023
0.3.0 Feb 5, 2018

#103 in Text processing

Download history 1698/week @ 2024-07-27 1603/week @ 2024-08-03 1520/week @ 2024-08-10 1654/week @ 2024-08-17 2836/week @ 2024-08-24 2034/week @ 2024-08-31 1360/week @ 2024-09-07 2032/week @ 2024-09-14 2678/week @ 2024-09-21 2462/week @ 2024-09-28 2906/week @ 2024-10-05 2512/week @ 2024-10-12 2478/week @ 2024-10-19 2969/week @ 2024-10-26 2607/week @ 2024-11-02 2717/week @ 2024-11-09

11,163 downloads per month
Used in 31 crates (21 directly)

MIT license

355KB
9K SLoC

pdf-extract

Build Status crates.io Documentation

A rust library to extract content from PDF files.

let bytes = std::fs::read("tests/docs/simple.pdf").unwrap();
let out = pdf_extract::extract_text_from_mem(&bytes).unwrap();
assert!(out.contains("This is a small demonstration"));

See also

Not PDF specific

Dependencies

~15MB
~228K SLoC