36 releases

0.7.10 Sep 26, 2024
0.7.7 May 10, 2024
0.7.4 Jan 17, 2024
0.7.2 Sep 8, 2023
0.3.0 Feb 5, 2018

#110 in Text processing

Download history 2324/week @ 2024-08-21 2377/week @ 2024-08-28 1683/week @ 2024-09-04 1792/week @ 2024-09-11 2051/week @ 2024-09-18 3016/week @ 2024-09-25 2307/week @ 2024-10-02 2809/week @ 2024-10-09 2621/week @ 2024-10-16 2605/week @ 2024-10-23 2963/week @ 2024-10-30 2581/week @ 2024-11-06 2381/week @ 2024-11-13 2217/week @ 2024-11-20 2565/week @ 2024-11-27 1875/week @ 2024-12-04

9,696 downloads per month
Used in 33 crates (22 directly)

MIT license

355KB
9K SLoC

pdf-extract

Build Status crates.io Documentation

A rust library to extract content from PDF files.

let bytes = std::fs::read("tests/docs/simple.pdf").unwrap();
let out = pdf_extract::extract_text_from_mem(&bytes).unwrap();
assert!(out.contains("This is a small demonstration"));

See also

Not PDF specific

Dependencies

~15MB
~229K SLoC