#pdf #editing

lopdf

A Rust library for PDF document manipulation

33 releases (breaking)

✓ Uses Rust 2018 edition

0.23.0 Jul 14, 2019
0.21.0 Apr 26, 2019
0.20.0 Mar 8, 2019
0.19.0 Oct 24, 2018
0.1.1 Dec 27, 2016

#43 in Text processing

Download history 39/week @ 2019-05-02 136/week @ 2019-05-09 132/week @ 2019-05-16 99/week @ 2019-05-23 100/week @ 2019-05-30 127/week @ 2019-06-06 56/week @ 2019-06-13 169/week @ 2019-06-20 374/week @ 2019-06-27 198/week @ 2019-07-04 98/week @ 2019-07-11 85/week @ 2019-07-18 54/week @ 2019-07-25 101/week @ 2019-08-01 188/week @ 2019-08-08

559 downloads per month
Used in 15 crates (7 directly)

MIT license

305KB
9K SLoC

lopdf

Crates.io Build Status Docs

A Rust library for PDF document manipulation.

Example Code

  • Create PDF document
#[macro_use]
extern crate lopdf;
use lopdf::{Document, Object, Stream};
use lopdf::content::{Content, Operation};

let mut doc = Document::with_version("1.5");
let pages_id = doc.new_object_id();
let font_id = doc.add_object(dictionary! {
	"Type" => "Font",
	"Subtype" => "Type1",
	"BaseFont" => "Courier",
});
let resources_id = doc.add_object(dictionary! {
	"Font" => dictionary! {
		"F1" => font_id,
	},
});
let content = Content {
	operations: vec![
		Operation::new("BT", vec![]),
		Operation::new("Tf", vec!["F1".into(), 48.into()]),
		Operation::new("Td", vec![100.into(), 600.into()]),
		Operation::new("Tj", vec![Object::string_literal("Hello World!")]),
		Operation::new("ET", vec![]),
	],
};
let content_id = doc.add_object(Stream::new(dictionary! {}, content.encode().unwrap()));
let page_id = doc.add_object(dictionary! {
	"Type" => "Page",
	"Parent" => pages_id,
	"Contents" => content_id,
});
let pages = dictionary! {
	"Type" => "Pages",
	"Kids" => vec![page_id.into()],
	"Count" => 1,
	"Resources" => resources_id,
	"MediaBox" => vec![0.into(), 0.into(), 595.into(), 842.into()],
};
doc.objects.insert(pages_id, Object::Dictionary(pages));
let catalog_id = doc.add_object(dictionary! {
	"Type" => "Catalog",
	"Pages" => pages_id,
});
doc.trailer.set("Root", catalog_id);
doc.compress();
doc.save("example.pdf").unwrap();
  • Modify PDF document
let mut doc = Document::load("example.pdf")?;
doc.version = "1.4".to_string();
doc.replace_text(1, "Hello World!", "Modified text!");
doc.save("modified.pdf")?;

FAQ

  • Why keeping everything in memory as high-level objects until finally serializing the entire document?

    Normally a PDF document won't be very large, ranging form tens of KB to hundreds of MB. Memory size is not a bottle neck for today's computer. By keep the whole document in memory, stream length can be pre-calculated, no need to use a reference object for the Length entry, the resulting PDF file is smaller for distribution and faster for PDF consumers to process.

    Producing is a one-time effort, while consuming is many more.

Dependencies

~5MB
~80K SLoC