#github #type #pdf #typed #text #nursery #pdf-rs

bin+lib typed-pdf

A nursery of additional types for PDF-RS (https://github.com/pdf-rs/pdf)

1 unstable release

0.1.0 May 11, 2021

#117 in #typed

MIT license

30KB
732 lines

Typed PDF

A nursery of additional types for PDF-RS (https://github.com/pdf-rs/pdf).

It currently contains an enum for page content operations (https://docs.rs/pdf/0.7.1/pdf/content/struct.Operation.html) and a binary that will attempt to parse a set of PDF files and print operations that it can't type.

At the moment, operations have the following issues:

  • Text is parsed into rust strings, if they cannot be parsed into rust strings they're considered unknown operations. This happens a lot more regularly that I had expected.

  • The following operations are not implemented, they're mostly property lists and binary data:

    • BDC
    • BI
    • BMC
    • BX
    • DP
    • EI
    • EMC
    • EX
    • ID

Dependencies

~10MB
~189K SLoC