11 stable releases

1.0.12	Jan 1, 2025
1.0.9	Dec 30, 2024
1.0.6	Dec 27, 2024
1.0.4	Dec 11, 2024
1.0.3	Nov 30, 2024

#167 in Text processing

491 downloads per month

MIT license

89KB
1.5K SLoC

Rust Research Paper Parser (rsrpp)

RuSt Research Paper Parser (rsrpp)

The rsrpp library provides a set of tools for parsing research papers.

Quick Start

Pre-requirements

Poppler: sudo apt install poppler-utils
OpenCV: sudo apt install libopencv-dev clang libclang-dev

Installation

To start using the rsrpp library, add it to your project's dependencies in the Cargo.toml file:

cargo install rsrpp-cli
rsrpp --help
A Rust project for research paper pdf.

Usage: rsrpp [OPTIONS] --pdf <PDF>

Options:
  -p, --pdf <PDF>  
  -o, --out <OUT>  
  -h, --help       Print help
  -V, --version    Print version

Releases

1.0.12

Fixed a bug: remove unused println!.

1.0.11

Fixed a bug in xml loop to finish when the file reaches to end.

1.0.10

Added verbose mode.
Fixed a bug in the process extracting page number.

1.0.9

Updated: implemented new errors to handle invalid URLs.

1.0.8

Update: The max retry time for saving PDF files has been increased.

1.0.7

Fix bugs: After converting to PDF, the program now waits until processing is complete.

1.0.4

Fixed bugs in get_pdf_info.
Made minor improvements.

1.0.3

Added cli -> rsrpp-cli.

1.0.2

Updated the Section module. content: String was replaced by content: Vec<TextBlock>.

Dependencies

~23–54MB
~826K SLoC