#readability #port #content #updated #liveboat #arc90

readability-liveboat

Port of arc90's readability project to Rust, updated for use with liveboat

4 releases

Uses old Rust 2015

new 0.3.3 Jan 20, 2025
0.3.2 Jan 20, 2025
0.3.1 Jan 13, 2025
0.3.0 Jan 11, 2025

#2 in #updated

Download history 286/week @ 2025-01-08 375/week @ 2025-01-15

661 downloads per month
Used in liveboat

MIT license

27KB
701 lines

readability-rs

NOTE:

This is a fork of readability-rs that contains changes required by the Liveboat project. I am not the original author of the project.


readability-rs is a library for extracting the primary readable content of a webpage. It is a Rust port of arc90's readability project, inspired by kingwkb/readability.
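
To give a rough feel for what "primary readable content" means, here is a toy sketch of a readability-style scoring heuristic: each text block gets one base point, one point per comma, and up to three points for length. It is not taken from this crate's source. The function names `score_block` and `pick_main_block` are hypothetical, and a real extractor works on the parsed HTML tree rather than on plain strings.

```rust
/// Score a block of text with a simplified readability-style heuristic:
/// one base point, one point per comma, and up to three points for length
/// (one point per 100 characters).
/// Hypothetical helper for illustration only, not part of the crate.
fn score_block(text: &str) -> usize {
    let commas = text.matches(',').count();
    let length_bonus = std::cmp::min(text.chars().count() / 100, 3);
    1 + commas + length_bonus
}

/// Return the highest-scoring block, or None if the slice is empty.
fn pick_main_block<'a>(blocks: &[&'a str]) -> Option<&'a str> {
    blocks.iter().copied().max_by_key(|b| score_block(b))
}

fn main() {
    // Stand-ins for text pulled out of different parts of a page.
    let blocks = [
        "Home | About | Contact",
        "Readability tools try to find the article body, which tends to be long, comma-rich prose, rather than menus, footers, or sidebars.",
        "Copyright 2025",
    ];
    if let Some(best) = pick_main_block(&blocks) {
        println!("{}", best);
    }
}
```

Navigation and footer strings tend to be short and comma-free, so they score low, while article prose accumulates points.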

How to use

  • Add readability to dependencies in Cargo.toml
[dependencies]
readability = "^0"
  • Then, use it as below

extern crate readability;
use readability::extractor;

fn main() {
  match extractor::scrape("https://spincoaster.com/chromeo-juice") {
      Ok(product) => {
          println!("------- html ------");
          println!("{}", product.content);
          println!("---- plain text ---");
          println!("{}", product.text);
      },
      Err(_) => println!("error occurred"),
  }
}
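
The `[dependencies]` entry above names the crate `readability`. To build against this fork while keeping `extern crate readability;` unchanged, Cargo's dependency renaming may help. A sketch, assuming the fork is published under the name `readability-liveboat` and that a 0.3 release is the one wanted:

```toml
[dependencies]
# Assumption: pulls the fork, but exposes it to the code as `readability`.
readability = { package = "readability-liveboat", version = "0.3" }
```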

Demo

Visit demo page.

License

MIT

Dependencies

~8–21MB
~299K SLoC