8 releases
Uses old Rust 2015
Version | Released |
---|---|
0.3.0 | Dec 20, 2023 |
0.2.0 | Apr 1, 2021 |
0.1.5 | May 7, 2019 |
0.1.4 | Feb 17, 2018 |
0.1.2 | Nov 13, 2017 |
readability-rs
readability-rs is a library for extracting the primary readable content of a webpage. It is a Rust port of arc90's readability project, inspired by kingwkb/readability.
How to use
- Add readability to dependencies in Cargo.toml
[dependencies]
readability = "^0"
- Then, use it as below
extern crate readability;
use readability::extractor;

fn main() {
    // Fetch the page and extract its main content.
    match extractor::scrape("https://spincoaster.com/chromeo-juice") {
        Ok(product) => {
            println!("------- html ------");
            println!("{}", product.content);
            println!("---- plain text ---");
            println!("{}", product.text);
        },
        Err(_) => println!("error occurred"),
    }
}
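The plain text printed above can be long, so a common follow-up is to shorten it for logs or listings. The sketch below is a minimal, standard-library-only illustration; `preview` is a hypothetical helper and not part of the readability crate, and the sample string merely stands in for `product.text`, which is assumed to be borrowable as a `&str`.

```rust
/// Hypothetical helper (not provided by the readability crate): collapse runs
/// of whitespace and keep at most `max_chars` characters of the given text.
fn preview(text: &str, max_chars: usize) -> String {
    // split_whitespace trims both ends and splits on any run of whitespace.
    let collapsed = text.split_whitespace().collect::<Vec<_>>().join(" ");
    if collapsed.chars().count() <= max_chars {
        return collapsed;
    }
    // Truncate by characters rather than bytes so multi-byte UTF-8 stays valid.
    let mut short: String = collapsed.chars().take(max_chars).collect();
    short.push_str("...");
    short
}

fn main() {
    // Stand-in for extracted text; in real use, pass `&product.text` instead.
    let text = "  Extracted   article\ntext goes here.  ";
    println!("{}", preview(text, 17));
}
```

Counting characters instead of bytes avoids cutting a multi-byte character in half, which matters for pages that are not plain ASCII.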
Demo
Visit demo page.
Related Projects
License