15 releases (8 stable)
2.1.0 | Dec 4, 2024 |
---|---|
2.0.0 | Sep 25, 2024 |
1.2.1 | Mar 11, 2024 |
1.2.0 | Sep 13, 2023 |
0.1.0 | Nov 26, 2019 |
#146 in Parser implementations
31,054 downloads per month
Used in 60 crates
(39 directly)
570KB
13K
SLoC
LOL HTML
Low Output Latency streaming HTML rewriter/parser with CSS-selector based API.
It is designed to modify HTML on the fly with minimal buffering. It can quickly handle very large documents, and operate in environments with limited memory resources. More details can be found in the blog post.
The crate serves as a back-end for the HTML rewriting functionality of Cloudflare Workers, but can be used as a standalone library with a convenient API for a wide variety of HTML rewriting/analysis tasks.
Documentation
Bindings for other programming languages
Example
Rewrite insecure hyperlinks:
use lol_html::{element, HtmlRewriter, Settings};
fn main() -> Result<(), Box<dyn std::error::Error>> {
let mut output = vec![];
let mut rewriter = HtmlRewriter::new(
Settings {
element_content_handlers: vec![
element!("a[href]", |el| {
let href = el
.get_attribute("href")
.expect("href was required")
.replace("http:", "https:");
el.set_attribute("href", &href)?;
Ok(())
})
],
..Settings::new()
},
|c: &[u8]| output.extend_from_slice(c)
);
rewriter.write(b"<div><a href=")?;
rewriter.write(b"http://example.com>")?;
rewriter.write(b"</a></div>")?;
rewriter.end()?;
assert_eq!(
String::from_utf8(output)?,
r#"<div><a href="https://example.com"></a></div>"#
);
Ok(())
}
License
BSD licensed. See the LICENSE file for details.
Dependencies
~7.5MB
~194K SLoC