1 unstable release

0.2.4 Dec 22, 2023

#17 in #sitemap

88 downloads per month
Used in sws-lua

MIT/Apache

75KB
2K SLoC

Sitemap Web Scraper

Sitemap Web Scraper (sws) is a tool for simple, flexible, yet performant web page scraping.

It consists of a CLI written in Rust that crawls web pages and executes a LuaJIT script to scrape them, writing the results to a CSV file.

sws crawl --script examples/fandom_mmh7.lua -o result.csv

See the documentation for more details.


lib.rs:

HTML parsing and querying with CSS selectors.

A partial port of the scraper crate, built on sws-tree, which uses Rc instead of references with lifetimes.
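
To illustrate the design choice mentioned above, here is a minimal, standard-library-only sketch of a tree whose nodes are shared through Rc rather than borrowed with lifetimes. It is not the sws-tree or sws-scraper API: the `Node` type and the `append`, `select_by_tag`, and `build_example` functions are hypothetical names invented for this example, and matching on a bare tag name stands in for real CSS selector matching.

```rust
use std::cell::RefCell;
use std::rc::{Rc, Weak};

/// Hypothetical node type for illustration only (not the sws-tree API).
struct Node {
    tag: String,
    /// Weak back-pointer, so parent and child do not keep each other alive.
    parent: RefCell<Weak<Node>>,
    children: RefCell<Vec<Rc<Node>>>,
}

impl Node {
    fn new(tag: &str) -> Rc<Node> {
        Rc::new(Node {
            tag: tag.to_string(),
            parent: RefCell::new(Weak::new()),
            children: RefCell::new(Vec::new()),
        })
    }

    /// Attach `child` as the last child of `parent`.
    fn append(parent: &Rc<Node>, child: Rc<Node>) {
        *child.parent.borrow_mut() = Rc::downgrade(parent);
        parent.children.borrow_mut().push(child);
    }
}

/// Return owned handles to every node in the subtree (root included)
/// whose tag equals `tag`, in document order.
fn select_by_tag(root: &Rc<Node>, tag: &str) -> Vec<Rc<Node>> {
    let mut found = Vec::new();
    let mut stack = vec![Rc::clone(root)];
    while let Some(node) = stack.pop() {
        if node.tag == tag {
            found.push(Rc::clone(&node));
        }
        // Push children in reverse so they are popped in document order.
        for child in node.children.borrow().iter().rev() {
            stack.push(Rc::clone(child));
        }
    }
    found
}

/// Build the toy tree: html > body > [a, div > [a]].
fn build_example() -> Rc<Node> {
    let html = Node::new("html");
    let body = Node::new("body");
    let div = Node::new("div");
    Node::append(&div, Node::new("a"));
    Node::append(&body, Node::new("a"));
    Node::append(&body, Rc::clone(&div));
    Node::append(&html, body);
    html
}

fn main() {
    let root = build_example();
    let links = select_by_tag(&root, "a");
    for link in &links {
        // The parent is reachable from the handle itself, without a borrow of `root`.
        let parent = link.parent.borrow().upgrade();
        let parent_tag = parent.map(|p| p.tag.clone()).unwrap_or_default();
        println!("<{}> inside <{}>", link.tag, parent_tag);
    }
}
```

Because `select_by_tag` returns `Vec<Rc<Node>>` instead of `Vec<&'a Node>`, the handles carry no lifetime parameter and can be stored in structs or passed across API boundaries freely, at the cost of reference counting and runtime borrow checks through RefCell.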

Dependencies

~4–10MB
~89K SLoC