|new 1.46.2||Sep 28, 2023|
|1.46.1||Sep 25, 2023|
|1.37.4||Aug 30, 2023|
|1.34.4||Jul 26, 2023|
|1.26.7||Mar 22, 2023|
#2 in #crawler
1,472 downloads per month
A spider worker to decentralize the crawl lifting.
This project depends on the spider crate.
The worker starts on port 3030 and the scraper for html gathering on 3031 by default.
SPIDER_WORKER_PORT=3030 SPIDER_WORKER_SCRAPER_PORT=3031 cargo run
scrape- When the html is needed run the instance with the flag. Requires spider feature flag matching on the client to start. This also starts the instance on port 3031 instead.
full_resources- Start the basic worker to gather links and scraper together.
tls- Enable tls support use the env variables
.rsafile. Defaults to
By default the instance runs on port
SPIDER_WORKER_PORT to adjust the port.
The scraper runs on port
3031 when enabled use
SPIDER_WORKER_SCRAPER_PORT to adjust the port.