206 stable releases
new 2.13.13 | Nov 21, 2024 |
---|---|
2.13.11 | Nov 20, 2024 |
2.11.0 | Oct 31, 2024 |
2.6.21 | Sep 30, 2024 |
0.1.3 | Jul 24, 2024 |
#2129 in Web programming
10,712 downloads per month
695KB
14K
SLoC
spider_utils
Utilities to use to help with getting the most out of spider.
CSS Scraping
use spider::{
hashbrown::HashMap,
packages::scraper::Selector,
};
use spider_utils::{QueryCSSMap, QueryCSSSelectSet, build_selectors, css_query_select_map_streamed};
async fn css_query_selector_extract() {
let map = QueryCSSMap::from([(
"list",
QueryCSSSelectSet::from([".list", ".sub-list"]),
)]);
let data = css_query_select_map_streamed(
r#"<html>
<body>
<ul class="list"><li>First</li></ul>
<ul class="sub-list"><li>Second</li></ul>
</body>
</html>"#,
&build_selectors(map),
)
.await;
println!("{:?}", data);
// {"list": ["First", "Second"]}
}
Features
You can use the feature flag indexset
to order the CSS scraping extraction order.
Dependencies
~20–35MB
~620K SLoC