3 releases
Uses new Rust 2024
new 0.0.2 | May 22, 2025 |
---|---|
0.0.1 | May 21, 2025 |
0.0.0 | May 21, 2025 |
#121 in HTTP client
85 downloads per month
22KB
373 lines
ASIMOV Bright Data Module
ASIMOV module for data import powered by the Bright Data web data platform.
✨ Features
- Imports structured data from Airbnb, Amazon, Crunchbase, eBay, Facebook, Google, Indeed, Instagram, LinkedIn, Walmart, X (aka Twitter), Yahoo, and YouTube.
- Collects the raw JSON data via the Bright Data API (requires an API key).
- Constructs a semantic knowledge graph based on the KNOW Ontology.
- Supports plain JSON output as well as RDF output formats such as JSON-LD, Turtle, and N-Triples.
🛠️ Prerequisites
- Rust 1.85+ (2024 edition)
⬇️ Installation
Installation from Source Code
cargo install asimov-brightdata-module
👉 Examples
export BRIGHTDATA_API_KEY="..."
Fetching X Profiles
asimov-brightdata-fetcher https://x.com/bendiken
asimov-brightdata-fetcher https://x.com/asimov_protocol
Fetching LinkedIn Profiles
asimov-brightdata-fetcher https://www.linkedin.com/in/arto/
asimov-brightdata-fetcher https://www.linkedin.com/company/asimov-protocol/
Fetching Crunchbase Profiles
asimov-brightdata-fetcher https://www.crunchbase.com/organization/near-f896
Fetching Amazon Products
asimov-brightdata-fetcher https://www.amazon.com/Master-Algorithm-Ultimate-Learning-Machine/dp/0465094279
⚙ Configuration
Environment Variables
BRIGHTDATA_API_KEY
: (required) the Bright Data API key to use
📚 Reference
Installed Binaries
asimov-brightdata-cataloger
: discovers entities via the Bright Data API (not implemented yet)asimov-brightdata-fetcher
: collects JSON data from the Bright Data APIasimov-brightdata-importer
: collects and transforms JSON into RDF formats (not implemented yet)
Supported Datasets
Dataset | URL Prefix | JSON | RDF |
---|---|---|---|
Airbnb | https://www.airbnb.com/rooms/ |
✅ | 🚧 |
Amazon | https://www.amazon.com/ |
✅ | 🚧 |
https://www.amazon.com/sp?seller= |
✅ | 🚧 | |
Crunchbase | https://www.crunchbase.com/organization/ |
✅ | 🚧 |
eBay | https://www.ebay.com/itm/ |
✅ | 🚧 |
https://www.facebook.com/events/ |
✅ | 🚧 | |
https://www.facebook.com/groups/ |
✅ | 🚧 | |
https://www.facebook.com/marketplace/item/ |
✅ | 🚧 | |
https://www.facebook.com/share/p/ |
✅ | 🚧 | |
https://www.google.com/shopping/product/ |
✅ | 🚧 | |
Indeed | https://www.indeed.com/cmp/ |
✅ | 🚧 |
https://www.instagram.com/ |
✅ | 🚧 | |
https://www.instagram.com/p/ |
✅ | 🚧 | |
https://www.instagram.com/reel/ |
✅ | 🚧 | |
https://www.linkedin.com/company/ |
✅ | 🚧 | |
https://www.linkedin.com/in/ |
✅ | 🚧 | |
https://www.linkedin.com/jobs/ |
✅ | 🚧 | |
https://www.linkedin.com/posts/ |
✅ | 🚧 | |
https://www.linkedin.com/pulse/ |
✅ | 🚧 | |
Walmart | https://www.walmart.com/global/seller/ |
✅ | 🚧 |
https://www.walmart.com/ip/ |
✅ | 🚧 | |
X (Twitter) | https://x.com/ |
✅ | 🚧 |
Yahoo | https://finance.yahoo.com/quote/ |
✅ | 🚧 |
YouTube | https://www.youtube.com/@ |
✅ | 🚧 |
https://www.youtube.com/watch?v= |
✅ | 🚧 | |
👨💻 Development
git clone https://github.com/asimov-modules/asimov-brightdata-module.git
Dependencies
~15–31MB
~506K SLoC