#artificial-intelligence #asimov-module #asimov #api-bindings #dataset

no-std bin+lib asimov-brightdata-module

ASIMOV module for data import powered by the Bright Data web data platform

3 releases

Uses new Rust 2024

new 0.0.2 May 22, 2025
0.0.1 May 21, 2025
0.0.0 May 21, 2025

#121 in HTTP client

Download history

85 downloads per month

Unlicense

22KB
373 lines

ASIMOV Bright Data Module

License Compatibility Package

ASIMOV module for data import powered by the Bright Data web data platform.

✨ Features

  • Imports structured data from Airbnb, Amazon, Crunchbase, eBay, Facebook, Google, Indeed, Instagram, LinkedIn, Walmart, X (aka Twitter), Yahoo, and YouTube.
  • Collects the raw JSON data via the Bright Data API (requires an API key).
  • Constructs a semantic knowledge graph based on the KNOW Ontology.
  • Supports plain JSON output as well as RDF output formats such as JSON-LD, Turtle, and N-Triples.

🛠️ Prerequisites

  • Rust 1.85+ (2024 edition)

⬇️ Installation

Installation from Source Code

cargo install asimov-brightdata-module

👉 Examples

export BRIGHTDATA_API_KEY="..."

Fetching X Profiles

asimov-brightdata-fetcher https://x.com/bendiken
asimov-brightdata-fetcher https://x.com/asimov_protocol

Fetching LinkedIn Profiles

asimov-brightdata-fetcher https://www.linkedin.com/in/arto/
asimov-brightdata-fetcher https://www.linkedin.com/company/asimov-protocol/

Fetching Crunchbase Profiles

asimov-brightdata-fetcher https://www.crunchbase.com/organization/near-f896

Fetching Amazon Products

asimov-brightdata-fetcher https://www.amazon.com/Master-Algorithm-Ultimate-Learning-Machine/dp/0465094279

⚙ Configuration

Environment Variables

📚 Reference

Installed Binaries

  • asimov-brightdata-cataloger: discovers entities via the Bright Data API (not implemented yet)
  • asimov-brightdata-fetcher: collects JSON data from the Bright Data API
  • asimov-brightdata-importer: collects and transforms JSON into RDF formats (not implemented yet)

Supported Datasets

Dataset URL Prefix JSON RDF
Airbnb https://www.airbnb.com/rooms/ 🚧
Amazon https://www.amazon.com/ 🚧
  https://www.amazon.com/sp?seller= 🚧
Crunchbase https://www.crunchbase.com/organization/ 🚧
eBay https://www.ebay.com/itm/ 🚧
Facebook https://www.facebook.com/events/ 🚧
  https://www.facebook.com/groups/ 🚧
  https://www.facebook.com/marketplace/item/ 🚧
  https://www.facebook.com/share/p/ 🚧
Google https://www.google.com/shopping/product/ 🚧
Indeed https://www.indeed.com/cmp/ 🚧
Instagram https://www.instagram.com/ 🚧
  https://www.instagram.com/p/ 🚧
  https://www.instagram.com/reel/ 🚧
LinkedIn https://www.linkedin.com/company/ 🚧
  https://www.linkedin.com/in/ 🚧
  https://www.linkedin.com/jobs/ 🚧
  https://www.linkedin.com/posts/ 🚧
  https://www.linkedin.com/pulse/ 🚧
Walmart https://www.walmart.com/global/seller/ 🚧
  https://www.walmart.com/ip/ 🚧
X (Twitter) https://x.com/ 🚧
Yahoo https://finance.yahoo.com/quote/ 🚧
YouTube https://www.youtube.com/@ 🚧
  https://www.youtube.com/watch?v= 🚧

👨‍💻 Development

git clone https://github.com/asimov-modules/asimov-brightdata-module.git

Share on X Share on Reddit Share on Hacker News Share on Facebook Share on LinkedIn

Dependencies

~15–31MB
~506K SLoC