Skip to content

asimov-modules/asimov-brightdata-module

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

43 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

ASIMOV Bright Data Module

License Package on Crates.io Package on PyPI Package on RubyGems Package on NPM

ASIMOV module for data import powered by the Bright Data web data platform.

✨ Features

  • Imports structured data from Airbnb, Amazon, Crunchbase, eBay, Facebook, Google, Indeed, Instagram, LinkedIn, Walmart, X (aka Twitter), Yahoo, and YouTube.
  • Collects the raw JSON data via the Bright Data API (requires an API key).
  • Constructs a semantic knowledge graph based on the KNOW ontology.
  • Supports plain JSON output as well as RDF output in the form of JSON-LD.
  • Distributed as a standalone static binary with zero runtime dependencies.

πŸ› οΈ Prerequisites

  • Rust 1.85+ (2024 edition) if building from source code

⬇️ Installation

Installation from PyPI

pip install -U asimov-brightdata-module

Installation from RubyGems

gem install asimov-brightdata-module

Installation from NPM

npm install -g asimov-brightdata-module

Installation from Source Code

cargo install asimov-brightdata-module

πŸ‘‰ Examples

export BRIGHTDATA_API_KEY="..."

Fetching X Profiles

asimov-brightdata-fetcher https://x.com/bright_init   # JSON
asimov-brightdata-importer https://x.com/bright_init  # JSON-LD

Fetching LinkedIn Profiles

asimov-brightdata-fetcher https://www.linkedin.com/in/orlenchner
asimov-brightdata-importer https://www.linkedin.com/in/orlenchner
asimov-brightdata-fetcher https://www.linkedin.com/company/bright-data
asimov-brightdata-importer https://www.linkedin.com/company/bright-data

Fetching Crunchbase Profiles

asimov-brightdata-fetcher https://www.crunchbase.com/organization/brightdata
asimov-brightdata-importer https://www.crunchbase.com/organization/brightdata

Fetching Amazon Products

asimov-brightdata-fetcher https://www.amazon.com/Master-Algorithm-Ultimate-Learning-Machine/dp/0465094279
asimov-brightdata-importer https://www.amazon.com/Master-Algorithm-Ultimate-Learning-Machine/dp/0465094279

βš™ Configuration

Environment Variables

πŸ“š Reference

Installed Binaries

  • asimov-brightdata-cataloger: discovers entities via the Bright Data API (not implemented yet)
  • asimov-brightdata-fetcher: collects JSON data from the Bright Data API
  • asimov-brightdata-importer: collects and transforms JSON into JSON-LD (not implemented yet)

Supported Datasets

Dataset URL Prefix JSON RDF
Airbnb https://www.airbnb.com/rooms/ βœ… βœ…
Amazon https://www.amazon.com/ βœ… βœ…
Β  https://www.amazon.com/sp?seller= βœ… βœ…
Crunchbase https://www.crunchbase.com/organization/ βœ… βœ…
eBay https://www.ebay.com/itm/ βœ… βœ…
Facebook https://www.facebook.com/events/ βœ… βœ…
Β  https://www.facebook.com/groups/ βœ… βœ…
Β  https://www.facebook.com/marketplace/item/ βœ… βœ…
Β  https://www.facebook.com/share/p/ βœ… βœ…
Google https://www.google.com/shopping/product/ βœ… βœ…
Indeed https://www.indeed.com/cmp/ βœ… βœ…
Instagram https://www.instagram.com/ βœ… βœ…
Β  https://www.instagram.com/p/ βœ… βœ…
Β  https://www.instagram.com/reel/ βœ… βœ…
LinkedIn https://www.linkedin.com/company/ βœ… βœ…
Β  https://www.linkedin.com/in/ βœ… βœ…
Β  https://www.linkedin.com/jobs/ βœ… βœ…
Β  https://www.linkedin.com/posts/ βœ… βœ…
Β  https://www.linkedin.com/pulse/ βœ… βœ…
Walmart https://www.walmart.com/global/seller/ βœ… βœ…
Β  https://www.walmart.com/ip/ βœ… βœ…
X (Twitter) https://x.com/ βœ… βœ…
Yahoo https://finance.yahoo.com/quote/ βœ… βœ…
YouTube https://www.youtube.com/@ βœ… βœ…
Β  https://www.youtube.com/watch?v= βœ… βœ…

πŸ‘¨β€πŸ’» Development

git clone https://github.com/asimov-modules/asimov-brightdata-module.git

Share on X Share on Reddit Share on Hacker News Share on Facebook Share on LinkedIn

About

🚧 ASIMOV module for data import powered by the Bright Data web data platform.

Topics

Resources

License

Contributing

Security policy

Stars

Watchers

Forks

Contributors 5