Tools for retrieving Brawl Vault entries from https://forums.kc-mm.com/
This repository contains three scripts that work together to process Brawl Vault data:
- BVexport.py - Exports entries from saved web pages to Excel
- BVexportToJSON.py - Converts the Excel export to JSON format
- BVextractImages.py - Downloads all images referenced in the data
- Python 3.6 or higher
- Clone this repository:
git clone <repository-url>
cd <repository-name>- Install dependencies:
pip install pandas openpyxl beautifulsoup4 requestsScript: BVexport.py
- Save your Brawl Vault web pages to a folder
- Place
BVexport.pyin the same folder as the saved web pages - Run the script
Output: brawl_vault_full_export.xlsx - An Excel file containing all exported entries
Script: BVexportToJSON.py
- Place
BVexportToJSON.pyin the same folder asbrawl_vault_full_export.xlsx - Run the script
Output: brawl_data.json - A JSON file with the exported data
Script: BVextractImages.py
- Place
BVextractImages.pyin the same folder asbrawl_vault_full_export.xlsx - Run the script
Output:
images/- Folder containing all downloaded imagesMissing Images.txt- List of image URLs that failed to download
your-folder/
├── BVexport.py
├── BVexportToJSON.py
├── BVextractImages.py
├── [saved web pages]
├── brawl_vault_full_export.xlsx (generated)
├── brawl_data.json (generated)
├── Missing Images.txt (generated)
└── images/ (generated)
└── [downloaded images]
The typical workflow is:
- Save Brawl Vault web pages to a folder
- Run
BVexport.pyto create the Excel export - (Optional) Run
BVexportToJSON.pyif you need JSON format - (Optional) Run
BVextractImages.pyto download all images