A script for collecting the USPTO Backgrounds dataset in a language-modelling-friendly format.
To run, do:
git clone https://github.com/cfoster0/pile_uspto.git
cd pile_uspto
virtualenv env
. env/bin/activate
pip install -r requirements.txt
python main.py
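
If the `virtualenv` command is not installed, the `virtualenv env` and activate steps above can probably be replaced with Python 3's built-in `venv` module. This is a minimal sketch, not something the repository documents: it assumes a `python3` executable is on the PATH, and on some Linux distributions (Debian and Ubuntu, for example) the `venv` module is packaged separately as `python3-venv`.

```shell
# Assumption: python3 is available; this stands in for `virtualenv env`.
python3 -m venv env
# Activate the environment exactly as in the steps above.
. env/bin/activate
```

The remaining steps (`pip install -r requirements.txt` and `python main.py`) are then run unchanged inside the activated environment.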