Setup a virtual environment and run training for a few steps with run.sh:
sh protein_lm/run.sh
Implements BERT and autoregressive models for proteins, as described in Biological Structure and Function Emerge from Scaling Unsupervised Learning to 250 Million Protein Sequences and ProGen: Language Modeling for Protein Generation.