Skip to content

kahnchana/LangToMo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

97 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

[WIP] LangToMo

Installation

Python=3.9
conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
pip install -U diffusers accelerate transformers
pip install einops matplotlib wandb
pip install dm-reverb[tensorflow] tensorflow-datasets rlds
pip install "pydantic>=2.0" --upgrade

Installation - LDM (Diffusers Code Base)

git clone https://github.com/huggingface/diffusers
cd diffusers
pip install -e .
pip install accelerate transformers datasets
pip install einops matplotlib wandb
pip install "pydantic>=2.0" --upgrade
pip install dm-reverb[tensorflow] tensorflow-datasets rlds
pip install xformers

Sythetic Setup - CALVIN

This section explains using CALVIN dataset to train our model.

Data Generation

To generate optical flow and save, run following script:

cd src/dataset
python generate_flow.py

To visualize generation data, run

python test/dataset/test_generated_flow.py

Additional code for visualization and sanity checks on CALVIN optical flow generation are found in src/dataset/calvin.py.

Training

python src/train.py --output-dir test_0XX

About

[WIP] Code for LangToMo

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published