This repo is the official implementation of "Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation".
Build the Docker image using the Dockerfile and environment.yml files provided.
Request access for CSL-Daily from the authors of "Improving Sign Language Translation with Monolingual Data by Sign Back-Translation" (Zhou et al.) and add it to folder named csldaily.
-
Create the tsv file for easier reading of the dataset using
scripts/csldaily/tsv_processing.py -
Since the dataset is divided into frames, using the script
scripts/csldaily/video_creator.pyto convert the individual frames to videos which are saved indataset_creation/csl-dailyfolder. You will need to do this for every sequence in the dataset. -
Convert the videos into lmdbs using
scripts/csldaily/image_lmdb_creator.py. You will need to do this for every video in the dataset. -
Create the pseudo-gloss dictionary pickle file with
scripts/csldaily/pseudo_gloss_zn.py -
Update the paths in
configs/base/base_utils.pyforckpt_pathandlmdb_pathto your desired checkpoint save location and base path for the lmdb directory. -
If you plan on using Weights and Biases add the api key to
environment_variables.pyand in the config files set:
cfg.logger_name = ['wandb']
Run the pretraining stage:
python main.py --config=configs/csldaily/csldaily_stage1_configs/CSL_example_s1_config.pyRun the downstream training stage:
python main.py --config=configs/csldaily/csldaily_stage2_configs/CSL_example_s2_config.pyNOTE: By default during training BLEU scores are the teacher forcing scores (obleu) every epoch. If you want to get the true BLEU scores (which takes longer) you need to get ableu scores which are run every 10 epochs on the validation set.
@inproceedings{
wong2024signgpt,
title={Sign2{GPT}: Leveraging Large Language Models for Gloss-Free Sign Language Translation},
author={Ryan Wong and Necati Cihan Camgoz and Richard Bowden},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024},
url={https://openreview.net/forum?id=LqaEEs3UxU}
}
