# TeaserGen: Generating Teasers for Long Documentaries

Weihan Xu, Paul Pu Liang, Haven Kim, Julian McAuley, Taylor Berg-Kirkpatrick, Hao-Wen Dong

*The International Conference on Learning Representations (ICLR), 2025*

[Paper] [Demo Page] [Pretrained Model]

This repository contains the official implementation of "TeaserGen: Generating Teasers for Long Documentaries". (The codebase is still under construction.)
- Overview
- Prerequisites
- Dataset Annotation and Processing
- Narration Generation
- TeaserGen-PT
- TeaserGen-LR
- Evaluation
- Reproducibility
- Interactive Demo on Gradio: Coming Soon!
- Acknowledgement
- Final Note
- Citation
## Prerequisites

```sh
conda env create -f newgpt.yml
```

## Dataset Annotation and Processing

We annotate the point that separates the teaser from the main documentary and save the annotations in `annotation/annotation.csv`.
You can find the detailed data processing code under `./data_preprocessing`.

The general data processing pipeline is:

- Download raw data from YouTube: `./data_preprocessing/video_download.py`
- Preprocess the raw video by separating the audio track from the video: `./data_preprocessing/video_preprocessing.py`
- Audio separation: `./data_preprocessing/audio_preprocess.py`
- Transcription with timestamped Whisper or WhisperX: `./data_preprocessing/whisperx.py`
- Prepare your CLIP features: extract frames from the video with `./data_preprocessing/frame_extraction.py`, then run `./data_preprocessing/clip_frame_feat_extractor.py` and `./data_preprocessing/clip_text_feat_extractor.py` (a sketch of this step follows the list). We also provide additional scene-level processing in `scene_process.py`.
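For orientation, here is a minimal sketch of what the frame-feature extraction step might look like, using the Hugging Face `transformers` CLIP implementation. The checkpoint, frame directory, and output format below are assumptions for illustration; the repository's scripts may use a different CLIP backend and layout.

```python
# Minimal sketch of CLIP frame-feature extraction. Assumption: frames were
# already extracted to a directory of JPEGs by frame_extraction.py; the
# repo's scripts may differ in checkpoint and output format.
from pathlib import Path

import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")
model.eval()

frame_dir = Path("frames")  # hypothetical output directory of frame_extraction.py
features = []
with torch.no_grad():
    for frame_path in sorted(frame_dir.glob("*.jpg")):
        image = Image.open(frame_path).convert("RGB")
        inputs = processor(images=image, return_tensors="pt")
        feat = model.get_image_features(**inputs)                # shape (1, 512)
        features.append(feat / feat.norm(dim=-1, keepdim=True))  # L2-normalize

frame_feats = torch.cat(features)  # shape (num_frames, 512)
torch.save(frame_feats, "clip_frame_feats.pt")
```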
## Narration Generation

- Feed your transcribed text to `narr_gen/text_gpt4.py` to generate narrations (a sketch of this step follows the list).
- Get the audio track and corresponding audio length with `narr_gen/text2speech.py`.
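As a rough illustration of the narration step, the sketch below prompts GPT-4 with a transcript using the official `openai` Python client (v1 style). The prompt, model name, and temperature are illustrative assumptions, not the exact settings used in `narr_gen/text_gpt4.py`.

```python
# Illustrative only: the actual prompt and parameters live in narr_gen/text_gpt4.py.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def generate_narration(transcript: str) -> str:
    """Ask GPT-4 to condense a documentary transcript into a teaser narration."""
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system",
             "content": "You write short, engaging teaser narrations for documentaries."},
            {"role": "user",
             "content": f"Write a teaser narration for this transcript:\n\n{transcript}"},
        ],
        temperature=0.7,
    )
    return response.choices[0].message.content


narration = generate_narration(open("transcript.txt").read())
print(narration)
```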
## TeaserGen-PT

- To use the pretrained model from UniVTG, see `./teasergen-pt/gpt_tf_queue.py`.
- To use the model finetuned on DocumentaryNet, see `./teasergen-pt/gpt_ft_queue.py`.
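For intuition, the sketch below shows the general pattern of matching each narration sentence to a video interval while avoiding reuse of overlapping intervals. The `ground` callable is a hypothetical stand-in for a UniVTG-style temporal grounding model, and the overlap bookkeeping is a simplification; the actual selection logic lives in the two scripts above.

```python
# Conceptual sketch of narration-to-interval matching. `ground` is a
# hypothetical stand-in for a temporal grounding model that returns
# candidate (start_sec, end_sec, score) intervals for a text query.
from typing import Callable

Interval = tuple[float, float, float]  # (start_sec, end_sec, score)


def overlaps(a: tuple[float, float], b: tuple[float, float]) -> bool:
    return a[0] < b[1] and b[0] < a[1]


def select_intervals(sentences: list[str],
                     ground: Callable[[str], list[Interval]]) -> list[tuple[float, float]]:
    """Pick one interval per narration sentence, skipping intervals already used."""
    used: list[tuple[float, float]] = []
    picks: list[tuple[float, float]] = []
    for sentence in sentences:
        # Try candidates from highest to lowest grounding score.
        for start, end, _ in sorted(ground(sentence), key=lambda x: -x[2]):
            if not any(overlaps((start, end), u) for u in used):
                used.append((start, end))
                picks.append((start, end))
                break
    return picks
```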
## TeaserGen-LR

- Prepare your training dataset: `./teasergen_lr/prepare_dataset.py`
- Training: `./teasergen_lr/train_epoch.py` or `./teasergen_lr/train_step.py`
- Decoding: `./teasergen_lr/inference.py` (a sketch of typical post-processing follows the list)
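As an illustration only, assuming the decoder yields a one-dimensional per-second score array (the real output format of `inference.py` may differ), a common post-processing step thresholds the array and merges consecutive selections into time intervals:

```python
# Assumption: a 1-D per-second score array; this shows one generic way to
# turn such an array into contiguous time intervals, not the repo's exact code.
import numpy as np


def scores_to_intervals(scores: np.ndarray, threshold: float = 0.5,
                        min_len: int = 2) -> list[tuple[int, int]]:
    """Threshold the score array and merge consecutive selected seconds."""
    selected = scores >= threshold
    intervals, start = [], None
    for t, on in enumerate(selected):
        if on and start is None:
            start = t
        elif not on and start is not None:
            if t - start >= min_len:
                intervals.append((start, t))
            start = None
    if start is not None and len(selected) - start >= min_len:
        intervals.append((start, len(selected)))
    return intervals


print(scores_to_intervals(np.array([0.1, 0.9, 0.8, 0.2, 0.7, 0.9, 0.9])))
# -> [(1, 3), (4, 7)]
```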
## Evaluation

- Generate the VTG score array: `./eval/vtgscore.py`
- Evaluate the finetuned highlight detection model: `./eval/highlight_eval.py`
- Run the overall evaluation: `./eval/evaluation.py` (an illustrative metric sketch follows the list)
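As a rough illustration of interval-based evaluation, the sketch below computes frame-level precision/recall/F1 between predicted and ground-truth intervals. This is a generic metric for orientation, not necessarily the exact metrics implemented in `./eval`.

```python
# Illustrative frame-level precision/recall/F1 between predicted and
# ground-truth (start_sec, end_sec) intervals; the repo's metrics may differ.
import math


def rasterize(intervals, duration, fps=1.0):
    """Mark each frame covered by any (start_sec, end_sec) interval."""
    n = math.ceil(duration * fps)
    mask = [False] * n
    for start, end in intervals:
        for i in range(int(start * fps), min(n, math.ceil(end * fps))):
            mask[i] = True
    return mask


def frame_f1(pred, gold, duration, fps=1.0):
    p, g = rasterize(pred, duration, fps), rasterize(gold, duration, fps)
    tp = sum(a and b for a, b in zip(p, g))
    precision = tp / max(1, sum(p))
    recall = tp / max(1, sum(g))
    return 2 * precision * recall / max(1e-9, precision + recall)


print(frame_f1([(0, 10), (30, 40)], [(5, 15), (30, 35)], duration=60))
```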
## Reproducibility

We provide all decoded arrays (TeaserGen-LR) and selected time intervals (TeaserGen-PT) in the `./reproducibility` folder. We also provide the pretrained models on the shared Google Drive.
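To inspect these artifacts, something like the following should work, assuming the decoded arrays are stored as NumPy `.npy` files (the filename below is hypothetical; list `./reproducibility` to see the actual contents):

```python
# Assumption: arrays are stored as .npy files; the filename is hypothetical.
import numpy as np

decoded = np.load("reproducibility/example_video.npy")
print(decoded.shape, decoded.dtype)
```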
## Interactive Demo on Gradio: Coming Soon!

We will release an interactive demonstration shortly.
## Acknowledgement

- Transcription is based on WhisperX.
- Audio separation is based on CDX.
- TeaserGen-PT is based on UniVTG.

We thank the authors for their open-source contributions.
## Final Note

Due to copyright concerns, we are unable to release the raw data. If you encounter any issues with your data or have any questions, feel free to reach out to Weihan at [email protected].
## Citation

```bibtex
@inproceedings{xu2025teasergen,
  title={TeaserGen: Generating Teasers for Long Documentaries},
  author={Weihan Xu and Paul Pu Liang and Haven Kim and Julian McAuley and Taylor Berg-Kirkpatrick and Hao-Wen Dong},
  booktitle={International Conference on Learning Representations},
  year={2025}
}
```

