Learning to Disentangle Latent Physical Factors for Video Prediction

This repository contains datasets, code for dataset initialization and MIG evaluation scripts corresponding to:

D. Zhu, M. Munderloh, B. Rosenhahn, J. Stückler. Learning to Disentangle Latent Physical Factors for Video Prediction. German Conference on Pattern Recognition (GCPR) 2019.

A video demonstrating the results can be found here

Datasets Description

Three video datasets describing physical scenarios. Each sequence in these datasets has 10 frames in 1 second. Resolution is 128x128.

Sliding Set

Objects sliding on a plane
Varying discrete shape, scale, friction, speed and position
26000 sequences with 20000/3000/3000 for training, validation, and test.

Wall Set

Objects sliding into a wall
Varying discrete shape, scale, material (density, restitution, friction, color), initial speed and position
10125 sequences with 7425/1350/1350 for training, validation, and test.

Collision Set

Two objects sliding into each other
Varying discrete shape, scale, material (density, restitution, friction, color), initial speed and position
30000 sequences with 25000/2500/2500 for training, validation, and test.

How to Use

Datasets can be downloaded here: Datasets.zip (md5sum: 27ca28c4646c4fa77911338061f0c820)

Data are in the '.tfrecord' form. The code to load datasets can be found in the folder 'video_prediction/datastes'. The file 'scripts/eval_mig.py' demonstrates how to initialize these datasets. Besides, it is also our implementation for Mutual Information Gap evaluation. TensorFlow version is v1.12.

Our code is based on Alex X. Lee's SAVP and Ricky Tian Qi Chen's beta-TCVAE. Their License can also be found in the license file.

Citation

If you find this useful for your research, please cite the following:

@article{Deyao2019GCPR,
    author    = {Deyao Zhu and Marco Munderloh and Bodo Rosenhahn and Jörg Stückler},
    title     = {Learning to Disentangle Latent  Physical Factors for Video Prediction},
    journal   = {German Conference on Pattern Recognition (GCPR)},
    year      = {2019},
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.README		.README
scripts		scripts
video_prediction		video_prediction
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning to Disentangle Latent Physical Factors for Video Prediction

Datasets Description

Sliding Set

Wall Set

Collision Set

How to Use

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Learning to Disentangle Latent Physical Factors for Video Prediction

Datasets Description

Sliding Set

Wall Set

Collision Set

How to Use

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages