# Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting (CVPR 2025)

Jinbo Yan, Rui Peng, Zhiyan Wang, Luyang Tang, Jiayu Yang, Jie Liang, Jiahao Wu, Ronggang Wang

Arxiv | Datasets | Weights

This repository contains the official authors' implementation associated with the paper "Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting".
```bibtex
@misc{yan2025instantgaussianstreamfast,
      title={Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting},
      author={Jinbo Yan and Rui Peng and Zhiyan Wang and Luyang Tang and Jiayu Yang and Jie Liang and Jiahao Wu and Ronggang Wang},
      year={2025},
      eprint={2503.16979},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.16979},
}
```
## Setup

### Clone

```shell
git clone https://github.com/yjb6/IGS.git --recursive
```

### Install

- Python >= 3.9
- PyTorch >= 2.0.0. We have tested on torch 2.0.0+cu118, but other versions should also work fine.

```shell
pip install torch==2.0.0 torchvision==0.15.1 torchaudio==2.0.1 --index-url https://download.pytorch.org/whl/cu118
```

- Install torch_cluster by following the instructions provided in the official pytorch_cluster repository.
- Install gaussian_rasterization. We use a variant of the RaDe-GS renderer:

```shell
pip install submodules/RaDe-GS/submodules/diff-gaussian-rasterization-clamp/
pip install submodules/RaDe-GS/submodules/diff-gaussian-rasterization/
```

- Follow 3DGS to install simple-knn.
- Install the remaining packages:

```shell
pip install -r requirements.txt
```
### Extract the Prepared Data

```
.
├── bbox.json
├── sear_steak
│   ├── colmap_0
│   │   ├── images_512
│   │   ├── images_r2
│   │   └── start_gs
│   ├── colmap_...
│   │   ├── images_512
│   │   └── images_r2
└── sear_steak_total_50_interval_5.json
```
Extract the ckpt under the code directory:

```
.
├── infer_batch.py
├── ckpt
│   ├── gmflow
│   └── igs
├── ...
```
```shell
python infer_batch.py --config configs/demo.yaml
```

- Remember to replace `data.data.root_dir` in `demo.yaml` with your own directory path.
- Feel free to adjust the hyperparameters, such as `refine_iterations` and `max_num`.
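Since `data.data.root_dir` is a nested YAML key, "replacing" it means walking two levels of nesting. As a minimal sketch (the helper and the surrounding config dict are illustrative, not part of this repo):

```python
def set_by_dotted_path(cfg, dotted_key, value):
    """Set a nested config value addressed by a dotted path.

    Hypothetical helper: 'data.data.root_dir' walks cfg['data']['data']
    and sets 'root_dir', mirroring how the YAML keys nest.
    """
    *parents, leaf = dotted_key.split(".")
    node = cfg
    for key in parents:
        node = node.setdefault(key, {})
    node[leaf] = value
    return cfg

cfg = {"data": {"data": {"root_dir": "/old/path"}}}
set_by_dotted_path(cfg, "data.data.root_dir", "/path/to/IGS_data")
```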
Use this script to convert the rendered images into a video.
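If you would rather not use the provided script, driving ffmpeg directly also works. The sketch below only builds the command; the `%05d.png` frame-naming pattern, the output path, and the fps value are assumptions about your renderer's output, not fixed by this repo:

```python
import subprocess
from pathlib import Path

def frames_to_video_cmd(frame_dir, out_path, fps=30):
    """Build an ffmpeg command encoding numbered PNG frames into an mp4.

    Assumes frames are named 00000.png, 00001.png, ... inside frame_dir;
    adjust the -i pattern to match your renderer's naming scheme.
    """
    return [
        "ffmpeg", "-y",
        "-framerate", str(fps),
        "-i", str(Path(frame_dir) / "%05d.png"),
        "-c:v", "libx264",
        "-pix_fmt", "yuv420p",  # widest player compatibility
        str(out_path),
    ]

cmd = frames_to_video_cmd("renders/sear_steak", "sear_steak.mp4")
# subprocess.run(cmd, check=True)  # uncomment when ffmpeg is on PATH
```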
We can prepare our streaming dataset following 3DGStream and SpacetimeGaussian; here we provide a simple script:

```shell
cd script
./pre_test_data.sh /path/to/your/scene
cd ..
```

Then you will get:
```
<root>
|---<your_scene>
|    |---colmap_0
|    |    |---image
|    |---colmap_...
|    |---colmap_299
```
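As a quick sanity check on the generated layout, one can verify that every `colmap_<i>` folder contains an `image` subdirectory (a sketch assuming exactly the tree above):

```python
from pathlib import Path

def check_streaming_layout(root, scene, num_frames=300):
    """Return the colmap_<i> folders missing an image/ subdirectory.

    Sanity check for the tree produced by pre_test_data.sh: it expects
    root/scene/colmap_0 ... colmap_<num_frames-1>, each containing image/.
    """
    scene_dir = Path(root) / scene
    return [
        f"colmap_{i}"
        for i in range(num_frames)
        if not (scene_dir / f"colmap_{i}" / "image").is_dir()
    ]
```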
In this step, we recommend first optimizing the Gaussian model for several thousand iterations using RaDe-GS, followed by compressing the Gaussian points with LightGaussian. This process ensures an efficient and high-quality reconstruction of the 3D scene.
To facilitate this, we provide a script that includes the RaDe-GS training process, LightGaussian compression, and rendering. Before running the script, make sure to adjust the YOUR_PATH parameter to match your specific directory structure.
Install the submodules:

```shell
pip install submodules/RaDe-GS/submodules/diff-gaussian-rasterization-clamp/
pip install submodules/RaDe-GS/submodules/compress-diff-gaussian-rasterization/
```

Train:

```shell
cd submodules/RaDe-GS
./train.sh
cd -
```

Resize the images to 512x512 for processing by AGM-Net:
```shell
cd script
python subsample.py -r /path/to/your/scene
cd ..
```

(Optional) If you downsampled the images during the training of the first-frame Gaussians, resize the images to the corresponding dimensions here:

```shell
cd script
python subsample_pil.py -r /path/to/your/scene --target_size corresponding_width corresponding_height
# example: python subsample_pil.py -r /path/to/your/scene --target_size 1024 1024
cd ..
```

Use the provided script to control the interval between key frames and generate the corresponding key-frame sequence.
Additionally, you need to specify the bounding box (bbox) for the dynamic area of the scene. We have provided the BBOX configurations for the N3DV and MeetingRoom scenes. Make sure to copy them to your dataset root directory.
```
<root>
|---bbox.json
|---<scene_name>_total_<>_interval<w>.json
|---<scene_name>
|    |---colmap_0
|    |    |---start_gs
|    |    |---image
|    |    |---image_512
|    |    |---image_r2
|    |---colmap_...
|    |    |---image
|    |    |---image_512
|    |    |---image_r2
|    |---colmap_299
|    |    |---image
|    |    |---image_512
|    |    |---image_r2
```
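The `<scene_name>_total_<>_interval<w>.json` file encodes the key-frame schedule. Its exact schema is specific to this repo, but picking every interval-th frame can be sketched as follows (the dict layout is an illustrative assumption):

```python
import json

def make_keyframe_sequence(total, interval):
    """Pick every `interval`-th frame of `total` frames as a key frame.

    The dict layout below is an illustrative assumption; the repo's
    <scene>_total_<t>_interval_<i>.json schema may differ.
    """
    return {
        "total": total,
        "interval": interval,
        "keyframes": list(range(0, total, interval)),
    }

seq = make_keyframe_sequence(total=50, interval=5)
# e.g. json.dump(seq, open("sear_steak_total_50_interval_5.json", "w"), indent=2)
```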
To test the model, use the following command:

```shell
python infer_batch.py --config <path to config>
```

- We have provided the relevant configurations for N3DV and Meeting Room; the details can be found in `configs`.
Download our processed data for 4 sequences of N3DV, which can be used directly for training. It contains 1,200 optimized Gaussian point clouds and requires 150 GB of storage space.
After extraction, the directory structure is as follows:

```
.
└── IGS_data
    ├── bbox.json
    ├── coffee_martini_colmap
    ├── cook_spinach_colmap
    ├── flame_salmon_1
    ├── flame_steak_colmap
    └── N3D_train_gap_10.json
```
You can prepare additional data as follows:
Follow the tutorial to prepare the extracted images from multi-view video.
For each frame, reconstruct the corresponding Gaussians and perform rendering. We provide a script that supports multi-GPU parallel processing for Gaussian point reconstruction, compression, and rendering. Remember to change the parameters in the code.
```shell
cd submodules/RaDe-GS
python build_3dgs_dataset.py
```
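`build_3dgs_dataset.py` handles the multi-GPU dispatch internally. Conceptually, per-frame reconstruction jobs are spread round-robin across GPUs, which can be sketched like this (the `train.py -s colmap_<i>` command is an illustrative stand-in, not the script's real interface):

```python
import os
import subprocess

def build_gpu_jobs(frame_ids, num_gpus):
    """Assign per-frame reconstruction jobs round-robin across GPUs.

    Illustrative stand-in for what build_3dgs_dataset.py does internally;
    the `train.py -s colmap_<i>` command below is an assumption, not the
    script's real interface.
    """
    jobs = []
    for k, frame in enumerate(frame_ids):
        jobs.append({
            "env": {"CUDA_VISIBLE_DEVICES": str(k % num_gpus)},
            "cmd": ["python", "train.py", "-s", f"colmap_{frame}"],
        })
    return jobs

jobs = build_gpu_jobs(range(4), num_gpus=2)
# for j in jobs:
#     subprocess.Popen(j["cmd"], env={**os.environ, **j["env"]})
```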
Generate training pairs and camera groups by executing the provided script.
Manually annotate bounding boxes (BBoxes) of a scene using visualization tools such as Open3D.
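Once you have picked points spanning the dynamic region (e.g. in Open3D), the axis-aligned box is just the per-axis min/max of those points. A minimal sketch, noting that the `{"<scene>": [[min], [max]]}` layout in the comment is an assumed schema and not necessarily what `bbox.json` actually uses:

```python
import json

def aabb_from_points(points, margin=0.0):
    """Axis-aligned bounding box [min_xyz, max_xyz] enclosing `points`.

    A starting point before manual refinement in Open3D. The
    {"<scene>": [[min_xyz], [max_xyz]]} layout in the comment below is an
    assumed schema, not necessarily what bbox.json actually uses.
    """
    xs, ys, zs = zip(*points)
    lo = [min(xs) - margin, min(ys) - margin, min(zs) - margin]
    hi = [max(xs) + margin, max(ys) + margin, max(zs) + margin]
    return [lo, hi]

bbox = aabb_from_points([(0, 0, 0), (1, 2, 3), (-1, 0.5, 2)])
# e.g. json.dump({"sear_steak": bbox}, open("bbox.json", "w"), indent=2)
```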
```shell
accelerate launch --config_file acc_cfg/default_config.yaml main.py --config configs/train.yaml
```
Our work builds upon the following open-source projects and their contributions:

- 3DGStream
- LGM
- 3DGS
- RaDe-GS
- LightGaussian

We are deeply grateful to the authors and communities behind these projects for their valuable work and inspiration.