Xingyi Li1,2, Yizheng Wu1,2, Jun Cen2, Juewen Peng2, Kewei Wang1,2, Ke Xian1, Zhe Wang3, Zhiguo Cao1*, Guosheng Lin2
1Huazhong University of Science and Technology, 2Nanyang Technological University, 3SenseTime Research
Paper | arXiv | Video | Supp | Poster
This repository contains the official PyTorch implementation of our ACM MM 2024 paper "iControl3D: An Interactive System for Controllable 3D Scene Generation".
First install dependencies:
conda create -n icontrol3d python=3.10
conda activate icontrol3d
conda install pytorch=1.13.0 torchvision torchaudio pytorch-cuda=11.6 -c pytorch -c nvidia
conda install scipy scikit-image
conda install -c conda-forge diffusers transformers ftfy accelerate
pip install opencv-python
pip install -U gradio
pip install pytorch-lightning==1.7.7 einops==0.4.1 omegaconf==2.2.3
pip install timm
# Install diffusers
git clone https://github.com/takuma104/diffusers.git
cd diffusers
git checkout 9a37409663a53f775fa380db332d37d7ea75c915
pip install .
# Update transformers and huggingface_hub
pip install git+https://github.com/huggingface/transformers
pip install -U huggingface_hub
# Pytorch3D
conda install -c iopath iopath
conda install -c bottler nvidiacub
conda install pytorch3d -c pytorch3d
# skylibs
pip install --upgrade skylibs
conda install -c conda-forge openexr-python openexr
conda install -c conda-forge pyshtools
# Grounded-Segment-Anything
python -m pip install -e segment_anything
python -m pip install -e GroundingDINO
pip install opencv-python pycocotools matplotlib onnxruntime onnx ipykernel
Follow https://github.com/haofanwang/ControlNet-for-Diffusers and download pipeline_stable_diffusion_controlnet_inpaint.py to enable ControlNet for diffusers:
# assume you already know the absolute path of installed diffusers
cp pipeline_stable_diffusion_controlnet_inpaint.py PATH/pipelines/stable_diffusion
Then, import this newly added pipeline in the corresponding files (a sketch of the import lines is given below the list):
PATH/pipelines/stable_diffusion/__init__.py
PATH/pipelines/__init__.py
PATH/__init__.py
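A minimal sketch, assuming the downloaded file defines StableDiffusionControlNetInpaintPipeline (check the file for the actual class name, and follow the import pattern already used by the other pipelines in these __init__.py files):
# PATH/pipelines/stable_diffusion/__init__.py
from .pipeline_stable_diffusion_controlnet_inpaint import StableDiffusionControlNetInpaintPipeline
# PATH/pipelines/__init__.py
from .stable_diffusion import StableDiffusionControlNetInpaintPipeline
# PATH/__init__.py
from .pipelines import StableDiffusionControlNetInpaintPipeline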
Last but not least, as per haofanwang/ControlNet-for-Diffusers#6, to use any of the pretrained ControlNet control models:
Download the models and annotators from the [ControlNet Hugging Face repo](https://huggingface.co/lllyasviel/ControlNet) and place them under the models folder. Then convert the models so they can be used with the pipeline:
cd diffusers
python ./scripts/convert_controlnet_to_diffusers.py --checkpoint_path ./models/control_sd15_***.pth --dump_path ../controlnet_models/control_sd15_*** --device cpu
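For example, for the scribble model (repeat for each control type you plan to use):
python ./scripts/convert_controlnet_to_diffusers.py --checkpoint_path ./models/control_sd15_scribble.pth --dump_path ../controlnet_models/control_sd15_scribble --device cpu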
For Grounded-SAM, download the checkpoints:
cd lib/grounded_sam
wget https://dl.fbaipublicfiles.com/segment_anything/sam_vit_h_4b8939.pth
wget https://github.com/IDEA-Research/GroundingDINO/releases/download/v0.1.0-alpha/groundingdino_swint_ogc.pth
Once everything is installed, activate the environment and launch one of the interactive apps (one per control type):
conda activate icontrol3d
# scribble
python app_controlnet_inpaint.py
# depth
# python app_controlnet_inpaint_depth.py
# hed
# python app_controlnet_inpaint_hed.py
# seg
# python app_controlnet_inpaint_seg.py
# canny
# python app_controlnet_inpaint_canny.py
# mlsd
# python app_controlnet_inpaint_mlsd.py
You can add --outdoor and adjust parameters such as --box_threshold to handle outdoor scenes. Please refer to lib/utils/opt.py for more information.
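For example (the threshold value here is only illustrative; check lib/utils/opt.py for the actual defaults and option names):
python app_controlnet_inpaint.py --outdoor --box_threshold 0.3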
After this, you can use nerfstudio to train a NeRF and render videos.
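A rough sketch, assuming the exported scene data is in a nerfstudio-compatible format and nerfstudio is installed (paths are placeholders, and the exact ns-render flags depend on your nerfstudio version):
# train a NeRF on the exported scene data
ns-train nerfacto --data DATA_DIR
# render a video along a camera path
ns-render camera-path --load-config PATH_TO_CONFIG.yml --camera-path-filename CAMERA_PATH.json --output-path renders/output.mp4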
This code is built on stablediffusion-infinity, Text2Room, and many other projects. We would like to thank them for making their great code openly available.