This is the implementation of our ECCV'22 paper "Data Efficient 3D Learner via Knowledge Transferred from 2D Model".
We use a strong, well-trained image scene parser to augment single-view RGB-D datasets with pseudo-labels, which are then used to pre-train a 3D model in an architecture-agnostic manner. Our pre-training improves results when training with limited annotations.
Download the 3D dataset ScanNet and pre-process it with the following command:
```
python tools/scannet.py --run process_scannet --path_in <scannet_folder> --path_out data/scannet
```
You can download the file lists that split the train, validation, and test subsets with this command:
```
python tools/scannet.py --run download_filelists
```
We utilize the RGB-D dataset Matterport3D to pre-train 3D models. The pre-trained weights are provided in the link. To fine-tune or evaluate the cases below, download the pre-trained weights and place them in `ckpt`.
The code has been tested on Ubuntu 20.04 with PyTorch 1.7.
```
conda create --name 3DLearner python=3.7
conda activate 3DLearner
conda install -c conda-forge cudatoolkit-dev==10.2
conda install pytorch==1.7.0 torchvision==0.8.0 cudatoolkit=10.2 -c pytorch
sudo apt-get install openexr
sudo apt-get install libopenexr-dev
pip install -r requirement.txt
```
Follow the O-CNN tutorial to install the `ocnn` package, or run the commands below:
```
git clone https://github.com/microsoft/O-CNN.git
cd O-CNN/octree/external && git clone --recursive https://github.com/wang-ps/octree-ext.git
cd .. && mkdir build && cd build
cmake .. && cmake --build . --config Release
export PATH=`pwd`:$PATH
cd ../../pytorch
python setup.py install --build_octree
python -W ignore test/test_all.py -v
```
To fine-tune on ScanNet under the limited annotations (LA) or limited reconstructions (LR) configuration, run:
```
python LA_segmentation.py --config config/LA_scannet.yaml
python LR_segmentation.py --config config/LR_scannet.yaml
```
Key arguments in the yaml files (an illustrative config sketch follows this list):
- `train.semi`: if True, use the unlabeled data for semi-supervised training.
- `train.limit`: number of labeled points in the LA configuration.
- `train.limited_reconst`: file lists of the LR configuration.
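For illustration, the corresponding block of a training yaml might look like the sketch below. The key names are taken from the list above; the nesting and example values are assumptions, so treat the shipped configs (e.g. `config/LA_scannet.yaml`) as authoritative.
```
# Hypothetical sketch -- keys from the list above, values are examples only.
train:
  semi: True                              # use unlabeled data for semi-supervised training
  limit: 200                              # LA: number of labeled points
  limited_reconst: filelist/lr_train.txt  # LR: file list of the reduced reconstruction set
```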
To evaluate our pre-trained model on the validation set, we provide the trained model weights here. You can replace `SOLVER.ckpt` in `config/scannet_val.yaml` with the path to the checkpoint file.
```
python segmentation.py --config config/scannet_val.yaml
```
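For reference, the relevant entry in `config/scannet_val.yaml` would then look something like this sketch; the key name comes from the paragraph above, and the path is a placeholder assumption:
```
# Hypothetical sketch -- the path below is a placeholder for your download.
SOLVER:
  ckpt: ckpt/<downloaded_checkpoint>.pth
```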
You can choose any RGB-D dataset you want; all you need to do is convert the images into point-cloud form with the pre-processing script `tool/preprocess.py`. We provide soft-label pre-training scripts; the commands below use Matterport3D as an example.
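For intuition, converting an RGB-D frame into a point cloud amounts to back-projecting the depth map through the camera intrinsics. The sketch below is a generic illustration of that step, not the repository's actual `tool/preprocess.py`; the function name and calling convention are assumptions.
```
import numpy as np

def rgbd_to_point_cloud(depth, rgb, fx, fy, cx, cy):
    # Back-project a depth map (H, W) and RGB image (H, W, 3) into an
    # (N, 6) array of xyz + color. Generic sketch, not the repo's code.
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # per-pixel column/row indices
    x = (u - cx) * depth / fx   # pinhole model: X = (u - cx) * Z / fx
    y = (v - cy) * depth / fy
    valid = depth > 0           # keep only pixels with a depth reading
    xyz = np.stack([x[valid], y[valid], depth[valid]], axis=-1)
    return np.concatenate([xyz, rgb[valid]], axis=-1)
```
Here `fx`, `fy`, `cx`, `cy` are the camera intrinsics of the dataset's sensor.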
Download the RGB-D dataset Matterport3D (see its official document) and pre-process it with the following command:
```
python tools/mp3d.py --run process_mp3d --path_in <mp3d_folder> --path_out data/mp3d
```
Download the well-trained scene parser weights from the DPT repository; we utilize `dpt_hybrid-ade20k-53898607.pt` in our paper. Then launch soft-label pre-training:
```
python softlabel_segmentation.py --config config/seg_mp3d.yaml
```
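Conceptually, soft-label pre-training supervises the 3D network with the 2D scene parser's class probabilities instead of hard labels. The sketch below shows one standard form of such an objective, cross-entropy against soft targets; it is a generic illustration under that assumption, not the repository's exact loss, and every name in it is hypothetical.
```
import torch
import torch.nn.functional as F

def soft_label_loss(logits_3d, soft_targets):
    # Cross-entropy between the 3D model's per-point logits (N, C) and
    # soft pseudo-labels (N, C) from the 2D scene parser. Generic sketch.
    log_probs = F.log_softmax(logits_3d, dim=-1)
    return -(soft_targets * log_probs).sum(dim=-1).mean()

# Hypothetical usage: soft targets come from the 2D parser's softmax output,
# projected onto the points of the corresponding RGB-D frame.
logits_3d = torch.randn(1024, 20)                      # 3D network predictions
soft_targets = F.softmax(torch.randn(1024, 20), -1)    # 2D parser pseudo-labels
loss = soft_label_loss(logits_3d, soft_targets)
```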
Many thanks to these practical open source projects:
