M3: 3D-Spatial Multimodal Memory

m3-teaser.mp4

TODO

Installation

Prepare Conda Environment

conda create --name gs python=3.10
conda activate gs
conda install -c conda-forge cudatoolkit=11.7
conda install -c nvidia/label/cuda-11.7.0 cuda-toolkit
pip install torch==2.0.1+cu117 torchvision==0.15.2+cu117 torchaudio==2.0.2+cu117 -f https://download.pytorch.org/whl/torch_stable.html
conda install -c conda-forge gxx_linux-64=11.2.0
conda install -c conda-forge libxcrypt
pip install plyfile tqdm psutil setuptools mkl pandas
pip install --force-reinstall numpy==1.23.5
export LD_LIBRARY_PATH=$CONDA_PREFIX/lib:$LD_LIBRARY_PATH
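
The `export` above only lasts for the current shell session. A minimal sketch of one way to make it persistent, assuming the standard conda behaviour of sourcing `*.sh` files in `etc/conda/activate.d` on activation. The function name `persist_ld_library_path` and the hook file name `m3_ld_library_path.sh` are hypothetical and not part of M3:

```shell
# Write an activation hook so LD_LIBRARY_PATH is set on every `conda activate`.
# $1: environment prefix (defaults to $CONDA_PREFIX of the active environment).
# NOTE: function and file names are illustrative assumptions, not part of M3.
persist_ld_library_path() {
    prefix="${1:-${CONDA_PREFIX:-}}"
    if [ -z "$prefix" ]; then
        echo "no conda environment is active and no prefix was given" >&2
        return 1
    fi
    hook_dir="$prefix/etc/conda/activate.d"
    mkdir -p "$hook_dir"
    # Single quotes keep the variables unexpanded until the hook is sourced.
    printf '%s\n' 'export LD_LIBRARY_PATH="$CONDA_PREFIX/lib:$LD_LIBRARY_PATH"' \
        > "$hook_dir/m3_ld_library_path.sh"
}
```

With the `gs` environment active, calling `persist_ld_library_path` once should be enough. The sketch does not add a matching deactivation hook, so the variable stays set after `conda deactivate` until the shell exits.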

Download M3 and install submodules

git clone https://github.com/MaureenZOU/m3-spatial.git
cd m3-spatial
(cd submodules/diff-gaussian-rasterization && pip install -e .)
(cd submodules/diff-gaussian-rasterization2 && pip install -e .)

Download Grendel-GS in a separate folder and install submodules

git clone git@github.com:nyu-systems/Grendel-GS.git --recursive
cd Grendel-GS
(cd submodules/gsplat && pip install -e .)
(cd submodules/simple-knn && pip install -e .)

Demo

sh run/app.sh

demo_video.mp4

Dataset

Download data for raw image

Download data for embedding

| Train |
|-------|
| Link |

Feature Extraction

python3 -m lmm.clip.extract # CLIP feature
python3 -m lmm.siglip.extract # SigLIP feature
python3 -m lmm.dinov2.extract # DINOv2 feature
python3 -m lmm.llama.extract # LLaMA3 feature
python3 -m lmm.llamav.extract # LLaMAv feature
python3 -m lmm.seem.extract # SEEM feature
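
To run all six extractors in sequence, the commands above can be wrapped in a small loop. This is a sketch that assumes each module runs without extra arguments, exactly as listed above. The function name `extract_all` and the `DRY_RUN` variable are hypothetical helpers, not part of the repository:

```shell
# Run each extractor listed above, in order, or only print the commands
# when DRY_RUN=1. Stops at the first extractor that fails.
# NOTE: extract_all and DRY_RUN are illustrative assumptions, not part of M3.
extract_all() {
    for model in clip siglip dinov2 llama llamav seem; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            echo "python3 -m lmm.$model.extract"
        else
            python3 -m "lmm.$model.extract" || return 1
        fi
    done
}
```

Call `extract_all` from the repository root so that the `lmm` package is importable. Setting `DRY_RUN=1` first is a cheap way to confirm the command list before starting a long extraction run.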

Checkpoint

We provide trained M3 representations for two scenes: train and geisel.

Name	Size	Link
train	2.04GB	https://huggingface.co/xueyanz/M3-Train/resolve/main/train_ckpt.tar.gz
geisel	1.04GB	https://huggingface.co/xueyanz/M3-Train/resolve/main/geisel_ckpt.tar.gz
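
The archives can be fetched and unpacked from the command line. The sketch below assumes `wget` and `tar` are installed. The helper names `ckpt_url` and `fetch_ckpt` are hypothetical, and the destination directory is left to the caller because this README does not say where the training and evaluation scripts expect the checkpoints:

```shell
# Build the download URL for a released checkpoint.
# $1: scene name, "train" or "geisel" (the two rows of the table above).
# NOTE: ckpt_url and fetch_ckpt are illustrative assumptions, not part of M3.
ckpt_url() {
    echo "https://huggingface.co/xueyanz/M3-Train/resolve/main/${1}_ckpt.tar.gz"
}

# Download one checkpoint archive and unpack it.
# $1: scene name; $2: destination directory (defaults to the current one).
fetch_ckpt() {
    scene="$1"
    dest="${2:-.}"
    case "$scene" in
        train|geisel) ;;
        *) echo "unknown scene: $scene" >&2; return 1 ;;
    esac
    mkdir -p "$dest"
    wget -O "$dest/${scene}_ckpt.tar.gz" "$(ckpt_url "$scene")" &&
        tar -xzf "$dest/${scene}_ckpt.tar.gz" -C "$dest"
}
```

For example, `fetch_ckpt geisel ./checkpoints` downloads the 1.04GB archive into `./checkpoints` and extracts it there.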

Training

sh run/train.sh # single GPU
sh run/mtrain.sh # multi GPU

Evaluation and Demo

sh run/eval.sh # single GPU evaluation
sh run/app.sh # run interactive demo
