
Online Language Splatting

Saimouli Katragadda, Cho-Ying Wu, Yuliang Guo†, Xinyu Huang, Guoquan Huang, Liu Ren
(† indicates corresponding author)
Webpage | Paper | Video
Pretrained Models: HuggingFace (pre-trained weights for the models used below)

[Demo GIFs: open-vocabulary queries for "Sofa" and "Rug"]

🔔 Highlights

  • Our method realizes Gaussian Splatting and open-vocabulary-preserving language mapping simultaneously in an online SLAM framework.
  • This release includes both the integrated framework and plug-and-play pretrained network modules that produce dense, sharp CLIP maps (192x192x768) at beyond real-time speed (>40 FPS).
  • We’re actively working on improving the pipeline for higher speed, larger-scale datasets, and broader compatibility — stay tuned for upcoming updates!

🚀 Getting Started

📦 Dataset

mkdir -p data
cd data
wget https://huggingface.co/datasets/kxic/vMAP/resolve/main/vmap.zip
unzip vmap.zip

🛠️ Installation

git clone https://github.com/rpng/online_lang_splatting.git --recursive
cd online_lang_splatting

Set up the environment.

conda env create -f environment.yaml
conda activate LangGS

💬 Language Model Setup

cd language/sed/open_clip
make install
python -m pip install 'git+https://github.com/facebookresearch/detectron2.git'

Download the language model weights from

https://drive.google.com/file/d/1zAXE0QXy47n0cVn7j_2cSR85eqxdDGg8/view?usp=drive_link

Edit language/configs/convnextL_768.yaml and set WEIGHTS to the path of the downloaded language model weights.
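For reference, a minimal sketch of that edit, assuming WEIGHTS sits at the level shown (in a detectron2-style config it may instead be nested, e.g. under MODEL; check the file):

# language/configs/convnextL_768.yaml (sketch; the filename below is a placeholder)
WEIGHTS: /path/to/downloaded_language_model_weights.pth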

cd ../../..  # back to the repository root
python create_lang_model.py --config language/configs/convnextL_768.yaml

🧠 Language Features Demo

Download the pre-trained weights. We use the omni_general indoor-trained weights.

To test language features on your own image, run:

python3 language/language_features.py --high-res-model "high_res_71_indoor.ckpt" --lang-model "seg_clip_model_l.pth" --input "sample/replica_room0.jpg" --query-text "vase"
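Because the feature maps are open-vocabulary, the same weights answer any text query; for example, to probe for a sofa as in the demo above:

python3 language/language_features.py --high-res-model "high_res_71_indoor.ckpt" --lang-model "seg_clip_model_l.pth" --input "sample/replica_room0.jpg" --query-text "sofa"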

🧭 Running the Pipeline

Edit base_config.yaml: set auto_ckpt_path to the generalized autoencoder checkpoint, lang_model_path to the language feature map model weights, and hr_ckpt_path to the high-resolution module weights.

For room0.yaml, set dataset_path to point to the room0 dataset and online_ckpt_path to where you want the checkpoint to be saved.
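A minimal sketch of these edits (the key names come from this README; the flat layout and every path below are assumptions, so check the shipped configs for the exact structure):

# base_config.yaml (sketch)
auto_ckpt_path: /path/to/generalized_autoencoder.ckpt  # generalized autoencoder
lang_model_path: /path/to/seg_clip_model_l.pth         # language feature map model
hr_ckpt_path: /path/to/high_res_71_indoor.ckpt         # high-resolution module

# configs/rgbd/replicav2/room0.yaml (sketch)
dataset_path: /path/to/data/room0
online_ckpt_path: /path/to/save/checkpoints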

▶️ Running the 2-Stage Pipeline

In base_config.yaml, point auto_ckpt_path and hr_ckpt_path to the respective checkpoint files, and in room0.yaml set single_stage_ae to False.
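In room0.yaml, that toggle would look like this (a sketch, same caveats as above):

# configs/rgbd/replicav2/room0.yaml (sketch)
single_stage_ae: False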

▶️ Running the 1-Stage Pipeline

To run the 1-stage pipeline, open room0.yaml and update the following parameters (a sketch follows the list):

  • Set auto_ckpt_path to the cross-data generalization checkpoint file.
  • Set single_stage_ae to True.
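Together, the 1-stage settings would look like this (key names from this README; the checkpoint filename is a placeholder):

# configs/rgbd/replicav2/room0.yaml (sketch)
auto_ckpt_path: /path/to/cross_data_generalization.ckpt  # placeholder filename
single_stage_ae: True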

We use a 4-split strategy for training:

  • Split 1: office0, room0
  • Split 2: office1, room1
  • Split 3: office2, room2
  • Split 4: office3, office4

Training and testing example for the 4-split strategy:

  • Run 1: Train on Splits 2, 3, 4 → Test on Split 1
  • Run 2: Train on Splits 1, 3, 4 → Test on Split 2
  • Run 3: Train on Splits 1, 2, 4 → Test on Split 3
  • Run 4: Train on Splits 1, 2, 3 → Test on Split 4

The weights are in the pretrained-weights folder; use the run whose test split contains the scene you are evaluating. For example, to evaluate on room0 and office0, use the weights from Run 1. Then run:

python3 slam.py --config configs/rgbd/replicav2/room0.yaml

📊 Evaluation

🔖 Create Labels

python3 eval/create_replica_labels.py

✅ Evaluate 2-Stage Pipeline

To evaluate the 2-stage pipeline, run:

python3 eval/evaluate_onlinelangslam.py

✅ Evaluate 1-Stage Pipeline

To evaluate the cross-data generalizable (1-stage) pipeline, run:

python3 eval/evaluate_langslam.py

🧱 3D Evaluation

⚠️ Note: in each .py file, please read the comments and change the path variables to match your local setup.

Prepare the colorized ground truth by running:

cd eval/tsdf_fusion
python3 save_semantic_colors_gt.py

To reconstruct the TSDF for the ground truth, run:

python3 dim3_recon_gt.py
cd PytorchEMD; python3 setup.py install

Copy the compiled .so file to the tsdf_fusion folder (one level up).
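A hedged sketch of that copy, run from inside PytorchEMD and assuming setup.py leaves the compiled extension under build/ (the lib* directory and the .so filename vary with your Python and CUDA versions):

cp build/lib*/emd*.so ..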

▶️ Run 3D Evaluation

For LangSlam:

python3 3d_evaluation_and_visualize_langslam_dim15.py

For LangSplat:

python3 3d_evaluation_and_visualize_langsplat.py

🧪 Training

To train your own autoencoder on your domain for the 1-stage pipeline:

The language features script can be used to save high- or low-resolution language feature labels, which serve as training data for the autoencoder. Then run:

python3 language/autoencoder/train_encoder_light.py

🧬 Reproducibility

There might be minor differences between the released version and the results in the paper; multi-process performance has some inherent randomness due to GPU utilization. We ran all experiments on an RTX A4500 GPU, and performance may differ on other GPUs.

🙏 Acknowledgement

This work incorporates many open-source codebases. We extend our gratitude to their authors.

📖 Citation

If you find this work helpful, please consider citing us:

@inproceedings{katragadda2025_onlinelang,
  title     = {{O}nline {L}anguage {S}platting},
  author    = {Saimouli Katragadda and Cho-Ying Wu and Yuliang Guo and Xinyu Huang and Guoquan Huang and Liu Ren},
  booktitle = {arXiv},
  year      = {2025}
}