MICCAI 2025: Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion

Mona Sheikh Zeinoddin · Mobarak I. Hoque · Zafer Tandogdu · Greg L. Shaw · Matthew J. Clarkson · Evangelos B. Mazomenos · Danail Stoyanov

Paper | Video

This work was early-accepted and selected for an oral presentation at MICCAI 2025.

⚙️ Setup

We ran our experiments with PyTorch 2.1.0, CUDA 12.0, Python 3.10 and Ubuntu 22.04.

🖼️ Prediction for a single image or a folder of images

You can predict scaled disparity for a single image or a folder of images with:

CUDA_VISIBLE_DEVICES=0 python test_simple.py --model_path <your_model_path> --image_path <your_image_or_folder_path>

Initializing with AF-SfMLearner weights

You can download the AF-SfMLearner weights that we use for initialization with:

gdown 1kf7LjQ6a2ACKr6nX5Uyee3of3bXn1xWB
unzip -q Model_trained_end_to_end.zip
mv Model_trained_end_to_end af_sfmlearner_weights

💾 Datasets

You can download the Endovis or SCARED dataset by signing the challenge rules and emailing them to max.allan@intusurg.com

Endovis split

The train/test/validation split for the Endovis dataset used in our work is defined in the splits/endovis folder.

Endovis data preprocessing

We use ffmpeg to convert each RGB.mp4 video into PNG frames:

find . -name "*.mp4" -print0 | xargs -0 -I {} sh -c 'output_dir=$(dirname "$1"); ffmpeg -i "$1" "$output_dir/%10d.png"' _ {}

We use only the left frames in our experiments; see extract_left_frames.py. For datasets 8 and 9, we rename keyframes 0-4 to keyframes 1-5.
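
The keyframe renumbering for datasets 8 and 9 can be sketched as below. This is a minimal illustration, not part of the repository: the function name `shift_keyframes` is hypothetical, and it assumes the directories are named `keyframe0` to `keyframe4` in the same style as the layout used in this README. Renaming proceeds from the highest index downwards so that no target name collides with a directory that has not been moved yet.

```python
from pathlib import Path


def shift_keyframes(dataset_dir: Path, first: int = 0, last: int = 4) -> list[tuple[str, str]]:
    """Rename keyframe<first>..keyframe<last> to keyframe<first+1>..keyframe<last+1>.

    Hypothetical helper: directory naming is an assumption based on the
    README layout, not taken from the repository code.
    """
    renamed = []
    # Walk from the highest index down so a target never clashes with a
    # source directory that is still waiting to be renamed.
    for idx in range(last, first - 1, -1):
        src = dataset_dir / f"keyframe{idx}"
        dst = dataset_dir / f"keyframe{idx + 1}"
        if not src.is_dir():
            continue  # nothing to rename for this index
        if dst.exists():
            raise FileExistsError(f"{dst} already exists; refusing to overwrite")
        src.rename(dst)
        renamed.append((src.name, dst.name))
    return renamed
```

Calling `shift_keyframes(Path("/path/to/endovis_data/dataset8"))` a second time raises `FileExistsError` instead of shifting the keyframes again, which guards against accidental double renumbering.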

Data structure

The dataset directory structure is as follows:

/path/to/endovis_data/
  dataset1/
    keyframe1/
      image_02/
        data/
          0000000001.png
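
As a quick check of the layout above, the path of a single frame can be built as in the following sketch. The helper `frame_path` is hypothetical and not part of the repository; it only assumes the folder names and the 10-digit zero-padded file names shown in the tree.

```python
from pathlib import Path


def frame_path(root: str, dataset: int, keyframe: int, frame: int) -> Path:
    """Build the path of one frame following the directory tree shown above.

    Hypothetical helper for illustration; folder names are copied from the
    README layout.
    """
    return (
        Path(root)
        / f"dataset{dataset}"
        / f"keyframe{keyframe}"
        / "image_02"
        / "data"
        / f"{frame:010d}.png"  # 10-digit, zero-padded frame index
    )
```

For example, `frame_path("/path/to/endovis_data", 1, 1, 1)` points to the `0000000001.png` file listed in the tree.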

⏳ Endovis training

CUDA_VISIBLE_DEVICES=0 python train_end_to_end.py --data_path <your_data_path> --log_dir <path_to_save_model (depth, pose, appearance flow, optical flow)>

📊 Endovis evaluation

To prepare the ground-truth depth maps, run:

CUDA_VISIBLE_DEVICES=0 python export_gt_depth.py --data_path <your_data_path> --split endovis

Depth evaluation:

python evaluate_depth.py --data_path <your_data_path> --load_weights_folder <path_to_weights_i_folder> --eval_mono
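
For readers unfamiliar with the depth metrics reported in the results table (Abs Rel, Sq Rel, RMSE), the sketch below shows the definitions commonly used in monocular depth benchmarks. It is an illustration only and is not taken from evaluate_depth.py: the function name `depth_errors` is hypothetical, and the median scaling step is an assumption about what monocular evaluation typically involves. Plain Python lists are used to keep the example dependency-free.

```python
import math
from statistics import median


def depth_errors(gt, pred, median_scale=True):
    """Return (abs_rel, sq_rel, rmse) for paired depths at valid pixels.

    gt and pred are equal-length sequences of positive depth values.
    Hypothetical helper; the repository's evaluation script may differ.
    """
    if len(gt) != len(pred) or len(gt) == 0:
        raise ValueError("gt and pred must be non-empty and of equal length")
    if median_scale:
        # Monocular predictions are scale-ambiguous, so align the medians
        # before measuring errors (assumed, as in common practice).
        ratio = median(gt) / median(pred)
        pred = [p * ratio for p in pred]
    n = len(gt)
    abs_rel = sum(abs(g - p) / g for g, p in zip(gt, pred)) / n
    sq_rel = sum((g - p) ** 2 / g for g, p in zip(gt, pred)) / n
    rmse = math.sqrt(sum((g - p) ** 2 for g, p in zip(gt, pred)) / n)
    return abs_rel, sq_rel, rmse
```

In a full evaluation these values would typically be computed per image and then averaged over the test split.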

Pose evaluation:

python evaluate_pose.py --data_path <your_data_path> --load_weights_folder <path_to_weights_i_folder> --scared_pose_seq <trajectory_1_or_2>
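
The results table reports an absolute trajectory error (ATE) per trajectory. The sketch below illustrates one simple way such an error can be computed: both trajectories are shifted to start at the origin, the prediction is rescaled by a least-squares factor, and the root-mean-square position error is returned. This is an assumption-laden illustration, not the procedure in evaluate_pose.py; the function name `absolute_trajectory_error` and the choice of alignment are hypothetical.

```python
import math


def absolute_trajectory_error(gt_xyz, pred_xyz):
    """RMSE between two trajectories after start and scale alignment.

    gt_xyz and pred_xyz are equal-length sequences of (x, y, z) positions.
    Hypothetical helper; the repository's pose evaluation may align differently.
    """
    if len(gt_xyz) != len(pred_xyz) or len(gt_xyz) == 0:
        raise ValueError("trajectories must be non-empty and of equal length")
    # Express both trajectories relative to their own first position.
    gt = [[c - c0 for c, c0 in zip(p, gt_xyz[0])] for p in gt_xyz]
    pred = [[c - c0 for c, c0 in zip(p, pred_xyz[0])] for p in pred_xyz]
    # Least-squares scale that maps the prediction onto the ground truth.
    num = sum(g * p for gp, pp in zip(gt, pred) for g, p in zip(gp, pp))
    den = sum(p * p for pp in pred for p in pp)
    scale = num / den if den > 0 else 1.0
    sq_err = sum(
        (p * scale - g) ** 2 for gp, pp in zip(gt, pred) for g, p in zip(gp, pp)
    )
    return math.sqrt(sq_err / len(gt))
```

A prediction that differs from the ground truth only by a global scale and a translation yields an error of zero under this alignment.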

Want to see our project in action? ✨ Dive into our interactive Colab demo: Launch in Colab

The StereoMIS sequence we used to evaluate our model is available here.

Depth Estimation on SCARED & Hamlyn

Visual Odometry on SCARED Trajectory2

3D Reconstruction on SCARED & Visual Odometry on StereoMIS

Our Model

Model	Abs Rel	Sq Rel	RMSE	ATE-Trajectory 1	ATE-Trajectory 2	Link
End-to-end best model weights	0.051	0.354	4.480	0.0702	0.0438	google

Citation

If you found this code/work to be useful in your own research, please consider citing the following:

@inproceedings{zeinoddin2025endo,
  title={Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion},
  author={Zeinoddin, Mona Sheikh and Islam, Mobarakol and Tandogdu, Zafer and Shaw, Greg and Clarkson, Mathew J and Mazomenos, Evangelos and Stoyanov, Danail},
  booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
  year={2025},
  organization={Springer}
}

Contact

If you have any questions, please feel free to contact mona.zeinoddin.22@ucl.ac.uk
