Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

This is the official implementation of our paper

Authors: Bowen Wen, Shaurya Dewan, Stan Birchfield

Abstract

Stereo foundation models achieve strong zero-shot generalization but remain computationally prohibitive for real-time applications. Efficient stereo architectures, on the other hand, sacrifice robustness for speed and require costly per-domain fine-tuning. To bridge this gap, we present Fast-FoundationStereo, a family of architectures that achieve, for the first time, strong zero-shot generalization at real-time frame rate. We employ a divide-and-conquer acceleration strategy with three components: (1) knowledge distillation to compress the hybrid backbone into a single efficient student; (2) blockwise neural architecture search for automatically discovering optimal cost filtering designs under latency budgets, reducing search complexity exponentially; and (3) structured pruning for eliminating redundancy in the iterative refinement module. Furthermore, we introduce an automatic pseudo-labeling pipeline used to curate 1.4M in-the-wild stereo pairs to supplement synthetic training data and facilitate knowledge distillation. The resulting model can run over 10× faster than FoundationStereo while closely matching its zero-shot accuracy, thus establishing a new state-of-the-art among real-time methods.

Code coming soon, please stay tuned. For offline stereo depth computation, please check FoundationStereo

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
.gitignore		.gitignore
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

Abstract

Code coming soon, please stay tuned. For offline stereo depth computation, please check FoundationStereo

About

Uh oh!

Releases

Packages

NVlabs/Fast-FoundationStereo

Folders and files

Latest commit

History

Repository files navigation

Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

Abstract

Code coming soon, please stay tuned. For offline stereo depth computation, please check FoundationStereo

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages