Skip to content
This repository was archived by the owner on Aug 25, 2025. It is now read-only.
This repository was archived by the owner on Aug 25, 2025. It is now read-only.

Unable to Reproduce NYUv2 and KITTI Results Using TorchHub Models (Hugging Face Versions Work) #285

@ahmedshahabaz

Description

@ahmedshahabaz

Hi,
Apologies in advance if I’m missing something, but I’ve been trying to reproduce the zero-shot relative depth estimation results on the NYUv2 and KITTI datasets using the MiDaS v3.1 models. When using the pre-trained models from the official GitHub or TorchHub (intel-isl/MiDaS), I wasn’t able to match the reported performance.

However, when I used the same models from the Hugging Face Hub (e.g., Intel/dpt-beit-large-384), I was able to recreate the expected results on both datasets.

Could there be a mismatch or issue with the weights provided via TorchHub or the pre/post-processing steps?

Thanks for your great work and support!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions