PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage
[**Denis Zavadski**](https://scholar.google.com/citations?user=S7mDg00AAAAJ)
\* · [**Damjan Kalšan**](https://scholar.google.com/citations?user=6NAxnFUAAAAJ)
\* · [**Carsten Rother**](https://scholar.google.com/citations?user=N_YNMIMAAAAJ)
Computer Vision and Learning Lab,
IWR, Heidelberg University
*equal contribution
ACCV 2024
PrimeDepth is a diffusion-based monocular depth estimator which leverages the rich representation of the visual world stored within Stable Diffusion. The representation, termed
preimage
, is extracted in a single diffusion step from frozen Stable Diffusion 2.1 and adjusted towards depth prediction. PrimeDepth yields detailed predictions while simulatenously being fast at inference time due to the single-step approach.
![teaser](images/teaser.png)
## Introduction
These are the weights for the inference codebase for [PrimeDepth](https://arxiv.org/abs/2409.09144) based on