Depth Estimation

799 papers with code • 14 benchmarks • 70 datasets

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Benchmarks

Add a Result

These leaderboards are used to track progress in Depth Estimation

Dataset	Best Model	Compare
Stanford2D3D Panoramic	HiMODE	See all
NYU-Depth V2	EVP	See all
eBDtheque	Bhattacharjee et al.	See all
DCM	Bhattacharjee et al.	See all
ScanNet	Atlas (plain)	See all
ScanNetV2	DELTAS	See all
DIODE	AIP-Brown	See all
Mars DTM Estimation	GLPDepth	See all
KITTI 2015	H-Net (Ours) Full Eigen	See all
4D Light Field Dataset	LFattNet	See all
Taskonomy	X-TC (Cross-Task Consistency)	See all
Cityscapes test	SDC-Depth	See all
Matterport3D	UniFuse	See all
KITTI Eigen split	LightDepth	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Depth Estimation models and implementations

huggingface/transformers

3 papers

124,984

open-mmlab/mmdetection3d

2 papers

4,808

google-research/big_vision

2 papers

1,554

Datasets

Subtasks

3D Depth Estimation

Depth Map Super-Resolution

Stereo-LiDAR Fusion

Indoor Monocular Depth Estimation

Depth Aleatoric Uncertainty Estimation

Depth Image Upsampling

Most implemented papers

Most implemented Social Latest No code

High Quality Monocular Depth Estimation via Transfer Learning

ialhashim/DenseDepth • • 31 Dec 2018

Accurate depth estimation from images is a fundamental task in many applications including scene understanding and reconstruction.

Paper
Code

Deeper Depth Prediction with Fully Convolutional Residual Networks

iro-cp/FCRN-DepthPrediction • • 1 Jun 2016

This paper addresses the problem of estimating the depth map of a scene given a single RGB image.

Paper
Code

Unsupervised Monocular Depth Estimation with Left-Right Consistency

mrharicot/monodepth • • CVPR 2017

Learning based methods have shown very promising results for the task of depth estimation in single images.

Paper
Code

Vision Transformers for Dense Prediction

isl-org/DPT • • ICCV 2021

We introduce dense vision transformers, an architecture that leverages vision transformers in place of convolutional networks as a backbone for dense prediction tasks.

Paper
Code

Digging Into Self-Supervised Monocular Depth Estimation

nianticlabs/monodepth2 • • 4 Jun 2018

Per-pixel ground-truth depth data is challenging to acquire at scale.

Paper
Code

Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

intel-isl/MiDaS • • 2 Jul 2019

In particular, we propose a robust training objective that is invariant to changes in depth range and scale, advocate the use of principled multi-objective learning to combine data from different sources, and highlight the importance of pretraining encoders on auxiliary tasks.

Paper
Code