Saliency guided depth prediction from a single image

Yu Wang, Lizhuang Ma

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the recent surge of deep neural networks, depth prediction from a single image has seen substantial progress. Deep regression networks are typically learned from large data without much constraints about the scene structure, thus often leading to uncertainties at discontinuous regions. In this paper, we propose a structure-aware depth prediction method based on two observations: depth is relatively smooth within the same objects, and it is usually easier to model relative depth than model the absolute depth from scratch. Our network first predicts an initial depth map and takes an object saliency map as input, which helps to teach the network to learn depth refinement. Specifically, a stable anchor depth is first estimated from the detected salient objects, and the learning objective is to penalize the difference in relative depth versus the estimated anchor. We show such saliency-guided relative depth constraint unveils helpful scene structures, leading to significant gains on the RGB-D saliency dataset NLPR and depth prediction dataset NYU V2. Furthermore, our method is appealing in that it is pluggable to any depth network and is trained end-to-end with no overhead of time during testing.

Original languageEnglish
Title of host publicationProceedings of the International Conference on Advances in Computer Technology, Information Science and Communications, CTISC 2019
EditorsWen-Bing Horng, Yong Yue
PublisherSciTePress
Pages153-159
Number of pages7
ISBN (Electronic)9789897583575
DOIs
StatePublished - 2019
Externally publishedYes
Event2019 International Conference on Advances in Computer Technology, Information Science and Communications, CTISC 2019 - Xiamen, China
Duration: 15 Mar 201917 Mar 2019

Publication series

NameProceedings of the International Conference on Advances in Computer Technology, Information Science and Communications, CTISC 2019

Conference

Conference2019 International Conference on Advances in Computer Technology, Information Science and Communications, CTISC 2019
Country/TerritoryChina
CityXiamen
Period15/03/1917/03/19

Keywords

  • CNN
  • Single-image depth prediction

Fingerprint

Dive into the research topics of 'Saliency guided depth prediction from a single image'. Together they form a unique fingerprint.

Cite this