RecResNet: A recurrent residual CNN architecture for disparity map enhancement

Konstantinos Batsos, Philippos Mordohai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

38 Scopus citations

Abstract

We present a neural network architecture applied to the problem of refining a dense disparity map generated by a stereo algorithm to which we have no access. Our approach is able to learn which disparity values should be modified and how, from a training set of images, estimated disparity maps and the corresponding ground truth. Its only input at test time is a disparity map and the reference image. Two design characteristics are critical for the success of our network: (i) it is formulated as a recurrent neural network, and (ii) it estimates the output refined disparity map as a combination of residuals computed at multiple scales, that is at different up-sampling and down-sampling rates. The first property allows the network, which we named RecResNet, to progressively improve the disparity map, while the second property allows the corrections to come from different scales of analysis, addressing different types of errors in the current disparity map. We present competitive quantitative and qualitative results on the KITTI 2012 and 2015 benchmarks that surpass the accuracy of previous disparity refinement methods.

Original languageEnglish
Title of host publicationProceedings - 2018 International Conference on 3D Vision, 3DV 2018
Pages238-247
Number of pages10
ISBN (Electronic)9781538684252
DOIs
StatePublished - 12 Oct 2018
Event6th International Conference on 3D Vision, 3DV 2018 - Verona, Italy
Duration: 5 Sep 20188 Sep 2018

Publication series

NameProceedings - 2018 International Conference on 3D Vision, 3DV 2018

Conference

Conference6th International Conference on 3D Vision, 3DV 2018
Country/TerritoryItaly
CityVerona
Period5/09/188/09/18

Keywords

  • 3D vision
  • Deep learning
  • Disparity refinement
  • Stereo matching

Fingerprint

Dive into the research topics of 'RecResNet: A recurrent residual CNN architecture for disparity map enhancement'. Together they form a unique fingerprint.

Cite this