TY - GEN
T1 - RecResNet
T2 - 6th International Conference on 3D Vision, 3DV 2018
AU - Batsos, Konstantinos
AU - Mordohai, Philippos
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/10/12
Y1 - 2018/10/12
N2 - We present a neural network architecture applied to the problem of refining a dense disparity map generated by a stereo algorithm to which we have no access. Our approach is able to learn which disparity values should be modified and how, from a training set of images, estimated disparity maps and the corresponding ground truth. Its only input at test time is a disparity map and the reference image. Two design characteristics are critical for the success of our network: (i) it is formulated as a recurrent neural network, and (ii) it estimates the output refined disparity map as a combination of residuals computed at multiple scales, that is at different up-sampling and down-sampling rates. The first property allows the network, which we named RecResNet, to progressively improve the disparity map, while the second property allows the corrections to come from different scales of analysis, addressing different types of errors in the current disparity map. We present competitive quantitative and qualitative results on the KITTI 2012 and 2015 benchmarks that surpass the accuracy of previous disparity refinement methods.
AB - We present a neural network architecture applied to the problem of refining a dense disparity map generated by a stereo algorithm to which we have no access. Our approach is able to learn which disparity values should be modified and how, from a training set of images, estimated disparity maps and the corresponding ground truth. Its only input at test time is a disparity map and the reference image. Two design characteristics are critical for the success of our network: (i) it is formulated as a recurrent neural network, and (ii) it estimates the output refined disparity map as a combination of residuals computed at multiple scales, that is at different up-sampling and down-sampling rates. The first property allows the network, which we named RecResNet, to progressively improve the disparity map, while the second property allows the corrections to come from different scales of analysis, addressing different types of errors in the current disparity map. We present competitive quantitative and qualitative results on the KITTI 2012 and 2015 benchmarks that surpass the accuracy of previous disparity refinement methods.
KW - 3D vision
KW - Deep learning
KW - Disparity refinement
KW - Stereo matching
UR - http://www.scopus.com/inward/record.url?scp=85056747107&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85056747107&partnerID=8YFLogxK
U2 - 10.1109/3DV.2018.00036
DO - 10.1109/3DV.2018.00036
M3 - Conference contribution
AN - SCOPUS:85056747107
T3 - Proceedings - 2018 International Conference on 3D Vision, 3DV 2018
SP - 238
EP - 247
BT - Proceedings - 2018 International Conference on 3D Vision, 3DV 2018
Y2 - 5 September 2018 through 8 September 2018
ER -