Object detection from large-scale 3D datasets using bottom-up and top-down descriptors

Alexander Patterson IV, Philippos Mordohai, Kostas Daniilidis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

38 Scopus citations

Abstract

We propose an approach for detecting objects in large-scale range datasets that combines bottom-up and top-down processes. In the bottom-up stage, fast-to-compute local descriptors are used to detect potential target objects. The object hypotheses are verified after alignment in a top-down stage using global descriptors that capture larger scale structure information. We have found that the combination of spin images and Extended Gaussian Images, as local and global descriptors respectively, provides a good trade-off between efficiency and accuracy. We present results on real outdoors scenes containing millions of scanned points and hundreds of targets. Our results compare favorably to the state of the art by being applicable to much larger scenes captured under less controlled conditions, by being able to detect object classes and not specific instances, and by being able to align the query with the best matching model accurately, thus obtaining precise segmentation.

Original languageEnglish
Title of host publicationComputer Vision - ECCV 2008 - 10th European Conference on Computer Vision, Proceedings
Pages553-566
Number of pages14
EditionPART 4
DOIs
StatePublished - 2008
Event10th European Conference on Computer Vision, ECCV 2008 - Marseille, France
Duration: 12 Oct 200818 Oct 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 4
Volume5305 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th European Conference on Computer Vision, ECCV 2008
Country/TerritoryFrance
CityMarseille
Period12/10/0818/10/08

Fingerprint

Dive into the research topics of 'Object detection from large-scale 3D datasets using bottom-up and top-down descriptors'. Together they form a unique fingerprint.

Cite this