Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01zc77ss43g
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorXiao, Jianxiong-
dc.contributor.authorLichtenberg, Samuel-
dc.date.accessioned2015-06-26T18:06:17Z-
dc.date.available2015-06-26T18:06:17Z-
dc.date.created2015-04-30-
dc.date.issued2015-06-26-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01zc77ss43g-
dc.description.abstractAlthough RGB-D sensors have enabled breakthroughs for several vision tasks, such as 3D reconstruction, we have not attained the same level of success in high-level scene understand- ing. Perhaps one of the main reasons is the lack of a large-scale benchmark with 3D annotations and 3D evaluation metrics. In this paper, we introduce an RGB-D benchmark suite with the goal of advancing the state-of-the-art in all major scene understanding tasks. Our dataset is captured by four different sensors and contains 10,335 RGB-D images, putting it at roughly the same scale as PASCAL VOC. The whole dataset is densely annotated: it includes 146,617 2D polygons and 64,595 3D bounding boxes with accurate object orientations, as well as a 3D room layout and scene category label for each image. This dataset enables us to train data- hungry algorithms for scene understanding tasks, evaluate them using meaningful 3D metrics, avoid overfitting to a small test set, and study cross-sensor bias.en_US
dc.format.extent46 pagesen_US
dc.language.isoen_USen_US
dc.titleSUN RGB-D: An RGB-D Scene Understanding Benchmark Suiteen_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2015en_US
pu.departmentComputer Scienceen_US
pu.pdf.coverpageSeniorThesisCoverPage-
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File SizeFormat 
PUTheses2015-Lichtenberg_Samuel.pdf49.71 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.