Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/99999/fk41850x3r
Title: Fast and Robust 3D Reconstruction
Authors: Lipson, Lahav
Advisors: Deng, Jia
Contributors: Computer Science Department
Keywords: 3D Computer Vision
Camera Pose
Depth
Odometry
SLAM
Stereo
Subjects: Computer science
Issue Date: 2025
Publisher: Princeton, NJ : Princeton University
Abstract: 3D reconstruction from visual data is an important subtask for robotics, autonomous machines, and 3D scene understanding. It involves estimating camera and object motion from images/video, as well as 3D structure. I will introduce an approach known as Optimization Guided Neural Iterations (OGNI), and demonstrate how it can be applied to various 3D reconstruction tasks. In OGNI-based approaches, we mimic classical optimization algorithms by breaking down each task into a series of small revisions predicted by a shallow network. Each revision is supervised independently, and is informed by features conditioned on a running estimate of the solution. This mechanism is suprisingly general, and leads to robust and efficient solutions to 3D reconstruction. Moreover, I introduce several explicit optimization layers which enable one to reformat these challenging problems into easier, low-level vision tasks. On visual SLAM, stereo matching and object pose estimation, I show how this approach leads to state-of-the art accuracy and/or speed. I also discuss potential future directions for this line of work.
URI: http://arks.princeton.edu/ark:/99999/fk41850x3r
Type of Material: Academic dissertations (Ph.D.)
Language: en
Appears in Collections:Computer Science

Files in This Item:
File SizeFormat 
Lipson_princeton_0181D_15339.pdf25.65 MBAdobe PDFView/Download


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.