Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp012j62s727w
Title: | Extracting Cognition out of Images for the Purpose of Autonomous Driving |
Authors: | Chen, Chenyi |
Advisors: | Kornhauser, Alain Lucien |
Contributors: | Operations Research and Financial Engineering Department |
Keywords: | Artificial Intelligence Autonomous Driving Computer Vision Deep Learning Machine Learning |
Subjects: | Artificial intelligence Computer science |
Issue Date: | 2016 |
Publisher: | Princeton, NJ : Princeton University |
Abstract: | Autonomous driving is a broadly recognized solution to serious traffic problems such as accidents and congestions. It is a very broad topic that extends across cognition, artificial intelligence, and control. While this thesis primarily focuses on the cognition aspect, others are also considered. Here, the thesis proposes several computer vision algorithms for autonomous driving, encompassing three major parts: In part one, experiments on motion-based object recognition are presented. The proposed method differentiates objects according to their speed. In part two, the artificial intelligence aspect of autonomous driving is considered. Research on training an autonomous driving AI agent through reinforcement learning is introduced. In part three, the key part of the thesis, a direct perception approach is proposed to drive a car in a highway environment. In this approach, an input image is mapped to a small number of key perception indicators that directly relate to the affordance of a road/traffic state for driving. This representation provides a set of compact yet complete descriptions of the scene to enable a simple controller to drive autonomously on highways. Using synthetic images from a virtual environment, a deep convolutional neural network (ConvNet) is trained for direct perception. Experiments show that the model can effectively drive a car in a very diverse set of virtual environments, and it provides good estimation of affordance indicators from real driving images. To further improve the performance of the direct perception-based system, the issue of temporal information is considered by studying the Long Short Term Memory (LSTM) unit and its influence on the affordance indicator estimation. Quantitative results show that adding the LSTM unit does help to improve the system's performance. Finally, as object detection is closely related to autonomous driving, in Appendix A a deep learning-based small object detection approach is proposed. The applicability of the state-of-the-art object detection algorithms to the small object detection task is studied. |
URI: | http://arks.princeton.edu/ark:/88435/dsp012j62s727w |
Alternate format: | The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: http://catalog.princeton.edu/ |
Type of Material: | Academic dissertations (Ph.D.) |
Language: | en |
Appears in Collections: | Operations Research and Financial Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Chen_princeton_0181D_11760.pdf | 34.44 MB | Adobe PDF | View/Download |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.