Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01c821gn62v
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorHazan, Elad-
dc.contributor.authorVan Soest, Abby-
dc.date.accessioned2019-07-24T19:37:37Z-
dc.date.available2019-07-24T19:37:37Z-
dc.date.created2019-05-05-
dc.date.issued2019-07-24-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01c821gn62v-
dc.description.abstractIn order to learn, we must be able to explore. Creative, open-ended exploration of the world is central to the acquisition of general knowledge. This paper is grounded in the insight that the same is true in machine learning: an intelligent agent will have an inherent sense of curiosity and an intrinsic ability to explore its environment. As such, we seek to determine what an agent can learn to accomplish in an unknown environment without external reward signals. This can be considered a form of unsupervised reinforcement learning, for it removes the influence of reward "labels" from the learning process. Our solution, which we term the MaxEnt algorithm, is an iterative approach to entropy maximization that is based on the conditional gradient algorithm. This paper explains and experimentally evaluates this approach in two classic control tasks and five robotic locomotion tasks. In the absence of rewards, MaxEnt agents learn a variety of novel exploratory behaviors. In the future, our maximum entropy approach can be used as an exploration component of a policy gradient algorithm in the presence of rewards.en_US
dc.format.mimetypeapplication/pdf-
dc.language.isoenen_US
dc.titleREINFORCEMENT LEARNING WITHOUT REWARDS: SIGNAL-FREE EXPLORATION WITH THE MAXENT AGENTen_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2019en_US
pu.departmentComputer Scienceen_US
pu.pdf.coverpageSeniorThesisCoverPage-
pu.contributor.authorid960961604-
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File Description SizeFormat 
VANSOEST-ABBY-THESIS.pdf880.25 kBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.