Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01k643b414n
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorNarasimhan, Karthik-
dc.contributor.authorWang, Austin-
dc.date.accessioned2020-08-12T16:00:31Z-
dc.date.available2020-08-12T16:00:31Z-
dc.date.created2020-05-03-
dc.date.issued2020-08-12-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01k643b414n-
dc.description.abstractIn recent years, Deep Reinforcement Learning (DRL) has emerged as a state of the art approach to tasks ranging from games and robotics to natural language processing. However, DRL agents often need millions of samples to learn a task, and even for related tasks they need to be trained from scratch. Humans on the other hand, are able to leverage previously gained knowledge to learn new tasks efficiently. To improve the efficiency and robustness of DRL algorithms, there has been growing interest in exploiting domain knowledge found in text. One challenge of this approach is language grounding. We introduce two tasks, Defuse the Bomb and Item Drop and an attention-based text model that can simultaneously ground entities and dynamics. We find that our model is able to learn faster and achieve better generalization than baselines that do not use text. We also show that grounding game dynamics and entities occur in different parts of our model, and we leverage this fact to build chimera models that can perform well on previously unseen entity-task combinations.en_US
dc.format.mimetypeapplication/pdf-
dc.language.isoenen_US
dc.titleTEXTen_US
dc.titleTEXTen_US
dc.titleGrounding Entities and Dynamics through Gameplayen_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2020en_US
pu.departmentComputer Scienceen_US
pu.pdf.coverpageSeniorThesisCoverPage-
pu.contributor.authorid920093438-
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File Description SizeFormat 
WANG-AUSTIN-THESIS.pdf2.59 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.