Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01jw827f60q
Title: ORIGINAL
ORIGINAL
Improving agents for text-based games with pretrained embeddings
Authors: Slavov, Stanislav
Advisors: Narasimhan, Karthik
Department: Mathematics
Certificate Program: Center for Statistics and Machine Learning
Class Year: 2020
Abstract: In recent years, there has been an increasing interest in text-based games in Natural Language Processing research, in specific in the area of language understanding, which aims to develop algorithms and paradigms of how a computer can develop an understanding of language on the higher level of semantics and meaning. Text-based games have been researched as a tool to achieve this goal. In these games, the agent has to take in instructions and information in the form of text and make decisions about their actions based on that. Through approaches in reinforcement learning, there have been many algorithms that successfully learn policies for such games that generalize well. This paper will investigate the difference between understanding and memorization in neural network approaches. I utilize the TextWorld platform - a game generation engine developed by Microsoft Research that can be used to create text games of various characteristics. I will investigate the agent's ability to correctly recognize the same situation given modified information. For the purposes of this, I will work on a stochastic version of a text-based game, where at each turn the agent is given a randomized modification of the original text given by the game engine. The goal is to develop a method that correctly identifies (clusters) these noisy versions of the same description. A DRRN baseline is employed and to improve its performance, the model is enriched with pretrained-embeddings. Performance is tested on 3 games generated by TextWorld. To further measure how the algorithms handle the stochastic information, cluster analysis case studies are done on the hidden states generated by the trained agents on the full database of descriptions.
URI: http://arks.princeton.edu/ark:/88435/dsp01jw827f60q
Type of Material: Princeton University Senior Theses
Language: en
Appears in Collections:Mathematics, 1934-2020

Files in This Item:
File Description SizeFormat 
SLAVOV-STANISLAV-THESIS.pdf1.41 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.