Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01cc08hj47p
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Sircar, Ronnie | - |
dc.contributor.author | Kang, William | - |
dc.date.accessioned | 2019-08-16T13:59:12Z | - |
dc.date.available | 2019-08-16T13:59:12Z | - |
dc.date.created | 2019-04-16 | - |
dc.date.issued | 2019-08-16 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01cc08hj47p | - |
dc.description.abstract | This paper aims to study Nash Equilibrium, a game theory solution concept, in the case of extensive-form games, where traditional methods of linear programming for equilibrium computation are difficult to apply due to exponentially increasing linear constraints. Recent literature focuses on approximating the upper bound of Nash Equilibrium through a technique called Counterfactual Regret Minimization. In particular, the Monte Carlo Counterfactual Regret Minimization (MCCFR) technique utilizes sampling to effectively minimize per-iteration regret in a large game tree. However even MCCFR has obvious limitations that prevent the algorithm from being suitable in the case of extensive games with imperfect recall. Therefore, in this study, a variant of the Outcome-Sampling MCCFR algorithm, which we call the Average-Outcome-Sampling MCCFR algorithm, is designed to account for imperfect recall, and its performance is tested through a classic extensive-form game called Goofspiel. The algorithm, though it fails to minimize counterfactual regret relative to the case with perfect recall, generates an average behavioral strategy with performance that is comparable to that of perfect recall. Thus, these results suggest that the effects of imperfect recall are not so significant at least for two-player smaller-sized games. | en_US |
dc.format.mimetype | application/pdf | - |
dc.language.iso | en | en_US |
dc.title | Approximating Equilibrium in Extensive Games with Imperfect Recall via Counterfactual Regret Minimization | en_US |
dc.type | Princeton University Senior Theses | - |
pu.date.classyear | 2019 | en_US |
pu.department | Operations Research and Financial Engineering | * |
pu.pdf.coverpage | SeniorThesisCoverPage | - |
pu.contributor.authorid | 961167188 | - |
pu.certificate | Applications of Computing Program | en_US |
Appears in Collections: | Operations Research and Financial Engineering, 2000-2019 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
KANG-WILLIAM-THESIS.pdf | 591.67 kB | Adobe PDF | Request a copy |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.