Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01np193c94m
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Powell, Warren Buckler | - |
dc.contributor.author | Aboagye, Nana | - |
dc.contributor.other | Operations Research and Financial Engineering Department | - |
dc.date.accessioned | 2018-10-09T21:09:39Z | - |
dc.date.available | 2018-10-09T21:09:39Z | - |
dc.date.issued | 2018 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01np193c94m | - |
dc.description.abstract | In this dissertation, we study the behavior of value-of-information policies in the presence of a locally quadratic belief model. We show that the well-known behavior of many learning policies for look-up table belief models no longer applies when the underlying truth is parametric. We characterize the behavior of the knowledge gradient policy, a policy that maximizes the one-step value of information, in the presence of a parametric belief model. We exploit this insight to derive a simple heuristic rule, which we show performs comparably to the knowledge gradient policy. The second contribution we make is to extend this to a setting where the underlying function is not parametric but rather only locally quadratic. Classical response surface methods sample what is believed to be the optimum, but these experiments tend to have low value of information. By contrast, applying the knowledge gradient when we assume that the true function is quadratic (even though it is not) tends to encourage sampling points close to boundaries, but this ignores the reality that there tends to be very high bias far from the estimated optimum. We assume that there is a bias between the true function and the quadratic approximation that is Lipschitz continuous. When we embed this in our belief as a form of uncertainty, distinct from experimental noise, the result is a policy that encourages sampling away from the estimated optimum, but not too far away (how far depends on the Lipschitz constant). The final part of this dissertation is an application where we study the allocation of official developmental assistance among recipient countries. We extend the work done in an influential economics study by Collier & Dollar (2002), which modeled the world in a static framework, to one that models the world over time under uncertainty. We propose a competing backward approximate dynamic programming policy, which considers the effect of immediate decisions on subsequent years, and compare its performance to the myopic policy of Collier & Dollar (2002). | - |
dc.language.iso | en | - |
dc.publisher | Princeton, NJ : Princeton University | - |
dc.relation.isformatof | The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: http://catalog.princeton.edu | - |
dc.subject | Bayesian optimization | - |
dc.subject | derivative-free optimization | - |
dc.subject | Knowledge gradient | - |
dc.subject | Locally parametric | - |
dc.subject | Stochastic optimization | - |
dc.subject.classification | Operations research | - |
dc.title | Knowledge gradient for expensive locally quadratic functions and stochastic optimization of aid allocation | - |
dc.type | Academic dissertations (Ph.D.) | - |
pu.projectgrantnumber | 690-2143 | - |
Appears in Collections: | Operations Research and Financial Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Aboagye_princeton_0181D_12715.pdf | | 3.18 MB | Adobe PDF | View/Download |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.