Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp015q47rr67q
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorHazan, Elad-
dc.contributor.authorHallman, John-
dc.date.accessioned2020-07-24T11:44:13Z-
dc.date.available2020-07-24T11:44:13Z-
dc.date.created2020-05-04-
dc.date.issued2020-07-24-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp015q47rr67q-
dc.description.abstractWe study the problem of non-stochastic online control in the bandit setting, where an agent iteratively observes the state of a dynamical system and selects control input signals with the objective of minimizing some cost over time. In particular, we consider linear dynamical systems with adversarial perturbations, where the only feedback available to the agent is the scalar cost at each time step, and the cost function itself is unknown. For this problem, with an either known or unknown system, we give an efficient sublinear regret algorithm. The main algorithmic difficulty is the dependence of the system on past choices of control signals, which means that one cannot directly apply regular convex optimization techniques to this setting. To overcome this issue, we propose an efficient algorithm for the general setting of bandit convex optimization for loss functions with memory, which may be of independent interest.en_US
dc.format.mimetypeapplication/pdf-
dc.language.isoenen_US
dc.titleORIGINALen_US
dc.titleORIGINALen_US
dc.titleORIGINALen_US
dc.titleNon-Stochastic Control with Bandit Feedbacken_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2020en_US
pu.departmentMathematicsen_US
pu.pdf.coverpageSeniorThesisCoverPage-
pu.contributor.authorid920090365-
pu.certificateApplications of Computing Programen_US
pu.certificateCenter for Statistics and Machine Learning-
pu.certificateApplications of Computing Programen_US
pu.certificateApplications of Computing Programen_US
Appears in Collections:Mathematics, 1934-2020

Files in This Item:
File Description SizeFormat 
HALLMAN-JOHN-THESIS.pdf2.55 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.