ORIGINAL

Hallman, John

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp015q47rr67q

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Hazan, Elad	-
dc.contributor.author	Hallman, John	-
dc.date.accessioned	2020-07-24T11:44:13Z	-
dc.date.available	2020-07-24T11:44:13Z	-
dc.date.created	2020-05-04	-
dc.date.issued	2020-07-24	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/dsp015q47rr67q	-
dc.description.abstract	We study the problem of non-stochastic online control in the bandit setting, where an agent iteratively observes the state of a dynamical system and selects control input signals with the objective of minimizing some cost over time. In particular, we consider linear dynamical systems with adversarial perturbations, where the only feedback available to the agent is the scalar cost at each time step, and the cost function itself is unknown. For this problem, with an either known or unknown system, we give an efficient sublinear regret algorithm. The main algorithmic difficulty is the dependence of the system on past choices of control signals, which means that one cannot directly apply regular convex optimization techniques to this setting. To overcome this issue, we propose an efficient algorithm for the general setting of bandit convex optimization for loss functions with memory, which may be of independent interest.	en_US
dc.format.mimetype	application/pdf	-
dc.language.iso	en	en_US
dc.title	ORIGINAL	en_US
dc.title	ORIGINAL	en_US
dc.title	ORIGINAL	en_US
dc.title	Non-Stochastic Control with Bandit Feedback	en_US
dc.type	Princeton University Senior Theses	-
pu.date.classyear	2020	en_US
pu.department	Mathematics	en_US
pu.pdf.coverpage	SeniorThesisCoverPage	-
pu.contributor.authorid	920090365	-
pu.certificate	Applications of Computing Program	en_US
pu.certificate	Center for Statistics and Machine Learning	-
pu.certificate	Applications of Computing Program	en_US
pu.certificate	Applications of Computing Program	en_US
Appears in Collections:	Mathematics, 1934-2020

Files in This Item:

File	Description	Size	Format
HALLMAN-JOHN-THESIS.pdf		2.55 MB	Adobe PDF	Request a copy

Show simple item record

Search

Browse