Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/99999/fk4kw6vx1p
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorFan, Jianqing
dc.contributor.authorXue, Lirong
dc.contributor.otherOperations Research and Financial Engineering Department
dc.date.accessioned2021-10-04T13:26:20Z-
dc.date.available2021-10-04T13:26:20Z-
dc.date.created2021-01-01
dc.date.issued2021
dc.identifier.urihttp://arks.princeton.edu/ark:/99999/fk4kw6vx1p-
dc.description.abstractThis dissertation focuses on understanding several finance and economics problems using big data. We first process and analyze financial textual data to investigate how sentiments can be learned directly from news data. We present a novel framework for textual analysis based on the factor model and sparsity regularization, called FarmPredict, to let machines learn financial returns from news data automatically without the help of prior knowledge like sentiment dictionaries. We validate our method using the articles in the Chinese stock market and analyze the magnitude, sources, and durations of effects caused by the positive or negative sentiments scored. Then we focus on high-frequency financial data and study the question of how and when returns are predictable at a high-frequency level. We quantify and document the predictability and its universality in very short horizons of a few seconds or transactions. We discover the inherent sources of this predictability and extrinsic cross-sectional and time-series factors and market environments affecting the predictability. We also examine how the predictability is affected by the timeliness of data. Finally, we investigate the problem of measuring housing activity in real estate economics. Timely measuring housing activeness at high-resolutions is critical for policy-making and urban design but hard, if not impossible. By using energy consumption data to infer housing activeness and modeling several disjoint datasets including nightlight satellite images and land-use data, we build an approach that can explain the majority of the spatial and temporal variation in monthly housing usages.
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherPrinceton, NJ : Princeton University
dc.relation.isformatofThe Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: <a href=http://catalog.princeton.edu>catalog.princeton.edu</a>
dc.subject.classificationStatistics
dc.subject.classificationFinance
dc.titleBig Data in Financial Economics
dc.typeAcademic dissertations (Ph.D.)
pu.date.classyear2021
pu.departmentOperations Research and Financial Engineering
Appears in Collections:Operations Research and Financial Engineering

Files in This Item:
File SizeFormat 
Xue_princeton_0181D_13805.pdf4.97 MBAdobe PDFView/Download


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.