Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01g445cd223
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorBlei, David Men_US
dc.contributor.authorGerrish, Sean Michaelen_US
dc.contributor.otherComputer Science Departmenten_US
dc.date.accessioned2013-05-08T13:42:51Z-
dc.date.available2013-05-08T13:42:51Z-
dc.date.issued2013en_US
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01g445cd223-
dc.description.abstractThe past twenty years have seen an avalanche of digital information which is overwhelming people in industry, government, and academics. This avalanche is two-sided: while the past decade has seen an onslaught of digitized records -- as governments, publishers, and researchers race to make their records digital, the electronic and software tools for computationally analyzing this data have quickly evolved to face this challenge. Many of these challenges evolve around recurring patterns, including the presence of text, bits of information about pairs of items, and sequential observations. In this work we present several methods to address these challenges in data analysis which take advantage of these recurring patterns. We begin with a method for identifying influential documents in a collection which evolves over time. We demonstrate that by encoding our assumptions about influential documents in a statistical model of the changes in textual themes, we are able to provide an alternative bibliometric which provides results consistent with---yet different from---traditional metrics of influence such as citation counts. We then introduce a model for measuring the relationships between pairs of countries over time. We will demonstrate that this model is able to learn meaningful relationships between countries which is extraordinarily consistent across different human labels. We next address limitations in existing models of legislative voting. In one extention we predict legislators' votes by using the text of the bills they are voting on combined with individual legislators' past voting behavior. We then introduce a method for inferring these lawmakers' positions on specific issues. A recurring theme in the methods we present is that by using a small set of statistical primitives, we are able to apply known (or mildly adapted) methods to new problems. Several advances in the past few decades in statistical modeling will make the development and discussion of our models easier, as they will provide both this set of primitives (which can be interchanged easily) and the tools for working with them. As a final contribution, we describe a new method for fitting a statistical model with variational inference, without the time investment typically required of practitioners.en_US
dc.language.isoenen_US
dc.publisherPrinceton, NJ : Princeton Universityen_US
dc.relation.isformatofThe Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the <a href=http://catalog.princeton.edu> library's main catalog </a>en_US
dc.subjectinfluenceen_US
dc.subjectprobabilityen_US
dc.subjectsocial sciencesen_US
dc.subjectstatisticsen_US
dc.subjecttexten_US
dc.subjectvotingen_US
dc.subject.classificationComputer scienceen_US
dc.subject.classificationStatisticsen_US
dc.subject.classificationSocial researchen_US
dc.titleApplications of Latent Variable Models for Modeling Influence and Decision Makingen_US
dc.typeAcademic dissertations (Ph.D.)en_US
pu.projectgrantnumber690-2143en_US
Appears in Collections:Computer Science

Files in This Item:
File Description SizeFormat 
Gerrish_princeton_0181D_10499.pdf3.43 MBAdobe PDFView/Download


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.