Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01sb397b99n
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorKernighan, Brian-
dc.contributor.authorJi, Jessica-
dc.date.accessioned2018-08-14T15:13:13Z-
dc.date.available2018-08-14T15:13:13Z-
dc.date.created2018-05-05-
dc.date.issued2018-08-14-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01sb397b99n-
dc.description.abstractThe emerging field of digital humanities attempts to address how computational tools and techniques can best be employed in the study of the humanities. The "humanities," as they are commonly understood, encompass a broad range of fields including language studies, literature, history, the classics, and the arts, although some of these fields may also overlap with the social sciences. Although few agree on what precisely the exact nature and scope of the digital humanities are, one fundamental challenge of the field is the complexity of data collection and analysis. This thesis centers on two guiding questions: why is digital humanities data analysis so challenging, and how can effective tools be designed to combat these challenges? To explore this second question more thoroughly this thesis introduces TableReader, a web application designed to simplify the extraction of tabular data from PDF documents, in the interest of developing a set of guiding principles surrounding digital humanities tool design and implementation.en_US
dc.format.mimetypeapplication/pdf-
dc.language.isoenen_US
dc.titleTableReader: A Digital Humanities PDF Extraction Toolen_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2018en_US
pu.departmentComputer Scienceen_US
pu.pdf.coverpageSeniorThesisCoverPage-
pu.contributor.authorid960961878-
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File Description SizeFormat 
JI-JESSICA-THESIS.pdf1.94 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.