Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01cc08hj45c
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorWeinberg, Matthew-
dc.contributor.authorBalasubramanian, Rachana-
dc.date.accessioned2019-07-24T17:55:06Z-
dc.date.available2019-07-24T17:55:06Z-
dc.date.created2019-05-04-
dc.date.issued2019-07-24-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01cc08hj45c-
dc.description.abstractIn an increasingly online world, it has become progressively difficult to manually parse all the reviews that we interact with on a daily basis. From products to even scholarly work, the amount of data available has accumulated exponentially. Businesses like Amazon and Yelp have attempted to provide insight by adding voting and reporting systems that reward reviews that actually help people, such as through a "helpful" tick box or by voting up or down on reviews. However, this still requires manual effort. Past research has looked into judging review sentiment in order to more succinctly inform customers, but little work has been done in judging the "helpfulness" of a review. My work looks to improve the process of judging how helpful a review is using automated text analysis and classification. I compare a bag of words representation and word vector representation, and train these representations on a variety of machine learning models to attempt to classify a review as helpful or not helpful. The goal of this work is to gather insight into what makes a given review useful to the reader, and in testing a multitude of models and representations, find the most efficient and accurate method of classification. Using these automated techniques will allow for faster and more efficient parsing of the multitudes of review data that the average person is exposed to on a frequent basis.en_US
dc.format.mimetypeapplication/pdf-
dc.language.isoenen_US
dc.titleUsing Text Classification Models to Determine Review Helpfulnessen_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2019en_US
pu.departmentComputer Scienceen_US
pu.pdf.coverpageSeniorThesisCoverPage-
pu.contributor.authorid961166869-
pu.certificateCenter for Statistics and Machine Learningen_US
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File Description SizeFormat 
BALASUBRAMANIAN-RACHANA-THESIS.pdf752.21 kBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.