Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01s4655k03t
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | Fellbaum, Christiane | -
dc.contributor.author | Tang, Eugene | -
dc.date.accessioned | 2016-06-30T15:11:41Z | -
dc.date.available | 2016-06-30T15:11:41Z | -
dc.date.created | 2016-04-29 | -
dc.date.issued | 2016-06-30 | -
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01s4655k03t | -
dc.description.abstract | Affecting 350 million people worldwide, depression is a source of serious costs to both personal and public well-being. However, most individuals with depression do not receive proper treatment. One possible way to address this issue is through automatic screening of individuals for depression. In this study we explore the possibility of automatically detecting individuals with depression through their behavior on Twitter. We build off of previous studies in the following ways. First, we identify a group of depressed and non-depressed users through Twitter’s API. Next, we explore the effect of additional features (topic-based features, removing retweets, and standardization of tweets) on the predictive ability of the logistic regression and SVM models to differentiate between depressed and non-depressed users. Using data collected at two different time points, we test the robustness of our model over time and over different incidence rates. Finally, we explore the possible implementations of a screening tool based on our work. Our findings demonstrate, first, the feasibility of discriminating depressed from non-depressed users through Twitter’s API, and second, the relative robustness of our model over time, although our additional features do not make a large difference in predictive power. However, our classifier’s performance significantly decreases when the incidence rate of the testing dataset is decreased to a more realistic level of 7.6%. This finding indicates that the incidence rate of depression in the training and testing datasets is an important additional factor to consider in future studies. Overall, although much work remains to be done before such a tool could be implemented, our work provides additional evidence that it may indeed be feasible to identify individuals struggling with depression through their behavior on Twitter. | en_US
dc.format.extent | 95 pages | *
dc.language.iso | en_US | en_US
dc.title | Identifying Signs of Depression on Twitter | en_US
dc.type | Princeton University Senior Theses | -
pu.date.classyear | 2016 | en_US
pu.department | Computer Science | en_US
pu.pdf.coverpage | SeniorThesisCoverPage | -
Appears in Collections: Computer Science, 1988-2020

Files in This Item:
File | Size | Format
Tan_Eugene_2016_Thesis.pdf | 1.67 MB | Adobe PDF


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.