pith. sign in

arxiv: 1404.3026 · v1 · pith:GIALFETBnew · submitted 2014-04-11 · 💻 cs.SI · cs.CL· cs.LG

On the Ground Validation of Online Diagnosis with Twitter and Medical Records

classification 💻 cs.SI cs.CLcs.LG
keywords diseaseindividualtwitteranalysisdatadetectiondevelopdiagnosis
0
0 comments X
read the original abstract

Social media has been considered as a data source for tracking disease. However, most analyses are based on models that prioritize strong correlation with population-level disease rates over determining whether or not specific individual users are actually sick. Taking a different approach, we develop a novel system for social-media based disease detection at the individual level using a sample of professionally diagnosed individuals. Specifically, we develop a system for making an accurate influenza diagnosis based on an individual's publicly available Twitter data. We find that about half (17/35 = 48.57%) of the users in our sample that were sick explicitly discuss their disease on Twitter. By developing a meta classifier that combines text analysis, anomaly detection, and social network analysis, we are able to diagnose an individual with greater than 99% accuracy even if she does not discuss her health.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.