pith. sign in

arxiv: 1906.06465 · v2 · pith:JE3YB7HZnew · submitted 2019-06-13 · 💻 cs.CL · cs.LG· cs.SI· stat.ML

Correlating Twitter Language with Community-Level Health Outcomes

classification 💻 cs.CL cs.LGcs.SIstat.ML
keywords languageoutcomesallowscommunity-levelmedicalmodelpotentiallyadditional
0
0 comments X
read the original abstract

We study how language on social media is linked to diseases such as atherosclerotic heart disease (AHD), diabetes and various types of cancer. Our proposed model leverages state-of-the-art sentence embeddings, followed by a regression model and clustering, without the need of additional labelled data. It allows to predict community-level medical outcomes from language, and thereby potentially translate these to the individual level. The method is applicable to a wide range of target variables and allows us to discover known and potentially novel correlations of medical outcomes with life-style aspects and other socioeconomic risk factors.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.