
arxiv: 1602.05568 · v1 · pith:CH4WNX33new · submitted 2016-02-17 · 💻 cs.LG

Multi-layer Representation Learning for Medical Concepts

classification 💻 cs.LG
keywords: codes, concepts, medical, representations, visits, applications, visit, diagnosis


Learning efficient representations for concepts has been proven to be an important basis for many applications such as machine translation or document classification. Proper representations of medical concepts such as diagnosis, medication, procedure codes and visits will have broad applications in healthcare analytics. However, in Electronic Health Records (EHR) the visit sequences of patients include multiple concepts (diagnosis, procedure, and medication codes) per visit. This structure provides two types of relational information, namely sequential order of visits and co-occurrence of the codes within each visit. In this work, we propose Med2Vec, which not only learns distributed representations for both medical codes and visits from a large EHR dataset with over 3 million visits, but also allows us to interpret the learned representations confirmed positively by clinical experts. In the experiments, Med2Vec displays significant improvement in key medical applications compared to popular baselines such as Skip-gram, GloVe and stacked autoencoder, while providing clinically meaningful interpretation.
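The two kinds of relational information named in the abstract (code co-occurrence within a visit, and the sequential order of visits) can be illustrated with a small data-structure sketch. This is not the paper's implementation: the toy record, the code strings, the function names `intra_visit_pairs` and `adjacent_visit_pairs`, and the `window` parameter are all hypothetical, chosen only to show what each kind of relation might look like once extracted from a patient's visit sequence.

```python
from itertools import permutations

# Hypothetical toy record: one patient's visits in chronological order.
# Each visit is an unordered set of medical codes (diagnosis, medication, procedure).
patient_visits = [
    {"dx:250.00", "rx:metformin"},
    {"dx:250.00", "dx:401.9", "proc:99213"},
    {"dx:401.9"},
]


def intra_visit_pairs(visits):
    """Return ordered (code, co-occurring code) pairs drawn from the same visit."""
    pairs = []
    for visit in visits:
        # sorted() only makes the output order deterministic for this illustration.
        pairs.extend(permutations(sorted(visit), 2))
    return pairs


def adjacent_visit_pairs(visits, window=1):
    """Return (visit index, neighbor index) pairs for visits within `window` steps."""
    pairs = []
    for t in range(len(visits)):
        lo = max(0, t - window)
        hi = min(len(visits), t + window + 1)
        for s in range(lo, hi):
            if s != t:
                pairs.append((t, s))
    return pairs


if __name__ == "__main__":
    print(len(intra_visit_pairs(patient_visits)), "code co-occurrence pairs")
    print(adjacent_visit_pairs(patient_visits), "neighboring visit pairs")
```

In a sketch like this, the first list would feed a code-level objective and the second a visit-level objective; how Med2Vec actually combines the two is described in the paper itself, not here.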
