pith. sign in

arxiv: 1811.06183 · v1 · pith:6MSTC3DMnew · submitted 2018-11-15 · 💻 cs.CL · cs.AI

Characterizing Design Patterns of EHR-Driven Phenotype Extraction Algorithms

classification 💻 cs.CL cs.AI
keywords designpatternsalgorithmsphenotypeautomaticclassificationdevelopmentattribution
0
0 comments X p. Extension
pith:6MSTC3DM Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{6MSTC3DM}

Prints a linked pith:6MSTC3DM badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

The automatic development of phenotype algorithms from Electronic Health Record data with machine learning (ML) techniques is of great interest given the current practice is very time-consuming and resource intensive. The extraction of design patterns from phenotype algorithms is essential to understand their rationale and standard, with great potential to automate the development process. In this pilot study, we perform network visualization on the design patterns and their associations with phenotypes and sites. We classify design patterns using the fragments from previously annotated phenotype algorithms as the ground truth. The classification performance is used as a proxy for coherence at the attribution level. The bag-of-words representation with knowledge-based features generated a good performance in the classification task (0.79 macro-f1 scores). Good classification accuracy with simple features demonstrated the attribution coherence and the feasibility of automatic identification of design patterns. Our results point to both the feasibility and challenges of automatic identification of phenotyping design patterns, which would power the automatic development of phenotype algorithms.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.