pith. sign in

arxiv: 1606.07137 · v1 · pith:2ZR35L32new · submitted 2016-06-22 · 💻 cs.AI · cs.CL· cs.IR

Automated Extraction of Number of Subjects in Randomised Controlled Trials

classification 💻 cs.AI cs.CLcs.IR
keywords approachcontrollednumberrandomisedsmallsubjectstrialsabstracts
0
0 comments X
read the original abstract

We present a simple approach for automatically extracting the number of subjects involved in randomised controlled trials (RCT). Our approach first applies a set of rule-based techniques to extract candidate study sizes from the abstracts of the articles. Supervised classification is then performed over the candidates with support vector machines, using a small set of lexical, structural, and contextual features. With only a small annotated training set of 201 RCTs, we obtained an accuracy of 88\%. We believe that this system will aid complex medical text processing tasks such as summarisation and question answering.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.