arxiv: 1805.06975 · v1 · pith:525RNYDHnew · submitted 2018-05-17 · 💻 cs.CL

Tracking State Changes in Procedural Text: A Challenge Dataset and Models for Process Paragraph Comprehension

classification 💻 cs.CL
keywords models, dataset, text, propara, changes, data, existence, location

We present a new dataset and models for comprehending paragraphs about processes (e.g., photosynthesis), an important genre of text describing a dynamic world. The new dataset, ProPara, is the first to contain natural (rather than machine-generated) text about a changing world along with a full annotation of entity states (location and existence) during those changes (81k datapoints). The end-task, tracking the location and existence of entities through the text, is challenging because the causal effects of actions are often implicit and need to be inferred. We find that previous models that have worked well on synthetic data achieve only mediocre performance on ProPara, and introduce two new neural models that exploit alternative mechanisms for state prediction, in particular using LSTM input encoding and span prediction. The new models improve accuracy by up to 19%. The dataset and models are available to the community at http://data.allenai.org/propara.
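
The end-task in the abstract, tracking the location and existence of each entity through a paragraph, can be pictured as a grid of entities by steps. The following is a minimal sketch under stated assumptions: the `StateGrid` class, its field names, and the toy paragraph are hypothetical and invented for illustration. They are not the released ProPara file format or the models described in the paper.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# Hypothetical encoding: a string is a location, None means the entity
# does not exist at that point in the process.
State = Optional[str]


@dataclass
class StateGrid:
    """Entity states for one process paragraph (illustrative layout only)."""
    sentences: List[str]
    # For each entity: index 0 is the state before the first sentence,
    # index i is the state after sentence i.
    states: Dict[str, List[State]]

    def changes(self, entity: str) -> List[Tuple[int, State, State]]:
        """Return (step, before, after) for each step where the state changes."""
        seq = self.states[entity]
        return [
            (i, seq[i - 1], seq[i])
            for i in range(1, len(seq))
            if seq[i] != seq[i - 1]
        ]

    def exists_after(self, entity: str, step: int) -> bool:
        """True if the entity exists after the given step (0 = initial state)."""
        return self.states[entity][step] is not None


# Invented toy paragraph, not taken from the dataset.
example = StateGrid(
    sentences=[
        "Roots absorb water from the soil.",
        "The water travels to the leaf.",
        "The leaf converts the water into sugar.",
    ],
    states={
        "water": ["soil", "root", "leaf", None],
        "sugar": [None, None, None, "leaf"],
    },
)

if __name__ == "__main__":
    for entity in example.states:
        print(entity, example.changes(entity))
```

In this toy grid the last sentence never says that the water is destroyed or where the sugar ends up; both must be inferred, which is the kind of implicit effect the abstract identifies as the main difficulty.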

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Be Consistent! Improving Procedural Text Comprehension using Label Consistency

    cs.CL 2019-06 unverdicted novelty 5.0

    A label consistency training framework improves F1 on the ProPara benchmark for procedural text comprehension by using multiple independent descriptions of the same process.