Rationalizing Neural Predictions

Regina Barzilay; Tao Lei; Tommi Jaakkola

arxiv: 1606.04155 · v2 · pith:KQYV37KOnew · submitted 2016-06-13 · 💻 cs.CL · cs.NE

Rationalizing Neural Predictions

Tao Lei , Regina Barzilay , Tommi Jaakkola This is my paper

classification 💻 cs.CL cs.NE

keywords rationalesapproachpredictionencodergeneratortextanalysisannotated

0 comments

read the original abstract

Prediction without justification has limited applicability. As a remedy, we learn to extract pieces of input text as justifications -- rationales -- that are tailored to be short and coherent, yet sufficient for making the same prediction. Our approach combines two modular components, generator and encoder, which are trained to operate well together. The generator specifies a distribution over text fragments as candidate rationales and these are passed through the encoder for prediction. Rationales are never given during training. Instead, the model is regularized by desiderata for rationales. We evaluate the approach on multi-aspect sentiment analysis against manually annotated test cases. Our approach outperforms attention-based baseline by a significant margin. We also successfully illustrate the method on the question retrieval task.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Whose Story Gets Told? Positionality and Bias in LLM Summaries of Life Narratives
cs.CL 2026-04 unverdicted novelty 6.0

A proposed pipeline shows LLMs introduce detectable race and gender biases when summarizing life narratives, creating potential for representational harm in research.
HuggingFace's Transformers: State-of-the-art Natural Language Processing
cs.CL 2019-10 accept novelty 6.0

Hugging Face releases an open-source Python library that supplies a unified API and pretrained weights for major Transformer architectures used in natural language processing.
Towards A Rigorous Science of Interpretable Machine Learning
stat.ML 2017-02 unverdicted novelty 6.0

The authors define interpretability for machine learning, specify when it is required, and propose a taxonomy for its rigorous evaluation while identifying open research questions.
Visual Interaction with Deep Learning Models through Collaborative Semantic Inference
cs.HC 2019-07 unverdicted novelty 5.0

Proposes the CSI framework for co-designing visual interactions and deep learning models to expose and allow semantic control over intermediate reasoning processes, shown in a summarization case study.
Learning Patient Engagement in Care Management: Performance vs. Interpretability
cs.LG 2019-06 unverdicted novelty 3.0

A behavioral engagement scoring method predicts patient response propensity in care management using real-world data and supplies interpretable insights via prototypical patients without performance loss.