editor =

Jay DeYoung, Sarthak Jain, Nazneen Fatema Rajani, Eric Lehman, Caiming Xiong, Richard Socher, Byron C Wallace · 2020 · DOI 10.18653/v1/2020.acl-main.408

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

open at publisher browse 7 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

What LLMs explain is not what they believe: Evaluating explanation sufficiency under models' own input beliefs

cs.LG · 2026-06-26 · unverdicted · novelty 6.0

Proposes SCSuff metric for evaluating LLM explanation sufficiency via model-generated alternative inputs, showing explanations are typically insufficient and predictable from hidden states.

Don't Go Breaking My LLM: The Impact of Pruning Attention Layers on Explanation Faithfulness and Confidence Calibration

cs.LG · 2026-06-23 · unverdicted · novelty 6.0

Pruning attention layers in five LLMs across eight datasets maintains accuracy but degrades faithfulness and calibration.

Beyond Topical Similarity: Contrastive Evidence Retrieval with Interpretable Attention Alignment in RAG

cs.CL · 2026-05-31 · unverdicted · novelty 6.0

CERA fine-tunes a dense retriever with triplet contrastive learning plus attention alignment to human rationales, claiming better retrieval effectiveness and faithfulness on clinical trial reports than Contriever and standard hard-negative baselines.

From Articles to Premises: Building PrimeFacts, an Extraction Methodology and Resource for Fact-Checking Evidence

cs.CL · 2026-05-07 · unverdicted · novelty 6.0

PrimeFacts extracts decontextualized premises from fact-check articles, raising evidence retrieval MRR by up to 30% and verdict prediction Macro-F1 by 10-20 points over baselines.

EPPC-OASIS: Ontology-Aware Adaptation and Structured Inference Refinement for Electronic Patient-Provider Communication Mining in Secure Messages

cs.AI · 2026-05-22 · unverdicted · novelty 5.0

EPPC-OASIS combines ontology-aware fine-tuning via Wasserstein alignment with structured inference refinement to extract EPPC codes from secure messages, reporting 77.13% Code+Sub-code F1 and 63.83% Triplet F1 with small gains over supervised fine-tuning baselines.

NEURON: A Neuro-symbolic System for Grounded Clinical Explainability

cs.AI · 2026-05-02 · unverdicted · novelty 5.0 · 2 refs

NEURON integrates SNOMED CT, ML, and RAG LLM to raise AUC from 0.74-0.77 to 0.84-0.88 and human-aligned explainability scores from 0.50 to 0.85 on MIMIC-IV acute heart failure data.

Large Language Models as Explainable Cyberattack Detectors for Energy Industrial Control Systems

cs.CR · 2026-04-28 · unverdicted · novelty 5.0

An off-the-shelf LLM prompted on tokenized Modbus traffic from public ICS datasets matches supervised baselines in normal-versus-critical classification accuracy while generating token-grounded audit records without any model updates.

citing papers explorer

Showing 7 of 7 citing papers after filters.

What LLMs explain is not what they believe: Evaluating explanation sufficiency under models' own input beliefs cs.LG · 2026-06-26 · unverdicted · none · ref 54
Proposes SCSuff metric for evaluating LLM explanation sufficiency via model-generated alternative inputs, showing explanations are typically insufficient and predictable from hidden states.
Don't Go Breaking My LLM: The Impact of Pruning Attention Layers on Explanation Faithfulness and Confidence Calibration cs.LG · 2026-06-23 · unverdicted · none · ref 11
Pruning attention layers in five LLMs across eight datasets maintains accuracy but degrades faithfulness and calibration.
Beyond Topical Similarity: Contrastive Evidence Retrieval with Interpretable Attention Alignment in RAG cs.CL · 2026-05-31 · unverdicted · none · ref 13
CERA fine-tunes a dense retriever with triplet contrastive learning plus attention alignment to human rationales, claiming better retrieval effectiveness and faithfulness on clinical trial reports than Contriever and standard hard-negative baselines.
From Articles to Premises: Building PrimeFacts, an Extraction Methodology and Resource for Fact-Checking Evidence cs.CL · 2026-05-07 · unverdicted · none · ref 13
PrimeFacts extracts decontextualized premises from fact-check articles, raising evidence retrieval MRR by up to 30% and verdict prediction Macro-F1 by 10-20 points over baselines.
EPPC-OASIS: Ontology-Aware Adaptation and Structured Inference Refinement for Electronic Patient-Provider Communication Mining in Secure Messages cs.AI · 2026-05-22 · unverdicted · none · ref 10
EPPC-OASIS combines ontology-aware fine-tuning via Wasserstein alignment with structured inference refinement to extract EPPC codes from secure messages, reporting 77.13% Code+Sub-code F1 and 63.83% Triplet F1 with small gains over supervised fine-tuning baselines.
NEURON: A Neuro-symbolic System for Grounded Clinical Explainability cs.AI · 2026-05-02 · unverdicted · none · ref 26 · 2 links
NEURON integrates SNOMED CT, ML, and RAG LLM to raise AUC from 0.74-0.77 to 0.84-0.88 and human-aligned explainability scores from 0.50 to 0.85 on MIMIC-IV acute heart failure data.
Large Language Models as Explainable Cyberattack Detectors for Energy Industrial Control Systems cs.CR · 2026-04-28 · unverdicted · none · ref 4
An off-the-shelf LLM prompted on tokenized Modbus traffic from public ICS datasets matches supervised baselines in normal-versus-critical classification accuracy while generating token-grounded audit records without any model updates.

editor =

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer