The disagreement problem in explainable machine learning: A practi- tioner’s perspective

· 2022 · arXiv 2202.01602

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 3

citation-polarity summary

background 2 support 1

representative citing papers

The Attribution Impossibility: No Feature Ranking Is Faithful, Stable, and Complete Under Collinearity

cs.LG · 2026-04-08 · accept · novelty 8.0

Proves an impossibility theorem that no feature attribution ranking can be faithful, stable, and complete under collinearity, characterizes the design space as two families, introduces the DASH ensemble method, and formally verifies all claims in Lean 4.

Quantifying Explanation Consistency: The C-Score Metric for CAM-Based Explainability in Medical Image Classification

cs.CV · 2026-04-09 · unverdicted · novelty 7.0

The C-Score quantifies intra-class explanation consistency for CAM methods via confidence-weighted pairwise soft IoU and detects AUC-consistency dissociation as an early warning for model instability on chest X-ray classification.

Explanation Bias is a Product: Revealing the Hidden Lexical and Position Preferences in Post-Hoc Feature Attribution

cs.CL · 2025-12-11 · unverdicted · novelty 6.0

Explanation biases in feature attribution methods are systematic products of lexical and positional preferences, with observed trade-offs across models and higher bias in anomalous explanations.

Interpretable and Explainable Surrogate Modeling for Simulations: A State-of-the-Art Survey and Perspectives on Explainable AI for Decision-Making

cs.AI · 2026-04-15 · unverdicted · novelty 5.0

This survey synthesizes XAI methods with surrogate modeling workflows for simulations and outlines a research agenda to embed explainability into simulation-driven design and decision-making.

ToxiTrace: Gradient-Aligned Training for Explainable Chinese Toxicity Detection

cs.CL · 2026-04-14 · unverdicted · novelty 5.0

ToxiTrace combines CuSA for LLM-refined toxic spans, GCLoss for gradient-focused saliency, and ARCL for contrastive toxic/non-toxic boundaries to improve Chinese toxicity classification and explainable span extraction.

Persistent and Conversational Multi-Method Explainability for Trustworthy Financial AI

cs.AI · 2026-05-12 · unverdicted · novelty 4.0

An architecture stores XAI explanations persistently in searchable storage and uses RAG to synthesize multiple methods conversationally, cutting hallucination rates by 36% in a FinBERT financial sentiment demo.

citing papers explorer

Showing 6 of 6 citing papers.

The Attribution Impossibility: No Feature Ranking Is Faithful, Stable, and Complete Under Collinearity cs.LG · 2026-04-08 · accept · full · ref 67
Proves an impossibility theorem that no feature attribution ranking can be faithful, stable, and complete under collinearity, characterizes the design space as two families, introduces the DASH ensemble method, and formally verifies all claims in Lean 4.
Quantifying Explanation Consistency: The C-Score Metric for CAM-Based Explainability in Medical Image Classification cs.CV · 2026-04-09 · unverdicted · none · ref 20
The C-Score quantifies intra-class explanation consistency for CAM methods via confidence-weighted pairwise soft IoU and detects AUC-consistency dissociation as an early warning for model instability on chest X-ray classification.
Explanation Bias is a Product: Revealing the Hidden Lexical and Position Preferences in Post-Hoc Feature Attribution cs.CL · 2025-12-11 · unverdicted · none · ref 3
Explanation biases in feature attribution methods are systematic products of lexical and positional preferences, with observed trade-offs across models and higher bias in anomalous explanations.
Interpretable and Explainable Surrogate Modeling for Simulations: A State-of-the-Art Survey and Perspectives on Explainable AI for Decision-Making cs.AI · 2026-04-15 · unverdicted · none · ref 223
This survey synthesizes XAI methods with surrogate modeling workflows for simulations and outlines a research agenda to embed explainability into simulation-driven design and decision-making.
ToxiTrace: Gradient-Aligned Training for Explainable Chinese Toxicity Detection cs.CL · 2026-04-14 · unverdicted · none · ref 2
ToxiTrace combines CuSA for LLM-refined toxic spans, GCLoss for gradient-focused saliency, and ARCL for contrastive toxic/non-toxic boundaries to improve Chinese toxicity classification and explainable span extraction.
Persistent and Conversational Multi-Method Explainability for Trustworthy Financial AI cs.AI · 2026-05-12 · unverdicted · none · ref 11
An architecture stores XAI explanations persistently in searchable storage and uses RAG to synthesize multiple methods conversationally, cutting hallucination rates by 36% in a FinBERT financial sentiment demo.

The disagreement problem in explainable machine learning: A practi- tioner’s perspective

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer