will you find these shortcuts?

Yu-Neng Chuang, Guanchu Wang, Chia-Yuan Chang, Ruixiang Tang, Shaochen Zhong, Fan Yang, Mengnan Du, Xuanting Cai, Xia Hu · 2024 · arXiv 2402.04678

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

representative citing papers

MIMIC: Multimodal Inversion for Model Interpretation and Conceptualization

cs.CV · 2025-08-11 · unverdicted · novelty 7.0

MIMIC is a new inversion framework that recovers visual concepts from VLM internal states using joint inversion, feature alignment, and three regularizers.

When AI Persuades: Adversarial Explanation Attacks on Human Trust in AI-Assisted Decision Making

cs.AI · 2026-02-03 · unverdicted · novelty 6.0

Adversarial explanation attacks preserve nearly all human trust in wrong AI outputs by using persuasive framing, shown in a study varying reasoning, evidence, style, and format with over 200 participants.

ToxiTrace: Gradient-Aligned Training for Explainable Chinese Toxicity Detection

cs.CL · 2026-04-14 · unverdicted · novelty 5.0

ToxiTrace combines CuSA for LLM-refined toxic spans, GCLoss for gradient-focused saliency, and ARCL for contrastive toxic/non-toxic boundaries to improve Chinese toxicity classification and explainable span extraction.

citing papers explorer

Showing 3 of 3 citing papers.

MIMIC: Multimodal Inversion for Model Interpretation and Conceptualization cs.CV · 2025-08-11 · unverdicted · none · ref 1
MIMIC is a new inversion framework that recovers visual concepts from VLM internal states using joint inversion, feature alignment, and three regularizers.
When AI Persuades: Adversarial Explanation Attacks on Human Trust in AI-Assisted Decision Making cs.AI · 2026-02-03 · unverdicted · none · ref 14
Adversarial explanation attacks preserve nearly all human trust in wrong AI outputs by using persuasive framing, shown in a study varying reasoning, evidence, style, and format with over 200 participants.
ToxiTrace: Gradient-Aligned Training for Explainable Chinese Toxicity Detection cs.CL · 2026-04-14 · unverdicted · none · ref 1
ToxiTrace combines CuSA for LLM-refined toxic spans, GCLoss for gradient-focused saliency, and ARCL for contrastive toxic/non-toxic boundaries to improve Chinese toxicity classification and explainable span extraction.

will you find these shortcuts?

fields

years

verdicts

representative citing papers

citing papers explorer