hub

Captum: A unified and generic model interpretability library for PyTorch

Captum: A unified, generic model interpretability library for pytorch , author= · 2009 · arXiv 2009.07896

14 Pith papers cite this work. Polarity classification is still indexing.

14 Pith papers citing it

read on arXiv browse 14 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 2 method 1

citation-polarity summary

background 2 use method 1

representative citing papers

MetaBackdoor: Exploiting Positional Encoding as a Backdoor Attack Surface in LLMs

cs.CR · 2026-05-14 · unverdicted · novelty 7.0

MetaBackdoor shows that LLMs can be backdoored using positional triggers like sequence length, enabling stealthy activation on clean inputs to leak system prompts or trigger malicious behavior.

Modeling Subjective Urban Perception with Human Gaze

cs.CV · 2026-05-01 · unverdicted · novelty 7.0

Gaze data from eye-tracking carries predictive signals for subjective urban perception and improves accuracy when fused with image-based scene representations.

Feature Attribution Stability Suite: How Stable Are Post-Hoc Attributions?

cs.CV · 2026-04-02 · unverdicted · novelty 7.0

FASS benchmark shows post-hoc attributions remain unstable under geometric perturbations even after filtering for unchanged predictions, with Grad-CAM exhibiting the highest stability across ImageNet, COCO, and CIFAR-10.

MobileMold: A Smartphone-Based Microscopy Dataset for Food Mold Detection

cs.CV · 2026-03-02 · unverdicted · novelty 7.0

MobileMold provides 4941 smartphone microscopy images and shows deep learning models reach 99.5% accuracy on mold detection and food classification tasks.

AIMing for Standardised Explainability Evaluation in GNNs: A Framework and Case Study on Graph Kernel Networks

cs.LG · 2026-05-14 · unverdicted · novelty 6.0

AIM is a new evaluation framework for explainability in GNNs that combines accuracy, instance-level, and model-level measures, applied to graph kernel networks to create an improved model xGKN.

Instructions Shape Production of Language, not Processing

cs.CL · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

Instructions trigger a production-centered mechanism in language models, with task-specific information stable in input tokens but varying strongly in output tokens and correlating with behavior.

Enabling Performant and Flexible Model-Internal Observability for LLM Inference

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

DMI-Lib delivers 0.4-6.8% overhead for offline batch LLM inference and ~6% for moderate online serving while exposing rich internal signals across backends, cutting latency overhead 2-15x versus prior observability baselines.

Scaling Vision Models Does Not Consistently Improve Localisation-Based Explanation Quality

cs.CV · 2026-05-11 · accept · novelty 6.0

Scaling vision models by depth and parameter count does not consistently improve localisation-based explanation quality across architectures, datasets, and post-hoc methods; smaller models often perform comparably or better.

Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

Hallucinations in diffusion models are driven by local intrinsic dimension instabilities on the manifold, which Intrinsic Quenching corrects by deflating it.

X-SYS: A Reference Architecture for Interactive Explanation Systems

cs.AI · 2026-02-13 · unverdicted · novelty 6.0

X-SYS is a reference architecture for interactive explanation systems organized around STAR quality attributes and five service components, demonstrated via SemanticLens for vision-language models.

Delta-XAI: A Unified Framework for Explaining Prediction Changes in Online Time Series Monitoring

cs.LG · 2025-11-28 · unverdicted · novelty 6.0

Delta-XAI wraps existing XAI methods for online time series and introduces SWING to explain prediction changes while accounting for temporal dependencies.

ExECG: An Explainable AI Framework for ECG models

cs.LG · 2026-05-19 · unverdicted · novelty 5.0

ExECG is a Python framework providing Wrapper, Explainer, and Visualizer stages to unify XAI methods for ECG models and improve reproducibility.

Predicting the thermodynamics in the chromosphere from the translation of SDO data into the IRIS$^{2}$ inversion results using a visual transformer model

astro-ph.SR · 2026-04-23 · unverdicted · novelty 5.0

A visual transformer model trained on IRIS inversions predicts chromospheric temperature and density from SDO data with correlations around 0.8 on 80% of test cases.

Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

cs.CL · 2026-05-13

citing papers explorer

Showing 14 of 14 citing papers.

MetaBackdoor: Exploiting Positional Encoding as a Backdoor Attack Surface in LLMs cs.CR · 2026-05-14 · unverdicted · none · ref 46
MetaBackdoor shows that LLMs can be backdoored using positional triggers like sequence length, enabling stealthy activation on clean inputs to leak system prompts or trigger malicious behavior.
Modeling Subjective Urban Perception with Human Gaze cs.CV · 2026-05-01 · unverdicted · none · ref 28
Gaze data from eye-tracking carries predictive signals for subjective urban perception and improves accuracy when fused with image-based scene representations.
Feature Attribution Stability Suite: How Stable Are Post-Hoc Attributions? cs.CV · 2026-04-02 · unverdicted · none · ref 16
FASS benchmark shows post-hoc attributions remain unstable under geometric perturbations even after filtering for unchanged predictions, with Grad-CAM exhibiting the highest stability across ImageNet, COCO, and CIFAR-10.
MobileMold: A Smartphone-Based Microscopy Dataset for Food Mold Detection cs.CV · 2026-03-02 · unverdicted · none · ref 20
MobileMold provides 4941 smartphone microscopy images and shows deep learning models reach 99.5% accuracy on mold detection and food classification tasks.
AIMing for Standardised Explainability Evaluation in GNNs: A Framework and Case Study on Graph Kernel Networks cs.LG · 2026-05-14 · unverdicted · none · ref 40
AIM is a new evaluation framework for explainability in GNNs that combines accuracy, instance-level, and model-level measures, applied to graph kernel networks to create an improved model xGKN.
Instructions Shape Production of Language, not Processing cs.CL · 2026-05-11 · unverdicted · none · ref 246 · 2 links
Instructions trigger a production-centered mechanism in language models, with task-specific information stable in input tokens but varying strongly in output tokens and correlating with behavior.
Enabling Performant and Flexible Model-Internal Observability for LLM Inference cs.LG · 2026-05-11 · unverdicted · none · ref 18
DMI-Lib delivers 0.4-6.8% overhead for offline batch LLM inference and ~6% for moderate online serving while exposing rich internal signals across backends, cutting latency overhead 2-15x versus prior observability baselines.
Scaling Vision Models Does Not Consistently Improve Localisation-Based Explanation Quality cs.CV · 2026-05-11 · accept · none · ref 45
Scaling vision models by depth and parameter count does not consistently improve localisation-based explanation quality across architectures, datasets, and post-hoc methods; smaller models often perform comparably or better.
Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models cs.CV · 2026-05-06 · unverdicted · none · ref 37
Hallucinations in diffusion models are driven by local intrinsic dimension instabilities on the manifold, which Intrinsic Quenching corrects by deflating it.
X-SYS: A Reference Architecture for Interactive Explanation Systems cs.AI · 2026-02-13 · unverdicted · none · ref 54
X-SYS is a reference architecture for interactive explanation systems organized around STAR quality attributes and five service components, demonstrated via SemanticLens for vision-language models.
Delta-XAI: A Unified Framework for Explaining Prediction Changes in Online Time Series Monitoring cs.LG · 2025-11-28 · unverdicted · none · ref 5
Delta-XAI wraps existing XAI methods for online time series and introduces SWING to explain prediction changes while accounting for temporal dependencies.
ExECG: An Explainable AI Framework for ECG models cs.LG · 2026-05-19 · unverdicted · none · ref 6
ExECG is a Python framework providing Wrapper, Explainer, and Visualizer stages to unify XAI methods for ECG models and improve reproducibility.
Predicting the thermodynamics in the chromosphere from the translation of SDO data into the IRIS$^{2}$ inversion results using a visual transformer model astro-ph.SR · 2026-04-23 · unverdicted · none · ref 17
A visual transformer model trained on IRIS inversions predicts chromospheric temperature and density from SDO data with correlations around 0.8 on 80% of test cases.
Many-Shot CoT-ICL: Making In-Context Learning Truly Learn cs.CL · 2026-05-13 · unreviewed · ref 6

Captum: A unified and generic model interpretability library for PyTorch

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer