A new framework combines AI-derived concept embeddings with high-dimensional selective inference to enable statistically principled, interpretable discovery from unstructured data in empirical economics.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
years
2025 3verdicts
UNVERDICTED 3representative citing papers
Introduces Modality Dominance Score (MDS) to measure modality-specific features in VLMs and applies training-free editing to improve bias mitigation, adversarial generation, and modality control.
TEXTER generates zero-shot textual explanations for image classifiers by isolating decision-critical features from contributing neurons, mapping them into CLIP space, and using sparse autoencoders for improved interpretability in Transformers.
citing papers explorer
-
Making Interpretable Discoveries from Unstructured Data: A High-Dimensional Multiple Hypothesis Testing Approach
A new framework combines AI-derived concept embeddings with high-dimensional selective inference to enable statistically principled, interpretable discovery from unstructured data in empirical economics.
-
Beyond Cross-Modal Alignment: Measuring and Leveraging Modality Gap in Vision-Language Models
Introduces Modality Dominance Score (MDS) to measure modality-specific features in VLMs and applies training-free editing to improve bias mitigation, adversarial generation, and modality control.
-
Zero-Shot Textual Explanations via Translating Decision-Critical Features
TEXTER generates zero-shot textual explanations for image classifiers by isolating decision-critical features from contributing neurons, mapping them into CLIP space, and using sparse autoencoders for improved interpretability in Transformers.