Label-Free Concept Bottleneck Models

Lam M. Nguyen; Subhro Das; Tsui-Wei Weng; Tuomas Oikarinen

arxiv: 2304.06129 · v2 · pith:X3EFSBVRnew · submitted 2023-04-12 · 💻 cs.LG · cs.CV

Label-Free Concept Bottleneck Models

Tuomas Oikarinen , Subhro Das , Lam M. Nguyen , Tsui-Wei Weng This is my paper

classification 💻 cs.LG cs.CV

keywords conceptlabel-freeneuralaccuracybottleneckcbmsconceptscreating

0 comments

read the original abstract

Concept bottleneck models (CBM) are a popular way of creating more interpretable neural networks by having hidden layer neurons correspond to human-understandable concepts. However, existing CBMs and their variants have two crucial limitations: first, they need to collect labeled data for each of the predefined concepts, which is time consuming and labor intensive; second, the accuracy of a CBM is often significantly lower than that of a standard neural network, especially on more complex datasets. This poor performance creates a barrier for adopting CBMs in practical real world applications. Motivated by these challenges, we propose Label-free CBM which is a novel framework to transform any neural network into an interpretable CBM without labeled concept data, while retaining a high accuracy. Our Label-free CBM has many advantages, it is: scalable - we present the first CBM scaled to ImageNet, efficient - creating a CBM takes only a few hours even for very large datasets, and automated - training it for a new dataset requires minimal human effort. Our code is available at https://github.com/Trustworthy-ML-Lab/Label-free-CBM. Finally, in Appendix B we conduct a large scale user evaluation of the interpretability of our method.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 14 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Bridging Vision and Language Concepts through Optimal Transport Semantic Flow
cs.CV 2026-06 unverdicted novelty 7.0

OTF-CBM replaces static cosine similarity in vision-language CBMs with data-driven optimal transport flow to improve concept alignment, accuracy, and faithfulness.
Concept Flow Models: Anchoring Concept-Based Reasoning with Hierarchical Bottlenecks
cs.LG 2026-06 unverdicted novelty 7.0

Concept Flow Models use hierarchical concept-driven decision trees to mitigate information leakage in concept bottleneck models while matching their predictive performance.
OceanCBM: A Concept Bottleneck Model for Mechanistic Interpretability in Ocean Forecasting
cs.LG 2026-05 unverdicted novelty 7.0

OceanCBM is the first concept bottleneck model for spatiotemporal ocean prediction that uses mixed supervision on physical concepts and a free concept to deliver consistent mechanistic representations for mixed layer ...
Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models
cs.LG 2026-05 unverdicted novelty 7.0

Concept-based abductive and contrastive explanations find minimal high-level concepts that causally determine vision model outcomes on individual images or groups sharing a specified behavior.
Concept Inconsistency in Dermoscopic Concept Bottleneck Models: A Rough-Set Analysis of the Derm7pt Dataset
cs.LG 2026-04 conditional novelty 7.0

Rough-set analysis finds 16.4% of 305 concept profiles in Derm7pt inconsistent (306 images), capping hard CBM accuracy at 92.1%; symmetric filtering produces a 705-image consistent benchmark where EfficientNet-B5 reac...
Bridging Expert Knowledge and Automated Feature Engineering via Self-Evolution
cs.AI 2026-06 unverdicted novelty 6.0

FEST uses self-evolving trees to produce expert-aligned, auditable features from unstructured data and outperforms baselines on brand, authenticity, and stress tasks while releasing the BrandGuide dataset.
scCBGM: Interpretable Single-Cell Counterfactual Editing
cs.LG 2026-06 unverdicted novelty 6.0

scCBGM adapts concept bottleneck generative models with skip connections and cross-covariance penalties for single-cell data, enabling interpretable counterfactual editing and showing superior combinatorial generaliza...
Measuring What Matters: Synthetic Benchmarks for Concept Bottleneck Models
cs.LG 2026-06 unverdicted novelty 6.0

Introduces synthetic benchmarks for concept bottleneck models that control data modality, concept choice, annotation quality, and completeness to evaluate performance in decision support and automation.
Learning Label-Efficient Interpretable Medical Image Diagnosis via Semi-supervised Hypergraph Concept Bottleneck Model
cs.CV 2026-06 unverdicted novelty 6.0

A new semi-supervised hypergraph Concept Bottleneck Model framework improves label efficiency and interpretability for medical image diagnosis on PAS ultrasound, breast ultrasound, and SkinCon datasets.
Understanding Annotator Safety Policy with Interpretability
cs.AI 2026-05 unverdicted novelty 6.0

Annotator Policy Models learn safety policies from labeling behavior alone, accurately predicting responses and revealing sources of disagreement like policy ambiguity and value pluralism.
Boosting Ultrasound Image Classification via Attribute-Guided Dual-Branch Framework
cs.CV 2026-07 conditional novelty 5.0

An attribute-guided dual-branch framework fuses a standard classifier with an interpretable attribute-prior branch to boost ultrasound classification accuracy and explainability.
3D-CBM: A Framework for Concept-Based Interpretability in Generative 3D Modeling
cs.CV 2026-06 unverdicted novelty 5.0

Introduces 3D-CBM framework mapping raw 3D inputs to multi-tiered interpretable concepts, achieving 88.8% concept accuracy and test-time intervention on PartNet and ShapeNet.
A Composite Activation Function for Learning Stable Binary Representations
cs.LG 2026-05 unverdicted novelty 5.0

HTAF is a sigmoid-tanh composite that approximates the Heaviside function to allow stable gradient training of binary activation networks, yielding ICBMs with stable discretization and competitive performance on image tasks.
Formal Concept Lattices are Good Semantic Scaffolds for Concept-Based Learning
cs.CV 2026-06 unverdicted novelty 4.0

Formal concept lattices guide staged, hierarchical concept learning in deep networks to produce more interpretable and semantically structured representations.