Caption Bottleneck Models

Emre Akbas; Merve Tapli; Seref Baris Cagliyan; Umut Ozdemir

arxiv: 2607.00578 · v1 · pith:T3OMLUOYnew · submitted 2026-07-01 · 💻 cs.CV

Caption Bottleneck Models

Seref Baris Cagliyan , Umut Ozdemir , Merve Tapli , Emre Akbas This is my paper

Pith reviewed 2026-07-02 14:55 UTC · model grok-4.3

classification 💻 cs.CV

keywords Caption Bottleneck ModelsConcept Bottleneck ModelsInterpretabilityLarge Multimodal ModelsImage ClassificationLeakage PreventionConcept Discovery

0 comments

The pith

Caption Bottleneck Models train classifiers only on LMM captions to ensure leakage-free interpretability.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Caption Bottleneck Models as an alternative to traditional Concept Bottleneck Models. Instead of relying on predefined lists of concepts, CaBM generates free-form natural language captions for each image using large multimodal models. A text-based classifier is then trained exclusively on these captions. This design prevents information leakage from visual features that bypass the bottleneck. The approach also enables the automatic discovery of relevant concepts by examining the trained text classifier.

Core claim

By representing images via LMM-generated captions and training a classifier strictly on this text, CaBM ensures a leakage-free architecture by construction. Additionally, by analyzing the text classifier post-training, CaBM autonomously discovers high-quality, dataset-specific concepts.

What carries the argument

The caption bottleneck, which converts images to LMM-generated natural language descriptions before any classification occurs.

If this is right

CaBM achieves competitive accuracy on fine-grained and coarse-grained image classification benchmarks.
The architecture is leakage-free because predictions depend only on the caption text.
Dataset-specific concepts are discovered without expert annotations or static dictionaries.
Interpretability is maintained through the use of human-readable natural language.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If LMM captions systematically omit certain visual cues, accuracy on tasks relying on those cues would suffer.
This method could extend to other vision tasks if suitable text-based models are available.
The discovered concepts might expose biases present in the underlying LMM caption generator.

Load-bearing premise

LMM-generated captions contain all the visual information necessary for accurate classification without missing details or introducing errors.

What would settle it

An experiment where CaBM accuracy drops significantly below a direct vision model on a dataset known to require visual features not easily captured in text captions.

Figures

Figures reproduced from arXiv: 2607.00578 by Emre Akbas, Merve Tapli, Seref Baris Cagliyan, Umut Ozdemir.

**Figure 1.** Figure 1: CaBM performs recognition through a caption bottleneck: a frozen LMM produces captions, a text classifier predicts labels from these captions, and openvocabulary concepts are extracted via post-hoc analysis. Concept Bottleneck Models (CBMs) provide a principled approach for interpretable by design predictors: they first map an input to a set of humaninterpretable concepts, and then predict the label fro… view at source ↗

**Figure 2.** Figure 2: Overview of CaBM. (1) A frozen LMM generates K diverse captions per image using a structured prompt P and multiple decoding temperatures; captions are deterministically censored by τ to remove class names and taxonomy terms. (2) A text encoder predicts per-caption logits, which are averaged for image-level inference. (3) Concepts are obtained post-hoc by proposing salient spans (gradient×embedding), scorin… view at source ↗

**Figure 3.** Figure 3: Qualitative concept comparison across methods on CUB200. [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗

**Figure 4.** Figure 4: Concept importance across images. For four datasets, we select one random class and sample nine random test images. For each class, we show the top-5 concepts and their importance scores (higher indicates stronger influence under the ranking criterion in Sec. 3.3) [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗

**Figure 5.** Figure 5: Human intervention example [PITH_FULL_IMAGE:figures/full_fig_p013_5.png] view at source ↗

read the original abstract

Concept Bottleneck Models (CBMs) provide interpretability by routing predictions through a layer of human-understandable concepts. However, defining an optimal concept set for a specific dataset remains an open challenge. Existing approaches rely on expensive expert annotations or LLM-generated lists based solely on class names. Even "open-vocabulary" variants typically depend on static concept sets, which restrict discovery and introduce label bias. Furthermore, traditional CBMs often suffer from information leakage, where unmodeled visual features bypass the bottleneck and compromise the integrity of the explanations. To overcome these limitations, we propose Caption Bottleneck Models (CaBM), a framework that circumvents the need for predefined concept sets by replacing rigid concept layers with free-form natural language. By representing images via LMM-generated captions and training a classifier strictly on this text, CaBM ensures a leakage-free architecture by construction. Additionally, by analyzing the text classifier post-training, CaBM autonomously discovers high-quality, dataset-specific concepts. Our results across fine- and coarse-grained benchmarks demonstrate that CaBM achieves competitive accuracy while preserving interpretability without the constraints of external dictionaries or manual labeling.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

CaBM replaces fixed concept sets with LMM captions so the classifier sees only text, which blocks leakage by construction while letting the model surface its own concepts from the trained text layer.

read the letter

The main move is straightforward: generate free-form captions from an LMM for each image, then train the downstream classifier exclusively on that text. No image features reach the final layer, so the usual leakage path in CBMs is closed at the architecture level.

This is new relative to the cited CBM work. Prior methods still needed a static list of concepts, whether expert-defined or LLM-generated from class names. Here the concept layer is replaced by natural language, and the paper claims the model can later extract useful dataset-specific concepts by inspecting the trained text classifier. If the numbers back it up, that removes the annotation burden and the risk of missing relevant concepts.

The results are described as competitive on both fine- and coarse-grained benchmarks, which is the part worth checking. The cleanest part of the argument is the leakage guarantee; it follows directly from the input restriction rather than from any fitting trick.

The soft spot is the reliance on caption quality. If the LMM systematically drops task-critical details or introduces hallucinations, accuracy will suffer even if leakage is prevented. The abstract does not show ablations on caption model choice or error analysis on what information is lost, so it is hard to judge how often this happens in practice. The post-training concept discovery step also needs concrete examples to show it is reproducible and not just post-hoc interpretation.

This is for groups already working on concept-based or interpretable vision models who want to avoid hand-crafted dictionaries. It is worth sending to peer review because the architectural fix is simple and the leakage claim is tight, even though the empirical validation will need close scrutiny on the numbers and failure cases.

Referee Report

1 major / 1 minor

Summary. The paper proposes Caption Bottleneck Models (CaBM) to address limitations in Concept Bottleneck Models (CBMs), including the difficulty of defining optimal concept sets and information leakage. CaBM replaces rigid concept layers with free-form natural language captions generated by Large Multimodal Models (LMMs); a downstream classifier is trained exclusively on this text, which the authors state ensures a leakage-free architecture by construction. The method further claims to enable autonomous discovery of high-quality, dataset-specific concepts via post-training analysis of the text classifier. Results on fine- and coarse-grained benchmarks are stated to show competitive accuracy while preserving interpretability without external dictionaries or manual labeling.

Significance. If the empirical claims hold, the work would be significant for interpretable vision models: the architectural choice directly guarantees leakage prevention (a known weakness of standard CBMs) without circular fitting or additional parameters, and the post-hoc concept discovery reduces dependence on expert annotations or static lists. This construction-based guarantee and the potential for data-driven concept extraction are clear strengths relative to prior CBM variants.

major comments (1)

[Abstract] Abstract: the central empirical claim that CaBM 'achieves competitive accuracy' on fine- and coarse-grained benchmarks is stated without any quantitative numbers, tables, ablation results, or error analysis. Because accuracy is required to establish that caption completeness does not systematically degrade performance, this omission is load-bearing for the overall contribution.

minor comments (1)

The description of how the trained text classifier is analyzed to 'autonomously discover' concepts lacks concrete details on the extraction procedure (e.g., attention weights, feature importance, or clustering method).

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback and positive recommendation for minor revision. We address the major comment below.

read point-by-point responses

Referee: [Abstract] Abstract: the central empirical claim that CaBM 'achieves competitive accuracy' on fine- and coarse-grained benchmarks is stated without any quantitative numbers, tables, ablation results, or error analysis. Because accuracy is required to establish that caption completeness does not systematically degrade performance, this omission is load-bearing for the overall contribution.

Authors: We agree that the abstract would be strengthened by including specific quantitative results. While the full manuscript contains detailed accuracy tables, ablations, and comparisons in the experiments section, the abstract currently states the claim at a high level. In the revised version we will update the abstract to report the key accuracy numbers (and brief baseline comparisons) from the fine- and coarse-grained benchmarks to directly support the claim that caption completeness does not systematically degrade performance. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper's central claim is that CaBM ensures leakage-free architecture by construction through an explicit architectural choice: mapping images to LMM captions and training a text-only classifier with no visual feature access. This is a definitional property of the proposed setup rather than a derivation that reduces to fitted parameters, self-citations, or ansatzes. No equations, uniqueness theorems, or load-bearing steps are shown that would make any prediction equivalent to its inputs by construction. The approach is self-contained as a modeling framework without circular reductions.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only; no explicit free parameters, axioms, or invented entities are stated. The approach implicitly assumes caption sufficiency and LMM reliability without providing independent evidence for either.

axioms (1)

domain assumption LMM-generated captions contain all information necessary for the downstream classification task.
Invoked by the claim that training strictly on text yields competitive accuracy without leakage.

pith-pipeline@v0.9.1-grok · 5731 in / 1116 out tokens · 20022 ms · 2026-07-02T14:55:44.325661+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

33 extracted references · 6 canonical work pages · 4 internal anchors

[1]

In: International Conference on Learning Representations (2018)

Ancona, M., Ceolini, E., Öztireli, C., Gross, M.: Towards better understanding of gradient-based attribution methods for deep neural networks. In: International Conference on Learning Representations (2018)

2018
[2]

Qwen3-VL Technical Report

Bai, S., Cai, Y., Chen, R., Chen, K., Chen, X., Cheng, Z., Deng, L., Ding, W., Gao, C., Ge, C., et al.: Qwen3-vl technical report. arXiv preprint arXiv:2511.21631 (2025)

work page internal anchor Pith review Pith/arXiv arXiv 2025
[3]

In: European Conference on Computer Vision

Bossard, L., Guillaumin, M., Van Gool, L.: Food-101 – mining discriminative com- ponents with random forests. In: European Conference on Computer Vision. pp. 446–461 (2014)

2014
[4]

In: AAAI Conference on Artificial Intelligence

Chauhan, K., Tiwari, R., Freyberg, J., Shenoy, P., Dvijotham, K.: Interactive con- cept bottleneck models. In: AAAI Conference on Artificial Intelligence. pp. 5948– 5955 (2023)

2023
[5]

In: IEEE Conference on Computer Vision and Pattern Recognition

Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: A large- scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 248–255 (2009)

2009
[6]

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018
[7]

Havasi, M., Parbhoo, S., Doshi-Velez, F.: Addressing leakage in concept bottleneck models.In:AdvancesinNeuralInformationProcessingSystems.vol.35,pp.23386– 23397 (2022)

2022
[8]

In: International Conference on Machine Learning

Koh, P.W., Nguyen, T., Tang, Y.S., Mussmann, S., Pierson, E., Kim, B., Liang, P.: Concept bottleneck models. In: International Conference on Machine Learning. pp. 5338–5348 (2020)

2020
[9]

University of Toronto (2009)

Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. University of Toronto (2009)

2009
[10]

Understanding Neural Networks through Representation Erasure

Li, J., Monroe, W., Jurafsky, D.: Understanding neural networks through repre- sentation erasure. arXiv preprint arXiv:1612.08220 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016
[11]

In: IEEE Conference on Computer Vision and Pattern Recognition

Liu, Y., Zhang, T., Gu, S.: Hybrid concept bottleneck models. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 20179–20189 (2025)

2025
[12]

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., Stoyanov, V.: RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019)

work page internal anchor Pith review Pith/arXiv arXiv 1907
[13]

Promises and pitfalls of black-box concept learning models.arXiv preprint arXiv:2106.13314,

Mahinpei, A., Clark, J., Lage, I., Doshi-Velez, F., Pan, W.: Promises and pitfalls of black-box concept learning models. arXiv preprint arXiv:2106.13314 (2021) 16 B. Cagliyan et al

work page arXiv 2021
[14]

Margeloiu, A., Ashman, M., Bhatt, U., Chen, Y., Jamnik, M., Weller, A.: Do con- cept bottleneck models learn as intended? arXiv preprint arXiv:2105.04289 (2021)

work page arXiv 2021
[15]

McInnes, L., Healy, J., Astels, S., et al.: hdbscan: Hierarchical density based clus- tering. J. Open Source Softw.2(11), 205 (2017)

2017
[16]

In: Indian Conference on Computer Vision, Graphics & Image Processing

Nilsback, M., Zisserman, A.: Automated flower classification over a large number of classes. In: Indian Conference on Computer Vision, Graphics & Image Processing. pp. 722–729 (2008)

2008
[17]

In: International Conference on Learning Representations (2023)

Oikarinen, T., Das, S., Nguyen, L.M., Weng, T.W.: Label-free concept bottleneck models. In: International Conference on Learning Representations (2023)

2023
[18]

In: IEEE Conference on Computer Vision and Pattern Recognition

Ozdemir, O., Christensen, A., Alaniz, S., Akata, Z., Akbas, E.: Explaining CLIP zero-shot predictions through concepts. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 31336–31345 (2026)

2026
[19]

In: Advances in Neural Information Processing Systems (2025)

Park, S., Mun, J., Oh, D., Lee, N.: An analysis of concept bottleneck models: Measuring, understanding, and mitigating the impact of noisy annotations. In: Advances in Neural Information Processing Systems (2025)

2025
[20]

In: International Conference on Machine Learning (2025)

Prasse, K., Knab, P., Marton, S., Bartelt, C., Keuper, M.: Dcbm: Data-efficient vi- sual concept bottleneck models. In: International Conference on Machine Learning (2025)

2025
[21]

In: International Conference on Machine Learning

Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning. pp. 8748–8763 (2021)

2021
[22]

In: European Conference on Computer Vision

Rao, S., Mahajan, S., Böhle, M., Schiele, B.: Discover-then-name: Task-agnostic concept bottlenecks via automated concept discovery. In: European Conference on Computer Vision. pp. 444–461 (2024)

2024
[23]

In: Advances in Neural Information Processing Systems

Srivastava, D., Yan, G., Weng, T.W.: Vlg-cbm: Training concept bottleneck mod- els with vision-language guidance. In: Advances in Neural Information Processing Systems. vol. 37, pp. 79057–79094 (2024)

2024
[24]

In: Advances in Neural Information Processing Systems (2025)

Steinmann, D., Stammer, W., Wüst, A., Kersting, K.: Object-centric concept- bottlenecks. In: Advances in Neural Information Processing Systems (2025)

2025
[25]

In: European Conference on Computer Vision

Tan, A., Zhou, F., Chen, H.: Explain via any concept: Concept bottleneck model with open vocabulary concepts. In: European Conference on Computer Vision. pp. 123–138 (2024)

2024
[26]

In: IEEE Conference on Computer Vision and Pattern Recognition (2026)

Tapli, M., Bouniot, Q., Stammer, W., Akata, Z., Akbas, E.: Rethinking concept bottleneck models: From pitfalls to solutions. In: IEEE Conference on Computer Vision and Pattern Recognition (2026)

2026
[27]

California Institute of Technology (2011)

Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The Caltech-UCSD birds-200-2011 dataset. California Institute of Technology (2011)

2011
[28]

In: AAAI Conference on Artificial Intelligence

Yamaguchi, S., Nishida, K.: Explanation bottleneck models. In: AAAI Conference on Artificial Intelligence. pp. 21886–21894 (2025)

2025
[29]

In: International Conference on Computer Vision

Yan, A., Wang, Y., Zhong, Y., Dong, C., He, Z., Lu, Y., Wang, W.Y., Shang, J., McAuley, J.: Learning concise and descriptive attributes for visual recognition. In: International Conference on Computer Vision. pp. 3090–3100 (2023)

2023
[30]

In: IEEE Conference on Computer Vision and Pattern Recog- nition

Yang, Y., Panagopoulou, A., Zhou, S., Jin, D., Callison-Burch, C., Yatskar, M.: Language in a bottle: Language model guided concept bottlenecks for interpretable image classification. In: IEEE Conference on Computer Vision and Pattern Recog- nition. pp. 19187–19197 (2023)

2023
[31]

In: IEEE Conference on Computer Vision and Pattern Recognition

Yu, L., Han, H., Tao, Z., Yao, H., Xu, C.: Language guided concept bottleneck models for interpretable continual learning. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 14976–14986 (2025) Caption Bottleneck Models 17

2025
[32]

In: In- ternational Conference on Learning Representations (2023)

Yuksekgonul, M., Wang, M., Zou, J.: Post-hoc concept bottleneck models. In: In- ternational Conference on Learning Representations (2023)

2023
[33]

In: AAAI Conference on Artificial Intelligence (2026)

Zhao, D., Huang, Q., Yan, D., Sun, Y., Yu, J.: Partially shared concept bottleneck models. In: AAAI Conference on Artificial Intelligence (2026)

2026

[1] [1]

In: International Conference on Learning Representations (2018)

Ancona, M., Ceolini, E., Öztireli, C., Gross, M.: Towards better understanding of gradient-based attribution methods for deep neural networks. In: International Conference on Learning Representations (2018)

2018

[2] [2]

Qwen3-VL Technical Report

Bai, S., Cai, Y., Chen, R., Chen, K., Chen, X., Cheng, Z., Deng, L., Ding, W., Gao, C., Ge, C., et al.: Qwen3-vl technical report. arXiv preprint arXiv:2511.21631 (2025)

work page internal anchor Pith review Pith/arXiv arXiv 2025

[3] [3]

In: European Conference on Computer Vision

Bossard, L., Guillaumin, M., Van Gool, L.: Food-101 – mining discriminative com- ponents with random forests. In: European Conference on Computer Vision. pp. 446–461 (2014)

2014

[4] [4]

In: AAAI Conference on Artificial Intelligence

Chauhan, K., Tiwari, R., Freyberg, J., Shenoy, P., Dvijotham, K.: Interactive con- cept bottleneck models. In: AAAI Conference on Artificial Intelligence. pp. 5948– 5955 (2023)

2023

[5] [5]

In: IEEE Conference on Computer Vision and Pattern Recognition

Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: A large- scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 248–255 (2009)

2009

[6] [6]

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018

[7] [7]

Havasi, M., Parbhoo, S., Doshi-Velez, F.: Addressing leakage in concept bottleneck models.In:AdvancesinNeuralInformationProcessingSystems.vol.35,pp.23386– 23397 (2022)

2022

[8] [8]

In: International Conference on Machine Learning

Koh, P.W., Nguyen, T., Tang, Y.S., Mussmann, S., Pierson, E., Kim, B., Liang, P.: Concept bottleneck models. In: International Conference on Machine Learning. pp. 5338–5348 (2020)

2020

[9] [9]

University of Toronto (2009)

Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. University of Toronto (2009)

2009

[10] [10]

Understanding Neural Networks through Representation Erasure

Li, J., Monroe, W., Jurafsky, D.: Understanding neural networks through repre- sentation erasure. arXiv preprint arXiv:1612.08220 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016

[11] [11]

In: IEEE Conference on Computer Vision and Pattern Recognition

Liu, Y., Zhang, T., Gu, S.: Hybrid concept bottleneck models. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 20179–20189 (2025)

2025

[12] [12]

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., Stoyanov, V.: RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019)

work page internal anchor Pith review Pith/arXiv arXiv 1907

[13] [13]

Promises and pitfalls of black-box concept learning models.arXiv preprint arXiv:2106.13314,

Mahinpei, A., Clark, J., Lage, I., Doshi-Velez, F., Pan, W.: Promises and pitfalls of black-box concept learning models. arXiv preprint arXiv:2106.13314 (2021) 16 B. Cagliyan et al

work page arXiv 2021

[14] [14]

Margeloiu, A., Ashman, M., Bhatt, U., Chen, Y., Jamnik, M., Weller, A.: Do con- cept bottleneck models learn as intended? arXiv preprint arXiv:2105.04289 (2021)

work page arXiv 2021

[15] [15]

McInnes, L., Healy, J., Astels, S., et al.: hdbscan: Hierarchical density based clus- tering. J. Open Source Softw.2(11), 205 (2017)

2017

[16] [16]

In: Indian Conference on Computer Vision, Graphics & Image Processing

Nilsback, M., Zisserman, A.: Automated flower classification over a large number of classes. In: Indian Conference on Computer Vision, Graphics & Image Processing. pp. 722–729 (2008)

2008

[17] [17]

In: International Conference on Learning Representations (2023)

Oikarinen, T., Das, S., Nguyen, L.M., Weng, T.W.: Label-free concept bottleneck models. In: International Conference on Learning Representations (2023)

2023

[18] [18]

In: IEEE Conference on Computer Vision and Pattern Recognition

Ozdemir, O., Christensen, A., Alaniz, S., Akata, Z., Akbas, E.: Explaining CLIP zero-shot predictions through concepts. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 31336–31345 (2026)

2026

[19] [19]

In: Advances in Neural Information Processing Systems (2025)

Park, S., Mun, J., Oh, D., Lee, N.: An analysis of concept bottleneck models: Measuring, understanding, and mitigating the impact of noisy annotations. In: Advances in Neural Information Processing Systems (2025)

2025

[20] [20]

In: International Conference on Machine Learning (2025)

Prasse, K., Knab, P., Marton, S., Bartelt, C., Keuper, M.: Dcbm: Data-efficient vi- sual concept bottleneck models. In: International Conference on Machine Learning (2025)

2025

[21] [21]

In: International Conference on Machine Learning

Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning. pp. 8748–8763 (2021)

2021

[22] [22]

In: European Conference on Computer Vision

Rao, S., Mahajan, S., Böhle, M., Schiele, B.: Discover-then-name: Task-agnostic concept bottlenecks via automated concept discovery. In: European Conference on Computer Vision. pp. 444–461 (2024)

2024

[23] [23]

In: Advances in Neural Information Processing Systems

Srivastava, D., Yan, G., Weng, T.W.: Vlg-cbm: Training concept bottleneck mod- els with vision-language guidance. In: Advances in Neural Information Processing Systems. vol. 37, pp. 79057–79094 (2024)

2024

[24] [24]

In: Advances in Neural Information Processing Systems (2025)

Steinmann, D., Stammer, W., Wüst, A., Kersting, K.: Object-centric concept- bottlenecks. In: Advances in Neural Information Processing Systems (2025)

2025

[25] [25]

In: European Conference on Computer Vision

Tan, A., Zhou, F., Chen, H.: Explain via any concept: Concept bottleneck model with open vocabulary concepts. In: European Conference on Computer Vision. pp. 123–138 (2024)

2024

[26] [26]

In: IEEE Conference on Computer Vision and Pattern Recognition (2026)

Tapli, M., Bouniot, Q., Stammer, W., Akata, Z., Akbas, E.: Rethinking concept bottleneck models: From pitfalls to solutions. In: IEEE Conference on Computer Vision and Pattern Recognition (2026)

2026

[27] [27]

California Institute of Technology (2011)

Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The Caltech-UCSD birds-200-2011 dataset. California Institute of Technology (2011)

2011

[28] [28]

In: AAAI Conference on Artificial Intelligence

Yamaguchi, S., Nishida, K.: Explanation bottleneck models. In: AAAI Conference on Artificial Intelligence. pp. 21886–21894 (2025)

2025

[29] [29]

In: International Conference on Computer Vision

Yan, A., Wang, Y., Zhong, Y., Dong, C., He, Z., Lu, Y., Wang, W.Y., Shang, J., McAuley, J.: Learning concise and descriptive attributes for visual recognition. In: International Conference on Computer Vision. pp. 3090–3100 (2023)

2023

[30] [30]

In: IEEE Conference on Computer Vision and Pattern Recog- nition

Yang, Y., Panagopoulou, A., Zhou, S., Jin, D., Callison-Burch, C., Yatskar, M.: Language in a bottle: Language model guided concept bottlenecks for interpretable image classification. In: IEEE Conference on Computer Vision and Pattern Recog- nition. pp. 19187–19197 (2023)

2023

[31] [31]

In: IEEE Conference on Computer Vision and Pattern Recognition

Yu, L., Han, H., Tao, Z., Yao, H., Xu, C.: Language guided concept bottleneck models for interpretable continual learning. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 14976–14986 (2025) Caption Bottleneck Models 17

2025

[32] [32]

In: In- ternational Conference on Learning Representations (2023)

Yuksekgonul, M., Wang, M., Zou, J.: Post-hoc concept bottleneck models. In: In- ternational Conference on Learning Representations (2023)

2023

[33] [33]

In: AAAI Conference on Artificial Intelligence (2026)

Zhao, D., Huang, Q., Yan, D., Sun, Y., Yu, J.: Partially shared concept bottleneck models. In: AAAI Conference on Artificial Intelligence (2026)

2026