CARE: Class-Adaptive Expert Consensus for Reliable Learning with Long-Tailed Noisy Labels
Pith reviewed 2026-05-25 04:41 UTC · model grok-4.3
The pith
Class-adaptive consensus across noisy labels, text embeddings and visual features rectifies long-tailed noisy labels more reliably.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
CARE uses a class-adaptive expert consensus mechanism that requires higher agreement among the three sources for tail classes and allows more flexibility for head classes, then aggregates high-confidence predictions to filter noise and recalibrate class distributions.
What carries the argument
The class-adaptive expert consensus mechanism that sets agreement strictness based on class frequency to combine predictions from observed noisy labels, VLM text embeddings, and visual features.
If this is right
- Improved correction for tail classes without over-regularizing head classes.
- Consistent outperformance on synthetic and real-world long-tailed noisy benchmarks.
- Up to 3.0% performance gains over prior methods.
- Parameter-efficient framework suitable for practical use.
Where Pith is reading between the lines
- If the three sources prove complementary in more settings, the approach could extend to other noisy label scenarios beyond vision.
- Further gains might come from incorporating additional expert sources like audio or text-only data.
- Testing on extremely imbalanced distributions could reveal limits of the frequency-based adaptation.
Load-bearing premise
The three supervision sources provide complementary information that can be weighted adaptively by class frequency without introducing additional errors or biases.
What would settle it
Experiments on a dataset where VLM embeddings are corrupted in a way that aligns with the label noise, showing whether the consensus still improves or degrades performance.
Figures
read the original abstract
Learning from real-world data is frequently hindered by the compound challenge of long-tailed class distributions and noisy annotations. Existing methods partially address these issues but typically ignore the non-uniform impact of label noise across classes, resulting in ineffective correction for tail classes and over-regularization for head classes. To address this issue, we propose Class-Adaptive Rectification with Experts (CARE), a parameter-efficient framework that leverages three complementary supervision sources from vision-language models (VLM): observed noisy labels, VLM text embeddings, and visual features. CARE introduces a class-adaptive expert consensus mechanism that enforces stricter agreement for tail classes and more permissive agreement for head classes based on class frequency. By aggregating high-confidence predictions across these sources, CARE filters unreliable signals and recalibrates class distributions, yielding more reliable rectification under long-tailed distributions. Extensive experiments on both synthetic and real-world benchmarks demonstrate that CARE consistently outperforms state-of-the-art methods, achieving up to 3.0\% performance gains. The source code is available at https://github.com/qwq123-study/CARE.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents CARE, a framework for reliable learning with long-tailed noisy labels. It aggregates high-confidence predictions from observed noisy labels, VLM text embeddings, and visual features using a class-adaptive expert consensus that enforces stricter agreement for tail classes based on class frequency. The method is claimed to filter unreliable signals and recalibrate class distributions, with experiments showing up to 3.0% gains over state-of-the-art on synthetic and real-world benchmarks.
Significance. Should the results prove robust, this work could contribute to handling compound challenges in real-world data by adaptively using VLM sources. The parameter-efficient nature and code release are strengths. However, the small reported gains require strong evidence to establish significance.
major comments (2)
- [Abstract] The central performance claim of up to 3.0% gains lacks any details on baselines, statistical tests, error bars, or data splits. This is load-bearing for assessing the effectiveness of the proposed class-adaptive mechanism.
- [Abstract] The assumption that VLM sources provide complementary reliable supervision for tail classes without introducing biases is not supported by any mentioned per-class validation or ablation on the frequency-based thresholds versus uniform ones. This directly impacts the validity of the recalibration for long-tailed distributions.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address each major comment below and outline the revisions we will incorporate.
read point-by-point responses
-
Referee: [Abstract] The central performance claim of up to 3.0% gains lacks any details on baselines, statistical tests, error bars, or data splits. This is load-bearing for assessing the effectiveness of the proposed class-adaptive mechanism.
Authors: We agree the abstract would benefit from added context. The full manuscript reports results against multiple state-of-the-art baselines across synthetic and real-world long-tailed noisy datasets, with error bars from repeated runs, statistical significance tests, and explicit data-split details in Sections 4 and 5. We will revise the abstract to name the primary baselines and note that these supporting statistics appear in the experiments. revision: yes
-
Referee: [Abstract] The assumption that VLM sources provide complementary reliable supervision for tail classes without introducing biases is not supported by any mentioned per-class validation or ablation on the frequency-based thresholds versus uniform ones. This directly impacts the validity of the recalibration for long-tailed distributions.
Authors: The manuscript contains per-class performance breakdowns and an ablation comparing class-frequency-based thresholds against uniform thresholds, showing gains concentrated on tail classes with no evidence of introduced bias. These appear in the experimental analysis. We will revise the abstract to reference these validations and the frequency-based ablation to better substantiate the assumption. revision: yes
Circularity Check
No derivation chain; empirical method evaluated on external benchmarks
full rationale
The paper describes an empirical framework (CARE) that aggregates signals from observed labels, VLM text embeddings, and visual features via a class-adaptive consensus rule modulated by class frequency. No equations, fitted parameters renamed as predictions, self-definitional loops, or load-bearing self-citations appear in the provided text. The method is validated through experiments on synthetic and real-world benchmarks, making the central claims falsifiable outside any internal construction. This matches the default expectation of no significant circularity.
Axiom & Free-Parameter Ledger
free parameters (1)
- class-adaptive agreement thresholds
axioms (1)
- domain assumption VLM text embeddings and visual features supply reliable complementary supervision for label rectification
Reference graph
Works this paper leans on
-
[1]
Li, Yuhang and Li, Zhuying and Jia, Yuheng , booktitle=ICCV, title=. 2025 , pages=
work page 2025
-
[2]
Zhe Zhao and Haibin Wen and Pengkun Wang and Shuang Wang and Zhenkun Wang and Qingfu Zhang and Yang Wang , title =
-
[3]
Zhe Zhao and Pengkun Wang and Haibin Wen and Wei Xu and Song Lai and Qingfu Zhang and Yang Wang , title =
-
[4]
Pengkun Wang and Zhe Zhao and Haibin Wen and Fanfu Wang and Binwu Wang and Qingfu Zhang and Yang Wang , title =
-
[5]
Binwu Wang and Pengkun Wang and Wei Xu and Xu Wang and Yudong Zhang and Kun Wang and Yang Wang , title =
-
[6]
Junnan Li and Dongxu Li and Caiming Xiong and Steven C. H. Hoi , title =
-
[7]
Ilya O. Tolstikhin and Neil Houlsby and Alexander Kolesnikov and Lucas Beyer and Xiaohua Zhai and Thomas Unterthiner and Jessica Yung and Andreas Steiner and Daniel Keysers and Jakob Uszkoreit and Mario Lucic and Alexey Dosovitskiy , title =
-
[8]
Jeffrey Pennington and Richard Socher and Christopher D. Manning , title =
-
[9]
Robust training with ensemble consensus , booktitle = ICLR, year =
Jisoo Lee and Sae. Robust training with ensemble consensus , booktitle = ICLR, year =
-
[10]
Lukas Hoyer and Dengxin Dai and Luc Van Gool , title =
-
[11]
Class imbalance in out-of-distribution datasets: Improving the robustness of the TextCNN for the classification of rare cancer types , journal = JBI, volume =. 2022 , author =
work page 2022
-
[12]
O'Connor and Kevin McGuinness , title =
Paul Albert and Eric Arazo and Tarun Krishna and Noel E. O'Connor and Kevin McGuinness , title =
-
[13]
Lei Feng and Bo An , title =
-
[14]
Adaptive Graph Guided Disambiguation for Partial Label Learning , journal = TPAMI, volume =
Deng. Adaptive Graph Guided Disambiguation for Partial Label Learning , journal = TPAMI, volume =
-
[15]
Xinyuan Ji and Zhaowei Zhu and Wei Xi and Olga Gadyatskaya and Zilong Song and Yong Cai and Yang Liu , title =
- [16]
-
[17]
Lefan Zhang and Zhang. Learning from Long-Tailed Noisy Data with Sample Selection and Balanced Loss , booktitle = IJCAI, pages =
-
[18]
Junnan Li and Yongkang Wong and Qi Zhao and Mohan S. Kankanhalli , title =
-
[19]
Yisen Wang and Xingjun Ma and Zaiyi Chen and Yuan Luo and Jinfeng Yi and James Bailey , title =
-
[20]
2020 , booktitle = ICML, volume =
Zheng, Songzhu and Wu, Pengxiang and Goswami, Aman and Goswami, Mayank and Metaxas, Dimitris and Chen, Chao , title =. 2020 , booktitle = ICML, volume =
work page 2020
-
[21]
Probabilistic End-To-End Noise Correction for Learning With Noisy Labels , author=. 2019 , pages=
work page 2019
- [22]
-
[23]
Tanaka, Daiki and Ikami, Daiki and Yamasaki, Toshihiko and Aizawa, Kiyoharu , booktitle=CVPR, title=. 2018 , pages=
work page 2018
-
[24]
Dimensionality-Driven Learning with Noisy Labels , author =. 2018 , volume =
work page 2018
-
[25]
Kuang. CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise , booktitle = CVPR, pages =
-
[26]
Mingcai Chen and Yuntao Du and Wenyu Jiang and Baoming Zhang and Shuai Feng and Yi Xin and Chongjun Wang , title =
-
[27]
Mengmeng Sheng and Zeren Sun and Zhenhuang Cai and Tao Chen and Yichao Zhou and Yazhou Yao , title =
-
[28]
DINOv2: Learning Robust Visual Features without Supervision , journal = TMLR, volume =
Maxime Oquab and Timoth. DINOv2: Learning Robust Visual Features without Supervision , journal = TMLR, volume =
-
[29]
Zhi-Fan Wu and Tong Wei and Jianwen Jiang and Chaojie Mao and Mingqian Tang and Yufeng Li , title =
-
[30]
Junnan Li and Caiming Xiong and Steven C. H. Hoi , title =
-
[31]
Li, Junnan and Xiong, Caiming and Hoi, Steven C.H. , booktitle=ICCV, title=. 2021 , pages=
work page 2021
-
[32]
Yikai Zhang and Songzhu Zheng and Pengxiang Wu and Mayank Goswami and Chao Chen , title =
-
[33]
Early-Learning Regularization Prevents Memorization of Noisy Labels , volume =
Liu, Sheng and Niles-Weed, Jonathan and Razavian, Narges and Fernandez-Granda, Carlos , booktitle = NIPS, pages =. Early-Learning Regularization Prevents Memorization of Noisy Labels , volume =
-
[34]
Understanding and Utilizing Deep Neural Networks Trained with Noisy Labels , author=. 2019 , pages=
work page 2019
-
[35]
Lu Jiang and Di Huang and Mason Liu and Weilong Yang , title =
-
[36]
Influence-balanced loss for imbalanced visual classification , author=
-
[37]
Selective-supervised contrastive learning with noisy labels , author=
-
[38]
Robust early-learning: Hindering the memorization of noisy labels , author=
-
[39]
Improving the Accuracy of Learning Example Weights for Imbalance Classification , author=
-
[40]
Adaptformer: Adapting vision transformers for scalable visual recognition , author=. 2022 , volume=
work page 2022
-
[41]
Stochastic Feature Averaging for Learning with Long-Tailed Noisy Labels , author =
-
[42]
Prototype-Anchored Learning for Learning with Imperfect Annotations , author =. 2022 , volume =
work page 2022
-
[43]
Delving into Sample Loss Curve to Embrace Noisy and Imbalanced Data , volume =
Jiang, Shenwang and Li, Jianan and Wang, Ying and Huang, Bo and Zhang, Zhang and Xu, Tingfa , year =. Delving into Sample Loss Curve to Embrace Noisy and Imbalanced Data , volume =
-
[44]
Prototypical Classifier for Robust Class-Imbalanced Learning
Wei, Tong and Shi, Jiang-Xin and Li, Yu-Feng and Zhang, Min-Ling. Prototypical Classifier for Robust Class-Imbalanced Learning. 2022
work page 2022
-
[45]
Faster R-CNN: Towards real-time object detection with region proposal networks , author=. 2017 , publisher=
work page 2017
-
[46]
Learning from massive noisy labeled data for image classification , author=
-
[47]
Mentornet: Learning data-driven curriculum for very deep neural networks on corrupted labels , author=. 2018 , pages=
work page 2018
-
[48]
Co-teaching: Robust training of deep neural networks with extremely noisy labels , author=
-
[49]
Dividemix: Learning with noisy labels as semi-supervised learning , author=
-
[50]
Identifying mislabeled data using the area under the margin ranking , author =
-
[51]
Unicon: Combating label noise through uniform selection and contrastive learning , author=
-
[52]
Yao, Yazhou and Sun, Zeren and Zhang, Chuanyi and Shen, Fumin and Wu, Qi and Zhang, Jian and Tang, Zhenmin , booktitle=CVPR, pages=. Jo-
-
[53]
Making deep neural networks robust to label noise: A loss correction approach , author=
-
[54]
Using trusted data to train deep networks on labels corrupted by severe noise , author=
-
[55]
Meta-weight-net: Learning an explicit mapping for sample weighting , author=
-
[56]
Dual T: reducing estimation error for transition matrix in label-noise learning , author=
-
[57]
Part-dependent Label Noise: Towards Instance-dependent Label Noise , volume =
Xia, Xiaobo and Liu, Tongliang and Han, Bo and Wang, Nannan and Gong, Mingming and Liu, Haifeng and Niu, Gang and Tao, Dacheng and Sugiyama, Masashi , booktitle = NIPS, pages =. Part-dependent Label Noise: Towards Instance-dependent Label Noise , volume =
-
[58]
Class-dependent label-noise learning with cycle-consistency regularization , author=
- [59]
-
[60]
Sample selection with uncertainty of losses for learning with noisy labels , author=
-
[61]
Tong Wei and Jiang-Xin Shi and Wei-Wei Tu and Yu-Feng Li , title =. Front. Comput. Sci. , volume =
-
[62]
Foster adaptivity and balance in learning with noisy labels , author=. 2024 , organization=
work page 2024
-
[63]
Heteroskedastic and imbalanced deep learning with adaptive regularization , author =
-
[64]
Uncertainty-aware learning against label noise on imbalanced datasets , author =
-
[65]
When noisy labels meet long tail dilemmas: A representation calibration method , author=
-
[66]
Label-noise learning with intrinsically long-tailed data , author =
-
[67]
Extracting Clean and Balanced Subset for Noisy Long-tailed Classification , year =
Li, Zhuo and Zhao, He and Li, Zhen and Liu, Tongliang and Guo, Dandan and Wan, Xiang , journal =. Extracting Clean and Balanced Subset for Noisy Long-tailed Classification , year =
-
[68]
Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts , author=
-
[69]
Class-balanced loss based on effective number of samples , author=
-
[70]
Khan and Munawar Hayat and Mohammed Bennamoun and Ferdous Ahmed Sohel and Roberto Togneri , title =
Salman H. Khan and Munawar Hayat and Mohammed Bennamoun and Ferdous Ahmed Sohel and Roberto Togneri , title =. 2018 , doi =
work page 2018
-
[71]
Learning imbalanced datasets with label-distribution-aware margin loss , author=
-
[72]
Wang, Jiaqi and Zhang, Wenwei and Zang, Yuhang and Cao, Yuhang and Pang, Jiangmiao and Gong, Tao and Chen, Kai and Liu, Ziwei and Loy, Chen Change and Lin, Dahua , title =. 2021 , pages =
work page 2021
-
[73]
Cao, Dong and Zhu, Xiangyu and Huang, Xingyu and Guo, Jianzhu and Lei, Zhen , title =
-
[74]
Zhang, Wanping and Chen, Yongru and Yang, Wenming and Wang, Guijin and Xue, Jing-Hao and Liao, Qingmin , journal=TNNLS, title=. 2021 , volume=
work page 2021
-
[75]
Tan, Jingru and Wang, Changbao and Li, Buyu and Li, Quanquan and Ouyang, Wanli and Yin, Changqing and Yan, Junjie , title =
-
[76]
Hong, Youngkyu and Han, Seungju and Choi, Kwanghee and Seo, Seokjun and Kim, Beomsu and Chang, Buru , title =. 2021 , pages =
work page 2021
-
[77]
Ren, Mengye and Zeng, Wenyuan and Yang, Bin and Urtasun, Raquel , title =
-
[78]
Huang, Chen and Li, Yining and Loy, Chen Change and Tang, Xiaoou , journal=PAMI, title=. 2020 , volume=
work page 2020
-
[79]
Focal Loss for Dense Object Detection , journal = TPAMI, volume =
Tsung. Focal Loss for Dense Object Detection , journal = TPAMI, volume =
-
[80]
Cubuk, Ekin D. and Zoph, Barret and Man. AutoAugment: Learning Augmentation Strategies From Data , booktitle = CVPR, pages =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.