Damien Ernst, Pierre Geurts, and Louis Wehenkel

Rich Caruana · 1997 · Machine Learning · DOI 10.1023/a:1007379606734

10 Pith papers cite this work, alongside 4,684 external citations. Polarity classification is still indexing.

10 Pith papers citing it

4,684 external citations · Crossref

open at publisher browse 10 citing papers

citation-role summary

method 2

citation-polarity summary

use method 2

representative citing papers

Hypothesis generation and updating in large language models

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

LLMs exhibit Bayesian-like hypothesis updating with strong-sampling bias and an evaluation-generation gap but generalize poorly outside observed data.

The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability

cs.LG · 2026-04-20 · unverdicted · novelty 6.0

Task-aligned supervised geometric stability predicts linear steerability with high accuracy while unsupervised stability detects representational drift earlier and with lower false alarms than CKA or Procrustes.

Toward Unified Fine-Grained Vehicle Classification and Automatic License Plate Recognition

cs.CV · 2026-04-07 · accept · novelty 6.0

UFPR-VeSV is a new real-world dataset for fine-grained vehicle classification and automatic license plate recognition collected from Brazilian police cameras, with benchmarks demonstrating its difficulty and the value of joint task use.

Response-free item difficulty modelling for multiple-choice items with fine-tuned transformers: Component-wise representation and multi-task learning

cs.CL · 2026-05-16 · conditional · novelty 5.0

Fine-tuned transformers with multi-task learning recover substantial wording-derived signal for item difficulty at small sample sizes typical in applied testing.

Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations

cs.LG · 2021-10-06 · unverdicted · novelty 5.0

CFQI extends fitted Q-iteration by using separate modules for compositional task variants to learn policies robust to imbalanced patient sub-populations in medical RL.

Benchmarking Machine Learning Architectures for Antimicrobial Stewardship in Pediatric ICUs

cs.LG · 2026-05-21 · unverdicted · novelty 4.0

Benchmarking in pediatric ICU antimicrobial stewardship shows performance depends mainly on target prevalence and dataset traits rather than model complexity, with sequence models improving precision-recall at 24-hour resolution but showing poorer calibration than tabular models.

Multi-task learning on partially labeled datasets via invariant/equivariant semi-supervised learning

cs.CV · 2026-05-17 · unverdicted · novelty 4.0

Invariant and equivariant semi-supervised learning improves multi-task detection and segmentation performance on partially labeled vision datasets compared to supervised baselines.

Opportunistic Bone-Loss Screening from Routine Knee Radiographs Using a Multi-Task Deep Learning Framework with Sensitivity-Constrained Threshold Optimization

cs.CV · 2026-04-22 · unverdicted · novelty 4.0

STR-Net achieves AUROC of 0.933 for binary bone-loss screening and 0.801 correlation for T-score estimation from knee X-rays on a held-out test set.

SG-UniBuc-NLP at SemEval-2026 Task 6: Multi-Head RoBERTa with Chunking for Long-Context Evasion Detection

cs.CL · 2026-04-29 · unverdicted · novelty 3.0

A multi-head RoBERTa model with overlapping chunking and max-pooling achieves Macro-F1 of 0.80 on 3-way clarity classification and 0.51 on 9-way evasion strategy detection, ranking 11th in both subtasks of SemEval-2026 Task 6.

YEZE at SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization via Heterogeneous Ensembling

cs.CL · 2026-05-07 · unverdicted · novelty 2.0 · 2 refs

A heterogeneous ensemble of XLM-RoBERTa-large and mDeBERTa-v3-base with independent task modeling and class weighting is reported as effective for multilingual, multicultural, and multievent online polarization detection.

citing papers explorer

Showing 10 of 10 citing papers.

Hypothesis generation and updating in large language models cs.LG · 2026-05-07 · unverdicted · none · ref 87
LLMs exhibit Bayesian-like hypothesis updating with strong-sampling bias and an evaluation-generation gap but generalize poorly outside observed data.
The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability cs.LG · 2026-04-20 · unverdicted · none · ref 70
Task-aligned supervised geometric stability predicts linear steerability with high accuracy while unsupervised stability detects representational drift earlier and with lower false alarms than CKA or Procrustes.
Toward Unified Fine-Grained Vehicle Classification and Automatic License Plate Recognition cs.CV · 2026-04-07 · accept · none · ref 6
UFPR-VeSV is a new real-world dataset for fine-grained vehicle classification and automatic license plate recognition collected from Brazilian police cameras, with benchmarks demonstrating its difficulty and the value of joint task use.
Response-free item difficulty modelling for multiple-choice items with fine-tuned transformers: Component-wise representation and multi-task learning cs.CL · 2026-05-16 · conditional · none · ref 6
Fine-tuned transformers with multi-task learning recover substantial wording-derived signal for item difficulty at small sample sizes typical in applied testing.
Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations cs.LG · 2021-10-06 · unverdicted · none · ref 5
CFQI extends fitted Q-iteration by using separate modules for compositional task variants to learn policies robust to imbalanced patient sub-populations in medical RL.
Benchmarking Machine Learning Architectures for Antimicrobial Stewardship in Pediatric ICUs cs.LG · 2026-05-21 · unverdicted · none · ref 73
Benchmarking in pediatric ICU antimicrobial stewardship shows performance depends mainly on target prevalence and dataset traits rather than model complexity, with sequence models improving precision-recall at 24-hour resolution but showing poorer calibration than tabular models.
Multi-task learning on partially labeled datasets via invariant/equivariant semi-supervised learning cs.CV · 2026-05-17 · unverdicted · none · ref 3
Invariant and equivariant semi-supervised learning improves multi-task detection and segmentation performance on partially labeled vision datasets compared to supervised baselines.
Opportunistic Bone-Loss Screening from Routine Knee Radiographs Using a Multi-Task Deep Learning Framework with Sensitivity-Constrained Threshold Optimization cs.CV · 2026-04-22 · unverdicted · none · ref 2
STR-Net achieves AUROC of 0.933 for binary bone-loss screening and 0.801 correlation for T-score estimation from knee X-rays on a held-out test set.
SG-UniBuc-NLP at SemEval-2026 Task 6: Multi-Head RoBERTa with Chunking for Long-Context Evasion Detection cs.CL · 2026-04-29 · unverdicted · none · ref 6
A multi-head RoBERTa model with overlapping chunking and max-pooling achieves Macro-F1 of 0.80 on 3-way clarity classification and 0.51 on 9-way evasion strategy detection, ranking 11th in both subtasks of SemEval-2026 Task 6.
YEZE at SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization via Heterogeneous Ensembling cs.CL · 2026-05-07 · unverdicted · none · ref 32 · 2 links
A heterogeneous ensemble of XLM-RoBERTa-large and mDeBERTa-v3-base with independent task modeling and class weighting is reported as effective for multilingual, multicultural, and multievent online polarization detection.

Damien Ernst, Pierre Geurts, and Louis Wehenkel

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer