Attention-based Deep Multiple Instance Learning

Maximilian Ilse , Jakub M. Tomczak , Max Welling

Authors on Pith no claims yet

classification 💻 cs.LG stat.ML

keywords labellearninginstanceattention-baseddatasetsmethodsmultipleneural

read the original abstract

Multiple instance learning (MIL) is a variation of supervised learning where a single class label is assigned to a bag of instances. In this paper, we state the MIL problem as learning the Bernoulli distribution of the bag label where the bag label probability is fully parameterized by neural networks. Furthermore, we propose a neural network-based permutation-invariant aggregation operator that corresponds to the attention mechanism. Notably, an application of the proposed attention-based operator provides insight into the contribution of each instance to the bag label. We show empirically that our approach achieves comparable performance to the best MIL methods on benchmark MIL datasets and it outperforms other methods on a MNIST-based MIL dataset and two real-life histopathology datasets without sacrificing interpretability.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Independent Frames: Latent Attention Masked Autoencoders for Multi-View Echocardiography
cs.CV 2026-04 unverdicted novelty 6.0

LAMAE adds latent-space attention to masked autoencoders so multi-view echocardiography videos can exchange information across frames and views, yielding representations that transfer from adult to pediatric hearts an...
Discrete Diffusion for Codebook-Based Beam Candidate Generation
eess.SP 2026-04 unverdicted novelty 6.0

A discrete denoising diffusion model learns from probing histories to generate promising beam candidates, yielding better SNR, lower beam-miss probability, and reduced probe regret than baselines under tight probing budgets.
Enabling clinical use of foundation models for computational pathology
cs.CV 2026-02 conditional novelty 6.0

Novel robustness losses added during downstream training on foundation-model features from pathology slides improve both robustness to technical variation and classification accuracy.
Weakly Supervised Multicenter Nancy Index Scoring in Ulcerative Colitis Using Foundation Models
cs.CV 2026-04 unverdicted novelty 5.0

Weakly supervised MIL with foundation models enables robust five-grade Nancy index prediction and neutrophilic activity assessment from slide-level labels in multicenter UC biopsies.
Validation of an AI-based end-to-end model for prostate pathology using long-term archived routine samples
cs.CV 2026-05 unverdicted novelty 4.0

GleasonAI achieves quadratic-weighted kappa of 0.86 on ISUP grading of 10,366 long-term archived prostate biopsy cores, with performance stable over 17 years and a clear prognostic gradient for cancer-specific mortality.