Transactions on Machine Learning Research , issn=

Maxime Oquab, Timoth · 2024

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

browse 9 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model

cs.RO · 2026-05-12 · conditional · novelty 7.0

GridS is a plug-and-play differentiable module for geometry-aware visual token resampling in VLA models that achieves under 10% token retention and 76% FLOPs reduction with no success-rate loss.

Taming the Entropy Cliff: Variable Codebook Size Quantization for Autoregressive Visual Generation

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

Variable codebook sizes that increase along the sequence in visual tokenizers reduce generation FID scores significantly for autoregressive models on ImageNet.

Label-Efficient Dataset Pruning via Semi-Supervised Pseudo-Labeling

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

SemiPrune uses a small labeled subset and semi-supervised pseudo-labeling to enable supervised dataset pruning methods, achieving state-of-the-art results on domain-specific, image-corrupted, and long-tailed datasets.

Ada-Diffuser: Latent-Aware Adaptive Diffusion for Decision-Making

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Ada-Diffuser is a causal diffusion model that jointly learns observed interaction structure and underlying latent dynamics from minimal observations for adaptive planning and policy learning.

Composition of Memory Experts for Diffusion World Models

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

A compositional diffusion world model integrates three specialized memory experts via contrastive product-of-experts to improve temporal consistency, past recall, and navigation while scaling to long contexts without quadratic costs.

Rethinking Temporal Consistency in Video Object-Centric Learning: From Prediction to Correspondence

cs.CV · 2026-05-05 · unverdicted · novelty 6.0

Grounded Correspondence maintains temporal consistency via deterministic bipartite matching on frozen backbone features instead of learned predictors, achieving competitive results on MOVi and YouTube-VIS with zero learnable temporal parameters.

Information theoretic underpinning of self-supervised learning by clustering

cs.LG · 2026-05-12 · unverdicted · novelty 5.0

SSL clustering is derived as KL-divergence optimization where a teacher-distribution constraint normalizes via inverse cluster priors and simplifies to batch centering by Jensen's inequality.

Unlocking Compositional Generalization in Continual Few-Shot Learning

cs.LG · 2026-05-12 · unverdicted · novelty 5.0 · 2 refs

A decoupling strategy optimizes object slots for holistic class identity during training and composes them at inference to achieve better generalization to unseen concepts in continual few-shot settings.

APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment

cs.CV · 2026-05-08 · unverdicted · novelty 5.0

APEX is an assumption-free image quality metric using Sliced Wasserstein Distance on CLIP and DINOv2 embeddings that claims superior robustness to degradations and cross-dataset stability.

citing papers explorer

Showing 9 of 9 citing papers.

See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model cs.RO · 2026-05-12 · conditional · none · ref 39
GridS is a plug-and-play differentiable module for geometry-aware visual token resampling in VLA models that achieves under 10% token retention and 76% FLOPs reduction with no success-rate loss.
Taming the Entropy Cliff: Variable Codebook Size Quantization for Autoregressive Visual Generation cs.CV · 2026-05-07 · unverdicted · none · ref 4
Variable codebook sizes that increase along the sequence in visual tokenizers reduce generation FID scores significantly for autoregressive models on ImageNet.
Label-Efficient Dataset Pruning via Semi-Supervised Pseudo-Labeling cs.LG · 2026-05-22 · unverdicted · none · ref 32
SemiPrune uses a small labeled subset and semi-supervised pseudo-labeling to enable supervised dataset pruning methods, achieving state-of-the-art results on domain-specific, image-corrupted, and long-tailed datasets.
Ada-Diffuser: Latent-Aware Adaptive Diffusion for Decision-Making cs.LG · 2026-05-15 · unverdicted · none · ref 207
Ada-Diffuser is a causal diffusion model that jointly learns observed interaction structure and underlying latent dynamics from minimal observations for adaptive planning and policy learning.
Composition of Memory Experts for Diffusion World Models cs.LG · 2026-05-12 · unverdicted · none · ref 40
A compositional diffusion world model integrates three specialized memory experts via contrastive product-of-experts to improve temporal consistency, past recall, and navigation while scaling to long contexts without quadratic costs.
Rethinking Temporal Consistency in Video Object-Centric Learning: From Prediction to Correspondence cs.CV · 2026-05-05 · unverdicted · none · ref 24
Grounded Correspondence maintains temporal consistency via deterministic bipartite matching on frozen backbone features instead of learned predictors, achieving competitive results on MOVi and YouTube-VIS with zero learnable temporal parameters.
Information theoretic underpinning of self-supervised learning by clustering cs.LG · 2026-05-12 · unverdicted · none · ref 54
SSL clustering is derived as KL-divergence optimization where a teacher-distribution constraint normalizes via inverse cluster priors and simplifies to batch centering by Jensen's inequality.
Unlocking Compositional Generalization in Continual Few-Shot Learning cs.LG · 2026-05-12 · unverdicted · none · ref 7 · 2 links
A decoupling strategy optimizes object slots for holistic class identity during training and composes them at inference to achieve better generalization to unseen concepts in continual few-shot settings.
APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment cs.CV · 2026-05-08 · unverdicted · none · ref 24
APEX is an assumption-free image quality metric using Sliced Wasserstein Distance on CLIP and DINOv2 embeddings that claims superior robustness to degradations and cross-dataset stability.

Transactions on Machine Learning Research , issn=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer