stable-pretraining- v1: Foundation model research made simple.arXiv preprint arXiv:2511.19484, 2025

Randall Balestriero, Hugues Van Assel, Sami BuGhanem, Lucas Maes · 2025 · arXiv 2511.19484

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

LeVLJEPA: End-to-End Vision-Language Pretraining Without Negatives

cs.CV · 2026-07-01 · unverdicted · novelty 7.0

LeVLJEPA is the first non-contrastive vision-language pretraining method that learns via cross-modal prediction without negatives, producing stronger dense features than contrastive baselines on VQA and segmentation tasks.

FF-JEPA: Long-Horizon Planning in World Models with Latent Planners

cs.AI · 2026-06-08 · unverdicted · novelty 6.0

FF-JEPA introduces a two-model hierarchical structure with an action-free latent planner to decompose long-horizon planning into short subgoals in latent world models.

Generate in Reconstruction Space, Match in Semantic Space: Transport Geometry for One-Step Generation

cs.LG · 2026-05-30 · unverdicted · novelty 6.0

Matching in semantic SSL feature space via Sinkhorn divergence enables effective one-step generation on ImageNet by inducing compact geometry for distribution matching, with training and evaluation features best kept distinct.

citing papers explorer

Showing 1 of 1 citing paper after filters.

LeVLJEPA: End-to-End Vision-Language Pretraining Without Negatives cs.CV · 2026-07-01 · unverdicted · none · ref 4
LeVLJEPA is the first non-contrastive vision-language pretraining method that learns via cross-modal prediction without negatives, producing stronger dense features than contrastive baselines on VQA and segmentation tasks.

stable-pretraining- v1: Foundation model research made simple.arXiv preprint arXiv:2511.19484, 2025

fields

years

verdicts

representative citing papers

citing papers explorer