arXiv preprint arXiv:2505.12477

Joint Embedding vs Reconstruction: Provable Benefits of Latent Space Prediction for Self Supervised Learning · 2025 · arXiv 2505.12477

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

ToLL: Topological Layout Learning with Asymmetric Cross-View Structural Distillation for 3D Scene Graph Generation Pretraining

cs.CV · 2026-03-30 · unverdicted · novelty 7.0

ToLL pretrains 3D scene graph generators via anchor-conditioned topological layout recovery and asymmetric structural distillation to learn predicate constraints rather than geometric interpolation shortcuts.

Information bottleneck for learning the phase space of dynamics from high-dimensional experimental data

physics.data-an · 2026-04-27 · unverdicted · novelty 6.0

DySIB recovers a two-dimensional representation matching the phase space of a physical pendulum from high-dimensional video data by maximizing predictive mutual information in latent space.

LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics

cs.LG · 2025-11-11 · conditional · novelty 6.0

LeJEPA derives an optimal isotropic Gaussian target for embeddings and enforces it via sketched regularization to deliver scalable, heuristics-free self-supervised pretraining with 79% ImageNet linear accuracy on ViT-H/14.

Stylistic-STORM (ST-STORM): Perceiving the Semantic Nature of Appearance

cs.CV · 2026-04-17 · unverdicted · novelty 5.0

ST-STORM introduces a dual-branch SSL framework that disentangles semantic content from stylistic appearance using gated latent streams, JEPA for content invariance, and adversarial constraints for style capture.

citing papers explorer

Showing 4 of 4 citing papers.

ToLL: Topological Layout Learning with Asymmetric Cross-View Structural Distillation for 3D Scene Graph Generation Pretraining
cs.CV · 2026-03-30 · unverdicted · none · ref 15
Information bottleneck for learning the phase space of dynamics from high-dimensional experimental data
physics.data-an · 2026-04-27 · unverdicted · none · ref 43
LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics
cs.LG · 2025-11-11 · conditional · none · ref 100
Stylistic-STORM (ST-STORM): Perceiving the Semantic Nature of Appearance
cs.CV · 2026-04-17 · unverdicted · none · ref 22
