A cookbook of self-supervised learning

URL: https://arxiv · 2023 · arXiv 2304.12210

11 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2 · method 1

citation-polarity summary

background 2 · use method 1

representative citing papers

Optimal Representations for Generalized Contrastive Learning with Imbalanced Datasets

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

In generalized contrastive learning with imbalanced classes, optimal representations collapse to class means whose angular geometry is determined by class proportions via convex optimization, and extreme imbalance causes all minority classes to collapse to one vector.

RankUp: Towards High-rank Representations for Large Scale Advertising Recommender Systems

cs.IR · 2026-04-20 · unverdicted · novelty 6.0 · 2 refs

RankUp raises effective rank of representations in deep MetaFormer recommenders via randomized splitting and multi-embeddings, delivering 2-5% GMV gains in production deployments at Weixin.

Grounding Hierarchical Vision-Language-Action Models Through Explicit Language-Action Alignment

cs.RO · 2026-04-07 · unverdicted · novelty 6.0

A contrastive alignment model plus offline preference learning explicitly grounds hierarchical VLA language descriptions to actions and visuals on LanguageTable, achieving performance comparable to fully supervised fine-tuning while reducing annotation needs.

Rapidly deploying on-device eye tracking by distilling visual foundation models

cs.CV · 2026-04-02 · unverdicted · novelty 6.0

DistillGaze reduces median gaze error by 58.62% on a 2000+ participant dataset by distilling foundation models into a 256K-parameter on-device model using synthetic labeled data and unlabeled real data.

LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics

cs.LG · 2025-11-11 · conditional · novelty 6.0

LeJEPA derives an optimal isotropic Gaussian target for embeddings and enforces it via sketched regularization to deliver scalable, heuristics-free self-supervised pretraining with 79% ImageNet linear accuracy on ViT-H/14.

Self-supervised neural operator for solving partial differential equations

physics.comp-ph · 2025-08-31 · unverdicted · novelty 6.0

Self-supervised neural operator uses Bayesian PINNs to generate training data and a Transformer to learn PDE operators, achieving high accuracy on 1D/2D reaction-diffusion and fluid vibration problems with optional lightweight finetuning.

Statistical learnability of smooth boundaries via pairwise binary classification with deep ReLU networks

math.ST · 2025-01-13 · unverdicted · novelty 6.0

Proves learnability of ordered multiple smooth boundaries in pairwise binary classification via localized deep ReLU networks.

Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture

cs.LG · 2024-10-11 · unverdicted · novelty 6.0

ECG-JEPA applies a joint-embedding predictive architecture with Cross-Pattern Attention to learn semantic representations from unlabeled 12-lead ECG data and reports state-of-the-art results on diagnostic classification, feature extraction, and segmentation.

Self-Supervised Learning of Plant Image Representations

cs.CV · 2026-04-30 · unverdicted · novelty 5.0

Domain-specific augmentations and plant-only training data produce stronger self-supervised representations for fine-grained plant recognition than standard SSL pipelines or ImageNet pretraining.

There Will Be a Scientific Theory of Deep Learning

stat.ML · 2026-04-23 · unverdicted · novelty 2.0

A mechanics of the learning process is emerging in deep learning theory, characterized by dynamics, coarse statistics, and falsifiable predictions across idealized settings, limits, laws, hyperparameters, and universal behaviors.

Next-Latent Prediction Transformers Learn Compact World Models

cs.LG · 2025-11-08

citing papers explorer

Showing 11 of 11 citing papers.

Optimal Representations for Generalized Contrastive Learning with Imbalanced Datasets
cs.LG · 2026-05-11 · unverdicted · none · ref 25

RankUp: Towards High-rank Representations for Large Scale Advertising Recommender Systems
cs.IR · 2026-04-20 · unverdicted · none · ref 3 · 2 links

Grounding Hierarchical Vision-Language-Action Models Through Explicit Language-Action Alignment
cs.RO · 2026-04-07 · unverdicted · none · ref 2

Rapidly deploying on-device eye tracking by distilling visual foundation models
cs.CV · 2026-04-02 · unverdicted · none · ref 30

LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics
cs.LG · 2025-11-11 · conditional · none · ref 101

Self-supervised neural operator for solving partial differential equations
physics.comp-ph · 2025-08-31 · unverdicted · none · ref 27

Statistical learnability of smooth boundaries via pairwise binary classification with deep ReLU networks
math.ST · 2025-01-13 · unverdicted · none · ref 4

Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture
cs.LG · 2024-10-11 · unverdicted · none · ref 14

Self-Supervised Learning of Plant Image Representations
cs.CV · 2026-04-30 · unverdicted · none · ref 1

There Will Be a Scientific Theory of Deep Learning
stat.ML · 2026-04-23 · unverdicted · none · ref 38

Next-Latent Prediction Transformers Learn Compact World Models
cs.LG · 2025-11-08 · unreviewed · ref 4
