Title resolution pending

DOI: 10.1137/0330046

7 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

cs.CV · 2020-10-22 · accept · novelty 9.0

Vision Transformer (ViT) applies a standard transformer directly to image patches and matches or exceeds state-of-the-art CNN performance on classification benchmarks after large-scale pre-training.

Contour Refinement using Discrete Diffusion in Low Data Regime

cs.CV · 2026-02-05 · unverdicted · novelty 7.0

A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.

On What We Can Learn from Low-Resolution Data

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

Low-resolution data improves high-resolution model performance when high-resolution samples are limited, via KL-divergence bounds and experiments on vision transformers and CNNs.

Model Merging: Foundations and Algorithms

cs.LG · 2026-05-02 · unverdicted · novelty 6.0

New cycle-consistent optimization, task vector theory, singular vector decompositions, adaptive routing, and efficient evolutionary search provide foundations for merging neural network weights across tasks.

Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training

hep-ex · 2026-04-08 · conditional · novelty 6.0

Self-supervised pre-training on multimodal neutrino detector simulations produces reusable representations that improve downstream classification, regression, and data efficiency over training from scratch.

Variance Matters: Improving Domain Adaptation via Stratified Sampling

cs.LG · 2025-12-04 · unverdicted · novelty 6.0

VaRDASS improves unsupervised domain adaptation by using stratified sampling to reduce variance in discrepancy estimation for measures like correlation alignment and MMD, with derived error bounds, an optimality proof for MMD under assumptions, and a k-means style algorithm.

Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates

cs.LG · 2026-04-13 · unverdicted · novelty 5.0

LGD reaches Bayes optimality at optimal hyperparameters and admits an O(dh) pseudo-dimension bound for meta-learning hyperparameters on convex regression tasks.

citing papers explorer

Showing 7 of 7 citing papers.

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
cs.CV · 2020-10-22 · accept · none · ref 3
Contour Refinement using Discrete Diffusion in Low Data Regime
cs.CV · 2026-02-05 · unverdicted · none · ref 22
On What We Can Learn from Low-Resolution Data
cs.LG · 2026-05-12 · unverdicted · none · ref 28
Model Merging: Foundations and Algorithms
cs.LG · 2026-05-02 · unverdicted · none · ref 141
Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training
hep-ex · 2026-04-08 · conditional · none · ref 59
Variance Matters: Improving Domain Adaptation via Stratified Sampling
cs.LG · 2025-12-04 · unverdicted · none · ref 32
Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates
cs.LG · 2026-04-13 · unverdicted · none · ref 12
