Polyak and Anatoli B

Polyak, B · 1992 · DOI 10.1137/0330046

10 Pith papers cite this work. Polarity classification is still indexing.

10 Pith papers citing it

open at publisher browse 10 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

cs.CV · 2020-10-22 · accept · novelty 9.0

Vision Transformer (ViT) applies a standard transformer directly to image patches and matches or exceeds state-of-the-art CNN performance on classification benchmarks after large-scale pre-training.

Large-scale Uncertainty Quantification for Latent Variable Models Using Subsampling Markov Chain Monte Carlo

cs.LG · 2026-05-29 · unverdicted · novelty 7.0

Derives joint asymptotic jump-diffusion limit for global parameters and latent variables in SGLD-Gibbs under space-time rescaling, yielding explicit hyperparameter tuning guidance for calibrated uncertainty quantification.

Contour Refinement using Discrete Diffusion in Low Data Regime

cs.CV · 2026-02-05 · unverdicted · novelty 7.0

A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.

Geometrically Averaged Hard Target Updates for Linear Q-Learning

cs.LG · 2026-06-09 · unverdicted · novelty 6.0

Introduces and analyzes the λ-target update for linear Q-learning via geometric averaging of periodic target maps, studied with a switching-system model in the deterministic case.

On What We Can Learn from Low-Resolution Data

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

Low-resolution data improves high-resolution model performance when high-resolution samples are limited, via KL-divergence bounds and experiments on vision transformers and CNNs.

Model Merging: Foundations and Algorithms

cs.LG · 2026-05-02 · unverdicted · novelty 6.0

New cycle-consistent optimization, task vector theory, singular vector decompositions, adaptive routing, and efficient evolutionary search provide foundations for merging neural network weights across tasks.

Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training

hep-ex · 2026-04-08 · conditional · novelty 6.0

Self-supervised pre-training on multimodal neutrino detector simulations produces reusable representations that improve downstream classification, regression, and data efficiency over training from scratch.

Variance Matters: Improving Domain Adaptation via Stratified Sampling

cs.LG · 2025-12-04 · unverdicted · novelty 6.0

VaRDASS improves unsupervised domain adaptation by using stratified sampling to reduce variance in discrepancy estimation for measures like correlation alignment and MMD, with derived error bounds, an optimality proof for MMD under assumptions, and a k-means style algorithm.

Multiscale reconstruction of protein conformations from cryo-EM images

eess.IV · 2026-06-16 · unverdicted · novelty 5.0

A multiscale optimization method using explicit protein backbone geometry reconstructs atomic models from cryo-EM data, showing improved RMSD and TM scores on three simulated datasets.

Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates

cs.LG · 2026-04-13 · unverdicted · novelty 5.0

LGD reaches Bayes optimality at optimal hyperparameters and admits an O(dh) pseudo-dimension bound for meta-learning hyperparameters on convex regression tasks.

citing papers explorer

Showing 8 of 8 citing papers after filters.

Large-scale Uncertainty Quantification for Latent Variable Models Using Subsampling Markov Chain Monte Carlo cs.LG · 2026-05-29 · unverdicted · none · ref 40
Derives joint asymptotic jump-diffusion limit for global parameters and latent variables in SGLD-Gibbs under space-time rescaling, yielding explicit hyperparameter tuning guidance for calibrated uncertainty quantification.
Contour Refinement using Discrete Diffusion in Low Data Regime cs.CV · 2026-02-05 · unverdicted · none · ref 22
A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.
Geometrically Averaged Hard Target Updates for Linear Q-Learning cs.LG · 2026-06-09 · unverdicted · none · ref 25
Introduces and analyzes the λ-target update for linear Q-learning via geometric averaging of periodic target maps, studied with a switching-system model in the deterministic case.
On What We Can Learn from Low-Resolution Data cs.LG · 2026-05-12 · unverdicted · none · ref 28
Low-resolution data improves high-resolution model performance when high-resolution samples are limited, via KL-divergence bounds and experiments on vision transformers and CNNs.
Model Merging: Foundations and Algorithms cs.LG · 2026-05-02 · unverdicted · none · ref 141
New cycle-consistent optimization, task vector theory, singular vector decompositions, adaptive routing, and efficient evolutionary search provide foundations for merging neural network weights across tasks.
Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training hep-ex · 2026-04-08 · conditional · none · ref 59
Self-supervised pre-training on multimodal neutrino detector simulations produces reusable representations that improve downstream classification, regression, and data efficiency over training from scratch.
Multiscale reconstruction of protein conformations from cryo-EM images eess.IV · 2026-06-16 · unverdicted · none · ref 200
A multiscale optimization method using explicit protein backbone geometry reconstructs atomic models from cryo-EM data, showing improved RMSD and TM scores on three simulated datasets.
Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates cs.LG · 2026-04-13 · unverdicted · none · ref 12
LGD reaches Bayes optimality at optimal hyperparameters and admits an O(dh) pseudo-dimension bound for meta-learning hyperparameters on convex regression tasks.

Polyak and Anatoli B

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer