Ryumei Nakada and Masaaki Imaizumi

1964 · DOI 10.1137/1109020

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Sparse Attention as Compact Kernel Regression

cs.LG · 2026-01-30 · unverdicted · novelty 8.0

Sparse attention arises from compact kernel regression, with Epanechnikov and similar kernels mapping to normalized ReLU, sparsemax, and alpha-entmax attention.

Understanding In-Context Learning on Structured Manifolds: Bridging Attention to Kernel Methods

cs.LG · 2025-06-12 · unverdicted · novelty 8.0

Transformers perform kernel-based prediction for Hölder regression on manifolds and achieve intrinsic-dimension-dependent minimax rates with sufficient training tasks.

Direct Estimation of Schrödinger Bridge Time-Series Drifts: Finite-Sample, Asymptotic, and Adaptive Guarantees

math.ST · 2026-05-06 · unverdicted · novelty 7.0

A direct plug-in kernel estimator for Schrödinger bridge time-series drifts achieves uniform non-asymptotic bounds, pointwise CLT under undersmoothing, and minimax-rate optimal adaptive selection.

VeloTree: Inferring single-cell trajectories from RNA velocity fields with varifold distances

q-bio.GN · 2026-04-01 · unverdicted · novelty 7.0

VeloTree infers differentiation trees from RNA velocity fields by defining cell dissimilarity as the squared varifold distance between integral curves of the velocity field.

Towards Scalable Persistence-Based Topological Optimization

cs.CG · 2026-05-09 · unverdicted · novelty 6.0

Random slicing for subsampling combined with Nadaraya-Watson smoothing enables faster and improved persistence-based topological optimization of point clouds in 2D and 3D.

Optimal Experimental Design for Reliable Learning of History-Dependent Constitutive Laws

cond-mat.mtrl-sci · 2026-03-12 · unverdicted · novelty 6.0

A Bayesian optimal experimental design framework with Gaussian approximation of expected information gain and surrogate Fisher information enables optimized uniaxial tests that significantly improve identifiability of history-dependent constitutive parameters over random designs.

What makes a word hard to learn? Modeling L1 influence on English vocabulary difficulty

cs.CL · 2026-05-12 · unverdicted · novelty 5.0

Gradient-boosted models with SHAP analysis find word familiarity as the dominant predictor of English vocabulary difficulty across Spanish, German, and Chinese L1 learners, with orthographic transfer adding value only for the first two groups.

citing papers explorer

Showing 7 of 7 citing papers.

Sparse Attention as Compact Kernel Regression · cs.LG · 2026-01-30 · unverdicted · none · ref 11
Understanding In-Context Learning on Structured Manifolds: Bridging Attention to Kernel Methods · cs.LG · 2025-06-12 · unverdicted · none · ref 8
Direct Estimation of Schrödinger Bridge Time-Series Drifts: Finite-Sample, Asymptotic, and Adaptive Guarantees · math.ST · 2026-05-06 · unverdicted · none · ref 23
VeloTree: Inferring single-cell trajectories from RNA velocity fields with varifold distances · q-bio.GN · 2026-04-01 · unverdicted · none · ref 25
Towards Scalable Persistence-Based Topological Optimization · cs.CG · 2026-05-09 · unverdicted · none · ref 10
Optimal Experimental Design for Reliable Learning of History-Dependent Constitutive Laws · cond-mat.mtrl-sci · 2026-03-12 · unverdicted · none · ref 102
What makes a word hard to learn? Modeling L1 influence on English vocabulary difficulty · cs.CL · 2026-05-12 · unverdicted · none · ref 29
