New Bounds for Kernel Sums via Fast Spherical Embeddings

Tal Wagner

arxiv: 2605.01263 · v1 · submitted 2026-05-02 · 💻 cs.DS · cs.LG

New Bounds for Kernel Sums via Fast Spherical Embeddings

Tal Wagner This is my paper

Pith reviewed 2026-05-09 18:34 UTC · model grok-4.3

classification 💻 cs.DS cs.LG

keywords kernel mean estimationGaussian kernelspherical embeddingsquery time boundsadditive errordata diameterfast embeddingsgeometric data structures

0 comments

The pith

A new fast spherical embedding yields the bound tilde O(d + epsilon Delta squared plus 1 over epsilon cubed) for estimating Gaussian kernel means up to additive error epsilon.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper proves an improved upper bound on the time needed to estimate the average value of a Gaussian kernel function over a finite dataset for a query point. Earlier results gave bounds of O(d over epsilon squared), tilde O(d plus 1 over epsilon to the fourth), and tilde O(d plus Delta squared over epsilon squared), where Delta is the diameter of the point set. The new bound improves on those when the allowed error epsilon is small and the diameter Delta sits in an intermediate range. The proof rests on a spherical embedding procedure that keeps nearby points at roughly correct distances while capping the overall spread of the embedded set and preventing far-away points from collapsing together. If the bound holds, kernel-mean queries become practical for higher-precision or moderately spread data without paying the full previous cost in either dimension or error terms.

Core claim

The authors prove a new query-time bound of tilde O(d + epsilon Delta squared + 1 over epsilon cubed) for approximating the kernel mean one over the size of X times the sum over x in X of k(x, y) up to additive error epsilon, for the Gaussian kernel. This improves the prior bounds precisely when epsilon is small and Delta is intermediate. The argument centers on a fast spherical embedding theorem, in the style of Bartal, Recht and Schulman, that limits the diameter of the embedded point set while preserving local Euclidean distances and avoiding distance collapse at larger scales.

What carries the argument

Fast spherical embedding theorem that limits embedded data diameter while preserving local Euclidean distances and avoiding distance collapse at larger scales.

If this is right

For Gaussian kernel mean queries with small epsilon and moderate Delta the new bound is asymptotically faster than the three earlier results.
The embedding procedure supplies a geometric primitive that can be substituted into other kernel or distance-based algorithms.
The same embedding argument may tighten bounds for related summation or averaging tasks that depend on local geometry.
Independent use of the embedding theorem is possible in any setting that needs controlled-diameter embeddings without local distortion.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Implementing the embedding and measuring actual query times on real data sets with controlled diameters would test whether the asymptotic improvement appears at practical scales.
The technique might extend to kernels other than Gaussian by replacing the distance-preservation condition with an analogous local similarity requirement.
The diameter-control property could interact with dimension-reduction methods to produce combined bounds that depend on both intrinsic dimension and diameter.

Load-bearing premise

The new fast spherical embedding theorem must hold: it must limit the diameter of the embedded points while keeping local distances accurate and preventing collapse of distant points.

What would settle it

A concrete point set X and parameters epsilon and Delta for which every spherical embedding either violates local-distance preservation, fails to bound the diameter, or produces a query procedure whose running time exceeds tilde O(d + epsilon Delta squared + 1 over epsilon cubed) while still achieving additive error epsilon.

Figures

Figures reproduced from arXiv: 2605.01263 by Tal Wagner.

**Figure 1.** Figure 1: Best regime per method in view at source ↗

read the original abstract

We study query time bounds for the fundamental problem of estimating the kernel mean $\frac1{|X|}\sum_{x\in X}\mathbf{k}(x,y)$ of a query $y$ in a finite dataset $X\subset\mathbb{R}^d$ up to a prescribed additive error $\varepsilon$. The best known bounds for the Gaussian kernel are $O(d/\varepsilon^2)$, $\widetilde O(d+1/\varepsilon^4)$, and $\widetilde O(d+\Delta^2/\varepsilon^2)$, where $\Delta$ is the diameter of a region containing the points. We prove the new bound $\tilde O(d+\varepsilon\Delta^2+1/\varepsilon^3)$, which improves over the previous ones in regimes with small error $\varepsilon$ and intermediate diameter $\Delta$. At the center of our proof is a new fast spherical embedding theorem in the sense introduced by Bartal, Recht and Schulman (2011), which limits the embedded data diameter while preserving local Euclidean distances and avoiding ``distance collapse'' at larger scales. This fast embedding theorem may be of independent interest.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper improves the query bound for Gaussian kernel mean estimation to tilde O(d + epsilon Delta^2 + 1/epsilon^3) via a new fast spherical embedding that controls diameter without collapsing distances.

read the letter

The main takeaway is a tighter query-time guarantee for approximating the average Gaussian kernel value over a dataset. The new bound tilde O(d + epsilon Delta^2 + 1/epsilon^3) beats the three cited priors in the small-epsilon, moderate-Delta regime by swapping a Delta^2/epsilon^2 term for something milder. The proof route is a fresh spherical embedding theorem that keeps nearby Euclidean distances intact, caps the embedded diameter, and prevents far points from collapsing together. That construction is presented as the key technical step and looks like it could stand alone for other distance-sensitive problems. The abstract comparison to Bartal-Recht-Schulman style embeddings is direct and shows exactly where the improvement lands. The argument avoids circularity; the target quantity is not defined in terms of itself, and the free parameters are just the usual epsilon and Delta. If the embedding runs in near-linear time and the distortion analysis holds without extra logs that eat the gain, this is a clean advance for kernel primitives. The soft spot is that the bound remains regime-specific and does not dominate the older ones everywhere, so practitioners would still pick among them. The embedding details matter most: without seeing the explicit construction and its runtime, it is hard to judge whether the query-time claim survives the embedding cost or if hidden factors appear in high dimensions. The paper is aimed at theorists who care about fine-grained bounds on kernel sums or high-dimensional similarity. A reader working on dimensionality reduction or ML algorithm design would get concrete value from the new embedding primitive. It deserves a serious referee because the claim is specific, the improvement is measurable against prior work, and the central object is a reusable theorem rather than a one-off calculation.

Referee Report

0 major / 2 minor

Summary. The manuscript claims improved query-time bounds for estimating the Gaussian kernel mean (1/|X|) sum_{x in X} k(x,y) up to additive error ε. Prior bounds are O(d/ε²), ~O(d + 1/ε⁴), and ~O(d + Δ²/ε²) where Δ is the diameter of a containing region; the new bound is ~O(d + ε Δ² + 1/ε³), which improves in the small-ε, intermediate-Δ regime. The proof centers on a new fast spherical embedding theorem (in the sense of Bartal-Recht-Schulman 2011) that bounds the embedded diameter while preserving local Euclidean distances and avoiding distance collapse at larger scales; the embedding may be of independent interest.

Significance. If the embedding theorem and its application to the kernel-sum estimator are correct, the result tightens query complexity for a fundamental primitive in kernel methods and computational geometry without introducing free parameters. The improvement is regime-specific and directly traceable to replacing the Δ²/ε² term with εΔ² + 1/ε³ under the stated embedding properties. The new embedding construction itself could be reusable in other approximation algorithms that require controlled distortion at multiple scales.

minor comments (2)

Abstract: the statement that the new bound 'improves over the previous ones in regimes with small error ε and intermediate diameter Δ' would be strengthened by a short explicit comparison (e.g., when εΔ² + 1/ε³ < Δ²/ε²) or a small table in the introduction.
The formal statement of the fast spherical embedding theorem (presumably in the section containing the main proof) should include an explicit quantitative guarantee that distances larger than some threshold are not collapsed (e.g., a lower bound on the embedded distance in terms of the original distance), so that the avoidance of collapse is verifiable from the theorem statement alone.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive summary of the manuscript, recognition of the regime-specific improvement, and recommendation of minor revision. No specific major comments or requested changes were provided in the report.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's central derivation introduces a new fast spherical embedding theorem (in the sense of Bartal et al. 2011) that independently limits embedded diameter while preserving local Euclidean distances and avoiding collapse at larger scales. This theorem is stated as a standalone contribution and is used to derive the improved query-time bound tilde O(d + εΔ² + 1/ε³) without any reduction of the target quantity to fitted parameters, self-definitions, or load-bearing self-citations. The comparison to prior bounds O(d/ε²), tilde O(d + 1/ε⁴), and tilde O(d + Δ²/ε²) follows directly from the embedding properties. No steps match the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 1 invented entities

The central addition is the new embedding theorem; no free parameters are mentioned and the bound is derived from standard geometric properties plus the new construction.

axioms (2)

standard math Gaussian kernel is a standard positive-definite function on Euclidean space
Invoked implicitly when defining the kernel mean estimation problem.
standard math Euclidean distance satisfies triangle inequality and is preserved locally under the embedding
Used to control approximation error after embedding.

invented entities (1)

Fast spherical embedding no independent evidence
purpose: Map points to a sphere that limits diameter while preserving local distances and avoiding collapse at larger scales
New theorem introduced as the key technical tool for the improved bound.

pith-pipeline@v0.9.0 · 5480 in / 1382 out tokens · 52926 ms · 2026-05-09T18:34:07.741059+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

66 extracted references · 66 canonical work pages

[1]

Advances in neural information processing systems , volume=

Random features for large-scale kernel machines , author=. Advances in neural information processing systems , volume=

work page
[2]

Proceedings of the international conference on machine learning , volume=

Fastfood-approximating kernel expansions in loglinear time , author=. Proceedings of the international conference on machine learning , volume=

work page
[3]

SIAM Journal on computing , volume=

The fast Johnson--Lindenstrauss transform and approximate nearest neighbors , author=. SIAM Journal on computing , volume=. 2009 , publisher=

work page 2009
[4]

Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms , pages=

Dimensionality reduction: beyond the Johnson-Lindenstrauss bound , author=. Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms , pages=. 2011 , organization=

work page 2011
[5]

Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing , pages=

Uniform approximations for randomized hadamard transforms with applications , author=. Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing , pages=

work page
[6]

International Conference on Machine Learning , pages=

Recycling randomness with structure for sublinear time kernel expansions , author=. International Conference on Machine Learning , pages=. 2016 , organization=

work page 2016
[7]

International Conference on Artificial Intelligence and Statistics , pages=

The geometry of random features , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2018 , organization=

work page 2018
[8]

2018 , publisher=

High-dimensional probability: An introduction with applications in data science , author=. 2018 , publisher=

work page 2018
[9]

Lecture notes , year=

Introduction to Malliavin calculus , author=. Lecture notes , year=

work page
[10]

2015 , publisher=

Introduction to uncertainty quantification , author=. 2015 , publisher=

work page 2015
[11]

Summer school on machine learning , pages=

Concentration inequalities , author=. Summer school on machine learning , pages=. 2003 , publisher=

work page 2003
[12]

American Journal of Mathematics , volume=

The homogeneous chaos , author=. American Journal of Mathematics , volume=. 1938 , publisher=

work page 1938
[13]

Applied and Computational Harmonic Analysis , volume=

Approximation by exponential sums revisited , author=. Applied and Computational Harmonic Analysis , volume=. 2010 , publisher=

work page 2010
[14]

Journal of Machine Learning Research , volume=

Differential privacy for functions and functional data , author=. Journal of Machine Learning Research , volume=

work page
[15]

2013 , school=

New Statistical Applications for Differential Privacy , author=. 2013 , school=

work page 2013
[16]

Journal of Machine Learning Research , volume=

Differentially private data releasing for smooth queries , author=. Journal of Machine Learning Research , volume=

work page
[17]

Thirty-First AAAI Conference on Artificial Intelligence , year=

The bernstein mechanism: Function release under differential privacy , author=. Thirty-First AAAI Conference on Artificial Intelligence , year=

work page
[18]

2021 , booktitle =

Coleman, Benjamin and Shrivastava, Anshumali , title =. 2021 , booktitle =

work page 2021
[19]

International Conference on Machine Learning (ICML) , pages=

Fast private kernel density estimation via locality sensitive quantization , author=. International Conference on Machine Learning (ICML) , pages=. 2023 , organization=

work page 2023
[20]

International Conference on Learning Representations (ICLR) , year=

Efficiently Computing Similarities to Private Datasets , author=. International Conference on Learning Representations (ICLR) , year=

work page
[21]

International Conference on Learning Representations (ICLR) , year=

Learning from End User Data with Shuffled Differential Privacy over Kernel Densities , author=. International Conference on Learning Representations (ICLR) , year=

work page
[22]

Theory of cryptography conference (TCC) , pages=

Calibrating noise to sensitivity in private data analysis , author=. Theory of cryptography conference (TCC) , pages=. 2006 , organization=

work page 2006
[23]

Foundations and Trends

The algorithmic foundations of differential privacy , author=. Foundations and Trends. 2014 , publisher=

work page 2014
[24]

Contemporary mathematics , volume=

Extensions of Lipschitz mappings into a Hilbert space , author=. Contemporary mathematics , volume=

work page
[25]

SIAM Journal on Scientific and Statistical Computing , volume=

The fast Gauss transform , author=. SIAM Journal on Scientific and Statistical Computing , volume=. 1991 , publisher=

work page 1991
[26]

Advances in neural information processing systems , volume=

The unreasonable effectiveness of structured random orthogonal embeddings , author=. Advances in neural information processing systems , volume=

work page
[27]

Advances in neural information processing systems , volume=

Practical and optimal LSH for angular distance , author=. Advances in neural information processing systems , volume=

work page
[28]

Advances in neural information processing systems , volume=

Orthogonal random features , author=. Advances in neural information processing systems , volume=

work page
[29]

2006 47th annual IEEE symposium on foundations of computer science (FOCS'06) , pages=

Improved approximation algorithms for large matrices via random projections , author=. 2006 47th annual IEEE symposium on foundations of computer science (FOCS'06) , pages=. 2006 , organization=

work page 2006
[30]

Advances in Adaptive Data Analysis , volume=

Improved analysis of the subsampled randomized Hadamard transform , author=. Advances in Adaptive Data Analysis , volume=. 2011 , publisher=

work page 2011
[31]

Numerische mathematik , volume=

Faster least squares approximation , author=. Numerische mathematik , volume=. 2011 , publisher=

work page 2011
[32]

SIAM Journal on Matrix Analysis and Applications , volume=

Improved matrix algorithms via the subsampled randomized Hadamard transform , author=. SIAM Journal on Matrix Analysis and Applications , volume=. 2013 , publisher=

work page 2013
[33]

ACM Transactions on Algorithms (TALG) , volume=

An almost optimal unrestricted fast Johnson-Lindenstrauss transform , author=. ACM Transactions on Algorithms (TALG) , volume=. 2013 , publisher=

work page 2013
[34]

Advances in neural information processing systems , volume=

Faster ridge regression via the subsampled randomized hadamard transform , author=. Advances in neural information processing systems , volume=

work page
[35]

2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS) , pages=

Sub-quadratic (1+ )-approximate Euclidean Spanners, with Applications , author=. 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS) , pages=. 2023 , organization=

work page 2023
[36]

Advances in Neural Information Processing Systems , volume=

Differentially private approximate near neighbor counting in high dimensions , author=. Advances in Neural Information Processing Systems , volume=

work page
[37]

Discrete & Computational Geometry , volume=

A nonlinear approach to dimension reduction , author=. Discrete & Computational Geometry , volume=. 2015 , publisher=

work page 2015
[38]

Conference on Learning Theory , pages=

Discrepancy, coresets, and sketches in machine learning , author=. Conference on Learning Theory , pages=. 2019 , organization=

work page 2019
[39]

International Conference on Machine Learning , pages=

Towards a learning theory of cause-effect inference , author=. International Conference on Machine Learning , pages=. 2015 , organization=

work page 2015
[40]

Artificial Intelligence and Statistics , pages=

Sequential kernel herding: Frank-Wolfe optimization for particle filtering , author=. Artificial Intelligence and Statistics , pages=. 2015 , organization=

work page 2015
[41]

Discrete & Computational Geometry , volume=

Near-optimal coresets of kernel density estimates , author=. Discrete & Computational Geometry , volume=. 2020 , publisher=

work page 2020
[42]

Journal of Machine Learning Research , volume=

Kernel thinning , author=. Journal of Machine Learning Research , volume=

work page
[43]

2020 IEEE 61st Annual Symposium on Foundations of Computer Science , pages=

Kernel density estimation through density constrained near neighbor search , author=. 2020 IEEE 61st Annual Symposium on Foundations of Computer Science , pages=. 2020 , organization=

work page 2020
[44]

2017 IEEE 58th Annual Symposium on Foundations of Computer Science , pages=

Hashing-based-estimators for kernel density in high dimensions , author=. 2017 IEEE 58th Annual Symposium on Foundations of Computer Science , pages=. 2017 , organization=

work page 2017
[45]

Advances in neural information processing systems , volume=

Space and time efficient kernel density estimation in high dimensions , author=. Advances in neural information processing systems , volume=

work page
[46]

Rehashing kernel evaluation in high dimensions , Year =

Siminelakis, Paris and Rong, Kexin and Bailis, Peter and Charikar, Moses and Levis, Philip , Booktitle =. Rehashing kernel evaluation in high dimensions , Year =

work page
[47]

Efficient Density Evaluation for Smooth Kernels , Year =

Backurs, Arturs and Charikar, Moses and Indyk, Piotr and Siminelakis, Paris , Booktitle =. Efficient Density Evaluation for Smooth Kernels , Year =

work page
[48]

International Conference on Artificial Intelligence and Statistics , pages=

Deann: Speeding up kernel-density estimation using approximate nearest neighbor search , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2022 , organization=

work page 2022
[49]

Faster Kernel Density Estimation via Hashing Based Time--Space Tradeoffs , author=

work page
[50]

International Conference on Machine Learning , pages=

Faster kernel matrix algebra via density estimation , author=. International Conference on Machine Learning , pages=. 2021 , organization=

work page 2021
[51]

2023 , title=

Bakshi, Ainesh and Indyk, Piotr and Kacham, Praneeth and Silwal, Sandeep and Zhou, Samson , booktitle=. 2023 , title=

work page 2023
[52]

arXiv preprint arXiv:2510.02540 , year=

Even Faster Kernel Matrix Linear Algebra via Density Estimation , author=. arXiv preprint arXiv:2510.02540 , year=

work page arXiv
[53]

International Conference on Machine Learning , pages=

Kdeformer: Accelerating transformers via kernel density estimation , author=. International Conference on Machine Learning , pages=. 2023 , organization=

work page 2023
[54]

Advances in neural information processing systems , volume=

Attention is all you need , author=. Advances in neural information processing systems , volume=

work page
[55]

International Conference on Learning Representations (ICLR) , year=

Rethinking attention with performers , author=. International Conference on Learning Representations (ICLR) , year=

work page
[56]

Xiong, Yunyang and Zeng, Zhanpeng and Chakraborty, Rudrasis and Tan, Mingxing and Fung, Glenn and Li, Yin and Singh, Vikas , booktitle=. Nystr

work page
[57]

ICML 2025 Workshop on Long-Context Foundation Models , year=

Thinformer: Guaranteed Attention Approximation via Low-Rank Thinning , author=. ICML 2025 Workshop on Long-Context Foundation Models , year=

work page 2025
[58]

International Conference on Learning Representations (ICLR) , year=

Improved algorithms for kernel matrix-vector multiplication under sparsity assumptions , author=. International Conference on Learning Representations (ICLR) , year=

work page
[59]

Journal of Machine Learning Research , volume=

Quasi-Monte Carlo feature maps for shift-invariant kernels , author=. Journal of Machine Learning Research , volume=

work page
[60]

Journal of machine learning research , volume=

On the equivalence between kernel quadrature rules and random feature expansions , author=. Journal of machine learning research , volume=

work page
[61]

Advances in neural information processing systems , volume=

Quadrature-based features for kernel approximation , author=. Advances in neural information processing systems , volume=

work page
[62]

Forty-first International Conference on Machine Learning , year=

Quasi-monte carlo features for kernel approximation , author=. Forty-first International Conference on Machine Learning , year=

work page
[63]

Less is more: Nystr

Rudi, Alessandro and Camoriano, Raffaello and Rosasco, Lorenzo , journal=. Less is more: Nystr

work page
[64]

Revisiting the Nystr

Gittens, Alex and Mahoney, Michael W , journal=. Revisiting the Nystr. 2016 , publisher=

work page 2016
[65]

Advances in neural information processing systems , volume=

Recursive sampling for the nystrom method , author=. Advances in neural information processing systems , volume=

work page
[66]

arXiv preprint arXiv:2601.09477 , year=

Engineering Compressed Matrix Multiplication with the Fast Walsh-Hadamard Transform , author=. arXiv preprint arXiv:2601.09477 , year=

work page arXiv

[1] [1]

Advances in neural information processing systems , volume=

Random features for large-scale kernel machines , author=. Advances in neural information processing systems , volume=

work page

[2] [2]

Proceedings of the international conference on machine learning , volume=

Fastfood-approximating kernel expansions in loglinear time , author=. Proceedings of the international conference on machine learning , volume=

work page

[3] [3]

SIAM Journal on computing , volume=

The fast Johnson--Lindenstrauss transform and approximate nearest neighbors , author=. SIAM Journal on computing , volume=. 2009 , publisher=

work page 2009

[4] [4]

Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms , pages=

Dimensionality reduction: beyond the Johnson-Lindenstrauss bound , author=. Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms , pages=. 2011 , organization=

work page 2011

[5] [5]

Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing , pages=

Uniform approximations for randomized hadamard transforms with applications , author=. Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing , pages=

work page

[6] [6]

International Conference on Machine Learning , pages=

Recycling randomness with structure for sublinear time kernel expansions , author=. International Conference on Machine Learning , pages=. 2016 , organization=

work page 2016

[7] [7]

International Conference on Artificial Intelligence and Statistics , pages=

The geometry of random features , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2018 , organization=

work page 2018

[8] [8]

2018 , publisher=

High-dimensional probability: An introduction with applications in data science , author=. 2018 , publisher=

work page 2018

[9] [9]

Lecture notes , year=

Introduction to Malliavin calculus , author=. Lecture notes , year=

work page

[10] [10]

2015 , publisher=

Introduction to uncertainty quantification , author=. 2015 , publisher=

work page 2015

[11] [11]

Summer school on machine learning , pages=

Concentration inequalities , author=. Summer school on machine learning , pages=. 2003 , publisher=

work page 2003

[12] [12]

American Journal of Mathematics , volume=

The homogeneous chaos , author=. American Journal of Mathematics , volume=. 1938 , publisher=

work page 1938

[13] [13]

Applied and Computational Harmonic Analysis , volume=

Approximation by exponential sums revisited , author=. Applied and Computational Harmonic Analysis , volume=. 2010 , publisher=

work page 2010

[14] [14]

Journal of Machine Learning Research , volume=

Differential privacy for functions and functional data , author=. Journal of Machine Learning Research , volume=

work page

[15] [15]

2013 , school=

New Statistical Applications for Differential Privacy , author=. 2013 , school=

work page 2013

[16] [16]

Journal of Machine Learning Research , volume=

Differentially private data releasing for smooth queries , author=. Journal of Machine Learning Research , volume=

work page

[17] [17]

Thirty-First AAAI Conference on Artificial Intelligence , year=

The bernstein mechanism: Function release under differential privacy , author=. Thirty-First AAAI Conference on Artificial Intelligence , year=

work page

[18] [18]

2021 , booktitle =

Coleman, Benjamin and Shrivastava, Anshumali , title =. 2021 , booktitle =

work page 2021

[19] [19]

International Conference on Machine Learning (ICML) , pages=

Fast private kernel density estimation via locality sensitive quantization , author=. International Conference on Machine Learning (ICML) , pages=. 2023 , organization=

work page 2023

[20] [20]

International Conference on Learning Representations (ICLR) , year=

Efficiently Computing Similarities to Private Datasets , author=. International Conference on Learning Representations (ICLR) , year=

work page

[21] [21]

International Conference on Learning Representations (ICLR) , year=

Learning from End User Data with Shuffled Differential Privacy over Kernel Densities , author=. International Conference on Learning Representations (ICLR) , year=

work page

[22] [22]

Theory of cryptography conference (TCC) , pages=

Calibrating noise to sensitivity in private data analysis , author=. Theory of cryptography conference (TCC) , pages=. 2006 , organization=

work page 2006

[23] [23]

Foundations and Trends

The algorithmic foundations of differential privacy , author=. Foundations and Trends. 2014 , publisher=

work page 2014

[24] [24]

Contemporary mathematics , volume=

Extensions of Lipschitz mappings into a Hilbert space , author=. Contemporary mathematics , volume=

work page

[25] [25]

SIAM Journal on Scientific and Statistical Computing , volume=

The fast Gauss transform , author=. SIAM Journal on Scientific and Statistical Computing , volume=. 1991 , publisher=

work page 1991

[26] [26]

Advances in neural information processing systems , volume=

The unreasonable effectiveness of structured random orthogonal embeddings , author=. Advances in neural information processing systems , volume=

work page

[27] [27]

Advances in neural information processing systems , volume=

Practical and optimal LSH for angular distance , author=. Advances in neural information processing systems , volume=

work page

[28] [28]

Advances in neural information processing systems , volume=

Orthogonal random features , author=. Advances in neural information processing systems , volume=

work page

[29] [29]

2006 47th annual IEEE symposium on foundations of computer science (FOCS'06) , pages=

Improved approximation algorithms for large matrices via random projections , author=. 2006 47th annual IEEE symposium on foundations of computer science (FOCS'06) , pages=. 2006 , organization=

work page 2006

[30] [30]

Advances in Adaptive Data Analysis , volume=

Improved analysis of the subsampled randomized Hadamard transform , author=. Advances in Adaptive Data Analysis , volume=. 2011 , publisher=

work page 2011

[31] [31]

Numerische mathematik , volume=

Faster least squares approximation , author=. Numerische mathematik , volume=. 2011 , publisher=

work page 2011

[32] [32]

SIAM Journal on Matrix Analysis and Applications , volume=

Improved matrix algorithms via the subsampled randomized Hadamard transform , author=. SIAM Journal on Matrix Analysis and Applications , volume=. 2013 , publisher=

work page 2013

[33] [33]

ACM Transactions on Algorithms (TALG) , volume=

An almost optimal unrestricted fast Johnson-Lindenstrauss transform , author=. ACM Transactions on Algorithms (TALG) , volume=. 2013 , publisher=

work page 2013

[34] [34]

Advances in neural information processing systems , volume=

Faster ridge regression via the subsampled randomized hadamard transform , author=. Advances in neural information processing systems , volume=

work page

[35] [35]

2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS) , pages=

Sub-quadratic (1+ )-approximate Euclidean Spanners, with Applications , author=. 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS) , pages=. 2023 , organization=

work page 2023

[36] [36]

Advances in Neural Information Processing Systems , volume=

Differentially private approximate near neighbor counting in high dimensions , author=. Advances in Neural Information Processing Systems , volume=

work page

[37] [37]

Discrete & Computational Geometry , volume=

A nonlinear approach to dimension reduction , author=. Discrete & Computational Geometry , volume=. 2015 , publisher=

work page 2015

[38] [38]

Conference on Learning Theory , pages=

Discrepancy, coresets, and sketches in machine learning , author=. Conference on Learning Theory , pages=. 2019 , organization=

work page 2019

[39] [39]

International Conference on Machine Learning , pages=

Towards a learning theory of cause-effect inference , author=. International Conference on Machine Learning , pages=. 2015 , organization=

work page 2015

[40] [40]

Artificial Intelligence and Statistics , pages=

Sequential kernel herding: Frank-Wolfe optimization for particle filtering , author=. Artificial Intelligence and Statistics , pages=. 2015 , organization=

work page 2015

[41] [41]

Discrete & Computational Geometry , volume=

Near-optimal coresets of kernel density estimates , author=. Discrete & Computational Geometry , volume=. 2020 , publisher=

work page 2020

[42] [42]

Journal of Machine Learning Research , volume=

Kernel thinning , author=. Journal of Machine Learning Research , volume=

work page

[43] [43]

2020 IEEE 61st Annual Symposium on Foundations of Computer Science , pages=

Kernel density estimation through density constrained near neighbor search , author=. 2020 IEEE 61st Annual Symposium on Foundations of Computer Science , pages=. 2020 , organization=

work page 2020

[44] [44]

2017 IEEE 58th Annual Symposium on Foundations of Computer Science , pages=

Hashing-based-estimators for kernel density in high dimensions , author=. 2017 IEEE 58th Annual Symposium on Foundations of Computer Science , pages=. 2017 , organization=

work page 2017

[45] [45]

Advances in neural information processing systems , volume=

Space and time efficient kernel density estimation in high dimensions , author=. Advances in neural information processing systems , volume=

work page

[46] [46]

Rehashing kernel evaluation in high dimensions , Year =

Siminelakis, Paris and Rong, Kexin and Bailis, Peter and Charikar, Moses and Levis, Philip , Booktitle =. Rehashing kernel evaluation in high dimensions , Year =

work page

[47] [47]

Efficient Density Evaluation for Smooth Kernels , Year =

Backurs, Arturs and Charikar, Moses and Indyk, Piotr and Siminelakis, Paris , Booktitle =. Efficient Density Evaluation for Smooth Kernels , Year =

work page

[48] [48]

International Conference on Artificial Intelligence and Statistics , pages=

Deann: Speeding up kernel-density estimation using approximate nearest neighbor search , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2022 , organization=

work page 2022

[49] [49]

Faster Kernel Density Estimation via Hashing Based Time--Space Tradeoffs , author=

work page

[50] [50]

International Conference on Machine Learning , pages=

Faster kernel matrix algebra via density estimation , author=. International Conference on Machine Learning , pages=. 2021 , organization=

work page 2021

[51] [51]

2023 , title=

Bakshi, Ainesh and Indyk, Piotr and Kacham, Praneeth and Silwal, Sandeep and Zhou, Samson , booktitle=. 2023 , title=

work page 2023

[52] [52]

arXiv preprint arXiv:2510.02540 , year=

Even Faster Kernel Matrix Linear Algebra via Density Estimation , author=. arXiv preprint arXiv:2510.02540 , year=

work page arXiv

[53] [53]

International Conference on Machine Learning , pages=

Kdeformer: Accelerating transformers via kernel density estimation , author=. International Conference on Machine Learning , pages=. 2023 , organization=

work page 2023

[54] [54]

Advances in neural information processing systems , volume=

Attention is all you need , author=. Advances in neural information processing systems , volume=

work page

[55] [55]

International Conference on Learning Representations (ICLR) , year=

Rethinking attention with performers , author=. International Conference on Learning Representations (ICLR) , year=

work page

[56] [56]

Xiong, Yunyang and Zeng, Zhanpeng and Chakraborty, Rudrasis and Tan, Mingxing and Fung, Glenn and Li, Yin and Singh, Vikas , booktitle=. Nystr

work page

[57] [57]

ICML 2025 Workshop on Long-Context Foundation Models , year=

Thinformer: Guaranteed Attention Approximation via Low-Rank Thinning , author=. ICML 2025 Workshop on Long-Context Foundation Models , year=

work page 2025

[58] [58]

International Conference on Learning Representations (ICLR) , year=

Improved algorithms for kernel matrix-vector multiplication under sparsity assumptions , author=. International Conference on Learning Representations (ICLR) , year=

work page

[59] [59]

Journal of Machine Learning Research , volume=

Quasi-Monte Carlo feature maps for shift-invariant kernels , author=. Journal of Machine Learning Research , volume=

work page

[60] [60]

Journal of machine learning research , volume=

On the equivalence between kernel quadrature rules and random feature expansions , author=. Journal of machine learning research , volume=

work page

[61] [61]

Advances in neural information processing systems , volume=

Quadrature-based features for kernel approximation , author=. Advances in neural information processing systems , volume=

work page

[62] [62]

Forty-first International Conference on Machine Learning , year=

Quasi-monte carlo features for kernel approximation , author=. Forty-first International Conference on Machine Learning , year=

work page

[63] [63]

Less is more: Nystr

Rudi, Alessandro and Camoriano, Raffaello and Rosasco, Lorenzo , journal=. Less is more: Nystr

work page

[64] [64]

Revisiting the Nystr

Gittens, Alex and Mahoney, Michael W , journal=. Revisiting the Nystr. 2016 , publisher=

work page 2016

[65] [65]

Advances in neural information processing systems , volume=

Recursive sampling for the nystrom method , author=. Advances in neural information processing systems , volume=

work page

[66] [66]

arXiv preprint arXiv:2601.09477 , year=

Engineering Compressed Matrix Multiplication with the Fast Walsh-Hadamard Transform , author=. arXiv preprint arXiv:2601.09477 , year=

work page arXiv