Derives explicit bounds decomposing KL(GP || LNP) into three costs with decay rates O(e^{-c d^{2/d_x}}) for squared-exponential kernels and O(d^{-2ν/d_x}) for Matérn kernels, plus recommendations to predict variance from locations alone and use second-order pooling.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
Transformer components arise as the natural solution to precision-weighted directional state estimation on the hypersphere.
Neural feature maps create expressive kernels that enable fast, scalable, and consistent exact Gaussian process inference for regression and classification.
citing papers explorer
-
Three Costs of Amortizing Gaussian Process Inference with Neural Processes
Derives explicit bounds decomposing KL(GP || LNP) into three costs with decay rates O(e^{-c d^{2/d_x}}) for squared-exponential kernels and O(d^{-2ν/d_x}) for Matérn kernels, plus recommendations to predict variance from locations alone and use second-order pooling.
-
RT-Transformer: The Transformer Block as a Spherical State Estimator
Transformer components arise as the natural solution to precision-weighted directional state estimation on the hypersphere.
-
Scalable Gaussian process inference via neural feature maps
Neural feature maps create expressive kernels that enable fast, scalable, and consistent exact Gaussian process inference for regression and classification.