pith. sign in

Title resolution pending

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

years

2026 3

verdicts

UNVERDICTED 3

roles

background 1

polarities

background 1

clear filters

representative citing papers

Three Costs of Amortizing Gaussian Process Inference with Neural Processes

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

Derives explicit bounds decomposing KL(GP || LNP) into three costs with decay rates O(e^{-c d^{2/d_x}}) for squared-exponential kernels and O(d^{-2ν/d_x}) for Matérn kernels, plus recommendations to predict variance from locations alone and use second-order pooling.

The Transformer as a Polar State Estimator

cs.LG · 2026-05-10 · unverdicted · novelty 6.0

The standard Transformer block arises as a first-order approximation to a polar state estimator on the hypersphere, with a Polar Transformer retaining higher-order terms.

citing papers explorer

Showing 3 of 3 citing papers after filters.

  • Three Costs of Amortizing Gaussian Process Inference with Neural Processes cs.LG · 2026-05-20 · unverdicted · none · ref 2

    Derives explicit bounds decomposing KL(GP || LNP) into three costs with decay rates O(e^{-c d^{2/d_x}}) for squared-exponential kernels and O(d^{-2ν/d_x}) for Matérn kernels, plus recommendations to predict variance from locations alone and use second-order pooling.

  • The Transformer as a Polar State Estimator cs.LG · 2026-05-10 · unverdicted · none · ref 73

    The standard Transformer block arises as a first-order approximation to a polar state estimator on the hypersphere, with a Polar Transformer retaining higher-order terms.

  • Scalable Gaussian process inference via neural feature maps stat.ML · 2026-05-11 · unverdicted · none · ref 11

    Neural feature maps create expressive kernels that enable fast, scalable, and consistent exact Gaussian process inference for regression and classification.