pith:TYFP47YC
State-of-art minibatches via novel DPP kernels: discretization, wavelets, and rough objectives
Wavelet-based DPPs on Euclidean space discretize to low-rank kernels that preserve superior variance reduction for minibatches on rough objectives.
arxiv:2605.13127 v1 · 2026-05-13 · stat.ML · cs.LG · math.PR
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{TYFP47YCUZP2S3VOTW5JJBPB4J}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more
Record completeness
Claims
We propose new DPPs on the Euclidean space based on wavelets, with provably better accuracy guarantees than the best known rates. Second, we introduce a general method to convert such continuous DPPs into discrete kernels, which simultaneously preserves the desired variance decay and reveals a low-rank decomposition of the discrete kernel.
The discretization procedure preserves the variance reduction properties of the continuous wavelet DPPs with only negligible degradation when applied to finite datasets, and that the low-rank structure remains exploitable without hidden computational costs.
Wavelet DPP kernels deliver improved continuous variance reduction and a discretization procedure that preserves decay rates for discrete ML subsampling tasks including rough objectives.
References
Receipt and verification
| First computed | 2026-05-18T03:08:57.855857Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
9e0afe7f02a65fa96eae9dba9485e1e24e0a3c97a4edf33d682ca35143f11771
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/TYFP47YCUZP2S3VOTW5JJBPB4J \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 9e0afe7f02a65fa96eae9dba9485e1e24e0a3c97a4edf33d682ca35143f11771
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "37fa01c8078fcd21ad6b556a4f96d3ee0e352ad10bb897463c20c6a22e816d0d",
"cross_cats_sorted": [
"cs.LG",
"math.PR"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "stat.ML",
"submitted_at": "2026-05-13T07:54:37Z",
"title_canon_sha256": "4c3b6fc00a830b55b9144ad0c1642d3ff24bfb8d45287df12715478db6a34303"
},
"schema_version": "1.0",
"source": {
"id": "2605.13127",
"kind": "arxiv",
"version": 1
}
}