pith. sign in
Pith Number

pith:5WBQNOLB

pith:2026:5WBQNOLBLLQ2RD6CJTHLXB4WZV
not attested not anchored not stored refs resolved

TTP: A Hardware-Efficient Design for Precise Prefetching in Ray Tracing

Anshul Naithani, Huiyang Zhou, Yavuz Selim Tozlu

A prefetcher that monitors consecutive pops from ray tracing traversal stacks delivers 1.48x average speedup with negligible hardware overhead.

arxiv:2605.16253 v1 · 2026-05-15 · cs.AR

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{5WBQNOLBLLQ2RD6CJTHLXB4WZV}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

We propose a novel hardware prefetcher, named Tree Traversal Prefetcher (TTP), for ray tracing. ... TTP achieves 1.48x speedup on average (up to 1.89x) compared to the baseline, with nearly negligible hardware overhead. TTP achieves 98.92% average L1 accuracy, which is the ratio of the prefetched blocks being actually referenced by demand loads.

C2weakest assumption

The cycle-level simulator Vulkan-sim 2.0 accurately reproduces the memory access patterns and traversal stack behavior of real ray tracing hardware, and that consecutive stack pops reliably indicate useful upward traversal for prefetching without causing cache pollution.

C3one line summary

TTP is a hardware prefetcher for ray tracing that leverages traversal stack addresses during DFS to prefetch BVH nodes, achieving 1.48x average speedup and 98.92% L1 accuracy in cycle-level simulations.

References

46 extracted · 46 resolved · 0 Pith anchors

[1] Code repo for Treelet Prefetching For Ray Tracing (MICRO 2023) 2023
[2] DirectX Raytracing (DXR) Functional Spec
[3] Intel Embree
[4] Intel® Arc™ Graphics Developer Guide for Real-Time Ray Tracing in
[5] NVIDIA ADA GPU ARCHITECTURE

Formal links

2 machine-checked theorem links

Receipt and verification
First computed 2026-05-20T00:02:00.261293Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

ed8306b9615ae1a88fc24ccebb8796cd58bf80e891dd69a525c0dc9bf9447d77

Aliases

arxiv: 2605.16253 · arxiv_version: 2605.16253v1 · doi: 10.48550/arxiv.2605.16253 · pith_short_12: 5WBQNOLBLLQ2 · pith_short_16: 5WBQNOLBLLQ2RD6C · pith_short_8: 5WBQNOLB
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/5WBQNOLBLLQ2RD6CJTHLXB4WZV \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: ed8306b9615ae1a88fc24ccebb8796cd58bf80e891dd69a525c0dc9bf9447d77
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "c5ab0a1ad553811311ee378c9daae680a6d7342143646aeab566c64797311578",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by-nc-sa/4.0/",
    "primary_cat": "cs.AR",
    "submitted_at": "2026-05-15T17:57:31Z",
    "title_canon_sha256": "65d006c18f9f6f00ade01078594acbdbf9c0eb30088b234edfdad4979b9f50af"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.16253",
    "kind": "arxiv",
    "version": 1
  }
}