Should I Hide My Duck in the Lake?

1 Pith paper cites this work. Polarity classification is still indexing.
abstract

Data lakes spend a significant fraction of query execution time scanning data from remote, disaggregated storage. Decoding alone accounts for 46% of runtime when running TPC-H directly on Parquet files. To address this bottleneck, we propose a vision for a cloud data processing SmartNIC that sits on the network datapath of compute nodes and offloads decoding and pushed-down operators, effectively hiding the cost of parsing raw files. Our experimental estimates with DuckDB suggest that operating directly on pre-filtered data, as delivered by a SmartNIC, can significantly increase query processing performance, and that smaller, less expensive CPUs can still match the query throughput of traditional setups.
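
To put the 46% decoding figure in perspective, the following is a back-of-the-envelope sketch, not the paper's own estimate. It applies Amdahl's law under the assumption that decoding is the only work removed from the CPU; the function name `amdahl_speedup` and the `residual` parameter are hypothetical, introduced here for illustration. With decoding fully hidden, the bound is 1 / (1 - 0.46), roughly 1.85x. Pushed-down operators, which the abstract also proposes to offload, are not modeled here.

```python
def amdahl_speedup(offloaded_fraction: float, residual: float = 0.0) -> float:
    """Upper bound on end-to-end speedup when part of the runtime is offloaded.

    offloaded_fraction: share of the original runtime spent in the offloaded work.
    residual: share of that work the CPU still pays for (0.0 means fully hidden).
    """
    if not 0.0 <= offloaded_fraction < 1.0:
        raise ValueError("offloaded_fraction must be in [0, 1)")
    if not 0.0 <= residual <= 1.0:
        raise ValueError("residual must be in [0, 1]")
    # Runtime left on the CPU, as a fraction of the original runtime.
    remaining = (1.0 - offloaded_fraction) + offloaded_fraction * residual
    return 1.0 / remaining


# From the abstract: decoding share of runtime for TPC-H on Parquet files.
DECODE_FRACTION = 0.46

if __name__ == "__main__":
    # The residual values are illustrative assumptions, not measurements.
    for residual in (0.0, 0.25, 0.5):
        bound = amdahl_speedup(DECODE_FRACTION, residual)
        print(f"residual={residual:.2f}: speedup bound {bound:.2f}x")
```

The `residual` knob shows how quickly the bound shrinks if some decoding cost stays on the host, which is one reason such an estimate should be read as a ceiling rather than a prediction.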

citation-role summary

background 1

citation-polarity summary

fields

cs.AR 1

years

2026 1

verdicts

UNVERDICTED 1

roles

background 1

polarities

background 1

representative citing papers

SCENIC: Stream Computation-Enhanced SmartNIC

cs.AR · 2026-04-16 · unverdicted · novelty 7.0

SCENIC delivers a programmable 200G SmartNIC with offloaded protocol stacks, stream compute units, and full OS transparency that matches commercial performance for custom offloads like collective communication and GPU data partitioning.

citing papers explorer

Showing 1 of 1 citing paper.

  • SCENIC: Stream Computation-Enhanced SmartNIC cs.AR · 2026-04-16 · unverdicted · none · ref 23 · internal anchor
