Title resolution pending

· 2020 · arXiv 5697.2020

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 3 baseline 1

citation-polarity summary

background 3 baseline 1

representative citing papers

Loaded Dice: Solving the Non-Selection Problem for Scalable Probabilistic RowHammer Defense

cs.CR · 2026-05-17 · conditional · novelty 7.0

PrISM uses a Sampled History Queue to correlate row samples across windows, solving the non-selection problem in probabilistic RowHammer mitigation and cutting slowdown from 10.7% to 1.5% at threshold 250 versus prior methods.

A Switch-Centric In-Network Architecture for Accelerating LLM Inference in Shared-Memory Network

cs.AR · 2026-03-30 · unverdicted · novelty 7.0

SCIN uses an in-switch accelerator for direct memory access and 8-bit in-network quantization during All-Reduce, delivering up to 8.7x faster small-message reduction and 1.74x TTFT speedup on LLaMA-2 models.

Qurts: Automatic Quantum Uncomputation by Affine Types with Lifetime

cs.PL · 2024-11-16 · unverdicted · novelty 7.0

Qurts extends Rust with lifetime-parameterized types to provide a uniform framework for automatic quantum uncomputation by allowing temporary affine usage of quantum values.

TokenStack: A Heterogeneous HBM-PIM Architecture and Runtime for Efficient LLM Inference

cs.AR · 2026-05-07 · unverdicted · novelty 6.0

TokenStack's heterogeneous HBM-PIM design with base-die control and topology-aware KV placement delivers 1.62x higher geometric-mean token throughput and 1.70x SLO-compliant serving capacity than AttAcc while cutting per-token energy by 30-47%.

WaveTune: Wave-aware Bilinear Modeling for Efficient GPU Kernel Auto-tuning

cs.PF · 2026-04-11 · unverdicted · novelty 6.0

WaveTune introduces a wave-aware bilinear latency predictor and wave-structured sparse sampling to enable fast runtime auto-tuning of GPU kernels, achieving up to 1.83x kernel speedup and 1.33x TTFT reduction with drastically lower overhead.

A compact QUBO encoding of computational logic formulae demonstrated on cryptography constructions

cs.CR · 2024-09-10 · unverdicted · novelty 6.0

A compact QUBO encoding derived via ILP reduces logical variables by thousands in AES, MD5, SHA1 and SHA256, with over 8x reduction for AES-256.

Wattlytics: A Web Platform for Co-Optimizing Performance, Energy, and TCO in HPC Clusters

cs.DC · 2026-04-09 · unverdicted · novelty 5.0

Wattlytics is a public web platform that integrates benchmark-driven GPU performance scaling, DVFS-aware power modeling, and TCO analysis to support informed HPC cluster design and procurement decisions.

citing papers explorer

Showing 7 of 7 citing papers.

Loaded Dice: Solving the Non-Selection Problem for Scalable Probabilistic RowHammer Defense cs.CR · 2026-05-17 · conditional · none · ref 43
PrISM uses a Sampled History Queue to correlate row samples across windows, solving the non-selection problem in probabilistic RowHammer mitigation and cutting slowdown from 10.7% to 1.5% at threshold 250 versus prior methods.
A Switch-Centric In-Network Architecture for Accelerating LLM Inference in Shared-Memory Network cs.AR · 2026-03-30 · unverdicted · none · ref 2
SCIN uses an in-switch accelerator for direct memory access and 8-bit in-network quantization during All-Reduce, delivering up to 8.7x faster small-message reduction and 1.74x TTFT speedup on LLaMA-2 models.
Qurts: Automatic Quantum Uncomputation by Affine Types with Lifetime cs.PL · 2024-11-16 · unverdicted · none · ref 1
Qurts extends Rust with lifetime-parameterized types to provide a uniform framework for automatic quantum uncomputation by allowing temporary affine usage of quantum values.
TokenStack: A Heterogeneous HBM-PIM Architecture and Runtime for Efficient LLM Inference cs.AR · 2026-05-07 · unverdicted · none · ref 19
TokenStack's heterogeneous HBM-PIM design with base-die control and topology-aware KV placement delivers 1.62x higher geometric-mean token throughput and 1.70x SLO-compliant serving capacity than AttAcc while cutting per-token energy by 30-47%.
WaveTune: Wave-aware Bilinear Modeling for Efficient GPU Kernel Auto-tuning cs.PF · 2026-04-11 · unverdicted · none · ref 16
WaveTune introduces a wave-aware bilinear latency predictor and wave-structured sparse sampling to enable fast runtime auto-tuning of GPU kernels, achieving up to 1.83x kernel speedup and 1.33x TTFT reduction with drastically lower overhead.
A compact QUBO encoding of computational logic formulae demonstrated on cryptography constructions cs.CR · 2024-09-10 · unverdicted · none · ref 19
A compact QUBO encoding derived via ILP reduces logical variables by thousands in AES, MD5, SHA1 and SHA256, with over 8x reduction for AES-256.
Wattlytics: A Web Platform for Co-Optimizing Performance, Energy, and TCO in HPC Clusters cs.DC · 2026-04-09 · unverdicted · none · ref 10
Wattlytics is a public web platform that integrates benchmark-driven GPU performance scaling, DVFS-aware power modeling, and TCO analysis to support informed HPC cluster design and procurement decisions.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer