2005.864128

Jeroen Dijk, Bishnu Patra, Sushil Subramanian, Xiao Xue, Nodar Samkharadze, Andrea Corna, Charles Jeon, Farhana Sheikh, Esdras Juarez-Hernandez, Brando Esparza, Huzaifa Rampurawala, Brent Carlton, Surej Ravikumar, Carlos Nieva, Sungwon Kim · 2020 · DOI 10.1109/jssc

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open at publisher browse 4 citing papers

representative citing papers

SymbolicLight V1: Spike-Gated Dual-Path Language Modeling with High Activation Sparsity and Sub-Billion-Scale Pre-Training Evidence

cs.CL · 2026-05-20 · unverdicted · novelty 6.0

A 194M-parameter spiking dual-path model trained on 3B Chinese-English tokens achieves held-out PPL 8.88-8.93 at >89% per-element sparsity, trailing GPT-2 201M by 7.7% while showing that LIF temporal integration outperforms simple top-k masking at matched sparsity.

CryoZip: An Efficient Cryogenic Compressor for Quantum Error Correction Syndromes

quant-ph · 2026-06-29 · unverdicted · novelty 5.0

CryoZip delivers up to 48x compression (1.8x over prior art) and 4-26x energy savings for QEC syndromes in 22 nm FDSOI at 4 K, reaching 14,238x bandwidth reduction and 42x energy savings when paired with a predecoder.

Hardware-Software Co-Design of Scalable, Energy-Efficient Analog Recurrent Computations

cs.AR · 2026-05-12 · unverdicted · novelty 5.0 · 2 refs

BMRUs enable analog recurrent neural network hardware via discrete outputs that suppress noise 20-fold, with one-to-one parameter-to-circuit mapping and linear power scaling for recurrence.

ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference

cs.PF · 2025-08-22 · unverdicted · novelty 5.0

ShadowNPU presents shadowAttn, a co-designed sparse attention system that uses NPU pilot compute and techniques like graph bucketing and per-head sparsity to minimize CPU/GPU fallback during on-device LLM inference while maintaining accuracy.

citing papers explorer

Showing 4 of 4 citing papers after filters.

SymbolicLight V1: Spike-Gated Dual-Path Language Modeling with High Activation Sparsity and Sub-Billion-Scale Pre-Training Evidence cs.CL · 2026-05-20 · unverdicted · none · ref 10
A 194M-parameter spiking dual-path model trained on 3B Chinese-English tokens achieves held-out PPL 8.88-8.93 at >89% per-element sparsity, trailing GPT-2 201M by 7.7% while showing that LIF temporal integration outperforms simple top-k masking at matched sparsity.
CryoZip: An Efficient Cryogenic Compressor for Quantum Error Correction Syndromes quant-ph · 2026-06-29 · unverdicted · none · ref 11
CryoZip delivers up to 48x compression (1.8x over prior art) and 4-26x energy savings for QEC syndromes in 22 nm FDSOI at 4 K, reaching 14,238x bandwidth reduction and 42x energy savings when paired with a predecoder.
Hardware-Software Co-Design of Scalable, Energy-Efficient Analog Recurrent Computations cs.AR · 2026-05-12 · unverdicted · none · ref 92 · 2 links
BMRUs enable analog recurrent neural network hardware via discrete outputs that suppress noise 20-fold, with one-to-one parameter-to-circuit mapping and linear power scaling for recurrence.
ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference cs.PF · 2025-08-22 · unverdicted · none · ref 44
ShadowNPU presents shadowAttn, a co-designed sparse attention system that uses NPU pilot compute and techniques like graph bucketing and per-head sparsity to minimize CPU/GPU fallback during on-device LLM inference while maintaining accuracy.

2005.864128

fields

years

verdicts

representative citing papers

citing papers explorer