Eyeriss: a spatial architecture for energy-efficient dataflow for convolutional neural networks,

Yu-Hsin Chen, Joel Emer, Vivienne Sze · 2016 · DOI 10.1109/isca.2016.40

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open at publisher browse 4 citing papers

representative citing papers

The Turbo-Charged Mapper: Fast and Optimal Mapping for Energy-efficient and Low-latency Accelerator Design

cs.AR · 2026-02-16 · unverdicted · novelty 8.0

TCM finds provably optimal DNN accelerator mappings by pruning the search space up to 32 orders of magnitude with a new dataplacement concept, delivering 1.2-6.5x better energy-delay-product in 17 seconds instead of hours.

Fast and Fusiest: An Optimal Fusion-Aware Mapper for Accelerator Design

cs.AR · 2026-02-16 · unverdicted · novelty 7.0

FFM finds optimal fused mappings for tensor accelerators over 10,000 times faster than prior mappers while cutting energy-delay product by up to 1.8x versus hand-tuned designs.

SEADA: An efficient methodology for optimizing mixed-precision DNNs on multi-precision spatial architectures

cs.AR · 2026-06-26 · unverdicted · novelty 4.0

SEADA introduces an analytical framework combining cost models, mapping tools, and entropy-based precision selection to optimize mixed-precision DNNs on multi-precision spatial architectures.

Defeat the Heap: Zero-Copy Data Movement in AXI4MLIR

cs.AR · 2026-06-09 · unverdicted · novelty 4.0

Extending the accel dialect in AXI4MLIR with direct DMA-mapped allocation eliminates a staging copy and reduces main memory data movement by up to 2x on matrix multiplication accelerators.

citing papers explorer

Showing 4 of 4 citing papers after filters.

The Turbo-Charged Mapper: Fast and Optimal Mapping for Energy-efficient and Low-latency Accelerator Design cs.AR · 2026-02-16 · unverdicted · none · ref 8
TCM finds provably optimal DNN accelerator mappings by pruning the search space up to 32 orders of magnitude with a new dataplacement concept, delivering 1.2-6.5x better energy-delay-product in 17 seconds instead of hours.
Fast and Fusiest: An Optimal Fusion-Aware Mapper for Accelerator Design cs.AR · 2026-02-16 · unverdicted · none · ref 11
FFM finds optimal fused mappings for tensor accelerators over 10,000 times faster than prior mappers while cutting energy-delay product by up to 1.8x versus hand-tuned designs.
SEADA: An efficient methodology for optimizing mixed-precision DNNs on multi-precision spatial architectures cs.AR · 2026-06-26 · unverdicted · none · ref 7
SEADA introduces an analytical framework combining cost models, mapping tools, and entropy-based precision selection to optimize mixed-precision DNNs on multi-precision spatial architectures.
Defeat the Heap: Zero-Copy Data Movement in AXI4MLIR cs.AR · 2026-06-09 · unverdicted · none · ref 10
Extending the accel dialect in AXI4MLIR with direct DMA-mapped allocation eliminates a staging copy and reduces main memory data movement by up to 2x on matrix multiplication accelerators.

Eyeriss: a spatial architecture for energy-efficient dataflow for convolutional neural networks,

fields

years

verdicts

representative citing papers

citing papers explorer