Hardware acceleration of LLMs: A comprehensive survey and comparison

2024 · arXiv 2409.03384

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures

cs.DC · 2026-05-05 · unverdicted · novelty 5.0

Microbenchmark-driven analytical models for B200 and MI300A achieve 1.31% and 0.09% MAE on validation kernels, far outperforming roofline baselines exceeding 95% error.

Secure eFPGA-Enabled Edge LLM Inference: Architectural and Hardware Countermeasures

cs.CR · 2026-04-24 · unverdicted · novelty 5.0

A hybrid ASIC+eFPGA architecture is proposed to add adaptive security mechanisms to edge LLM inference while retaining ASIC efficiency.

citing papers explorer

Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures cs.DC · 2026-05-05 · unverdicted · none · ref 3
Secure eFPGA-Enabled Edge LLM Inference: Architectural and Hardware Countermeasures cs.CR · 2026-04-24 · unverdicted · none · ref 13
