HAT: Hardware-aware transformers for efficient natural language processing

Hanrui Wang, Zhanghao Wu, Zhijian Liu, Han Cai, Ligeng Zhu, Chuang Gan, Song Han · 2020 · arXiv 2005.14187

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

LLMForge: Multi-Backend Hardware-Aware Neural Architecture Search with Infinite-Head Attention for Edge Language Models

cs.LG · 2026-05-17 · unverdicted · novelty 7.0

LLMForge is a NAS framework with Infinite-Head Attention, a Forge-Former surrogate, and Forge-DSE engine that discovers hardware-specific architectures for edge language models, yielding variants with improved accuracy, energy, or latency on different substrates.

Spiking Neural Network Architecture Search: A Survey

cs.NE · 2025-10-16 · unverdicted · novelty 2.0

A survey of Spiking Neural Network architecture search techniques viewed through a hardware/software co-design lens.

citing papers explorer

Showing 2 of 2 citing papers.

LLMForge: Multi-Backend Hardware-Aware Neural Architecture Search with Infinite-Head Attention for Edge Language Models cs.LG · 2026-05-17 · unverdicted · none · ref 34
LLMForge is a NAS framework with Infinite-Head Attention, a Forge-Former surrogate, and Forge-DSE engine that discovers hardware-specific architectures for edge language models, yielding variants with improved accuracy, energy, or latency on different substrates.
Spiking Neural Network Architecture Search: A Survey cs.NE · 2025-10-16 · unverdicted · none · ref 133
A survey of Spiking Neural Network architecture search techniques viewed through a hardware/software co-design lens.

HAT: Hardware-aware transformers for efficient natural language processing

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer