Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned

Elena Voita, David Talbot, Fedor Moiseev, Rico Sennrich, Ivan Titov · 2019

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

PACT: Peak-Aware Cross-Attention Graph Transformers for Efficient Storm-Surge Emulation

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

PACT introduces a peak-aware cross-attention graph transformer that emulates station-level storm surges more accurately than prior graph neural network baselines while running in seconds after training.

LEAP: Layer-wise Exit-Aware Pretraining for Efficient Transformer Inference

cs.LG · 2026-05-01 · unverdicted · novelty 6.0

LEAP adds a layer-wise exit-aware constraint to standard distillation, reconciling it with early-exit mechanisms and delivering 1.61x wall-clock speedup on MiniLM at 0.95 threshold with 91.9% early exits by layer 7.

citing papers explorer

PACT: Peak-Aware Cross-Attention Graph Transformers for Efficient Storm-Surge Emulation

cs.LG · 2026-05-09 · unverdicted · none · ref 7

LEAP: Layer-wise Exit-Aware Pretraining for Efficient Transformer Inference

cs.LG · 2026-05-01 · unverdicted · none · ref 36
