Deep residual learning for image recognition

He, K · 2016

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

Energy-Gated Attention: Spectral Salience as an Inductive Bias for Transformer Attention

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

Energy-Gated Attention improves language model validation loss by gating attention according to spectral energy of key embeddings discovered by a learned projection, with consistent gains on TinyShakespeare and Penn Treebank using under 0.26% extra parameters.

HELIX: Hybrid Encoding with Learnable Identity and Cross-dimensional Synthesis for Time Series Imputation

cs.LG · 2026-05-04 · unverdicted · novelty 6.0

HELIX uses learnable feature identities and hybrid temporal-feature attention to achieve state-of-the-art time series imputation across multiple datasets and settings.

Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection

cs.CV · 2024-11-23 · unverdicted · novelty 6.0

Orthogonal subspace decomposition via SVD on vision foundation model features preserves high-rank pre-trained knowledge by freezing principal components and adapting residuals, reducing overfitting for better generalization in AI-generated image detection.

citing papers explorer

Showing 3 of 3 citing papers.

Energy-Gated Attention: Spectral Salience as an Inductive Bias for Transformer Attention cs.LG · 2026-05-21 · unverdicted · none · ref 5
Energy-Gated Attention improves language model validation loss by gating attention according to spectral energy of key embeddings discovered by a learned projection, with consistent gains on TinyShakespeare and Penn Treebank using under 0.26% extra parameters.
HELIX: Hybrid Encoding with Learnable Identity and Cross-dimensional Synthesis for Time Series Imputation cs.LG · 2026-05-04 · unverdicted · none · ref 11
HELIX uses learnable feature identities and hybrid temporal-feature attention to achieve state-of-the-art time series imputation across multiple datasets and settings.
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection cs.CV · 2024-11-23 · unverdicted · none · ref 30
Orthogonal subspace decomposition via SVD on vision foundation model features preserves high-rank pre-trained knowledge by freezing principal components and adapting residuals, reducing overfitting for better generalization in AI-generated image detection.

Deep residual learning for image recognition

fields

years

verdicts

representative citing papers

citing papers explorer