Title resolution pending

Layer Normalization , author= · 2016

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

browse 9 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

Enjoy Your Layer Normalization with the Computational Efficiency of RMSNorm

cs.LG · 2026-05-14 · conditional · novelty 7.0

A framework to identify and convert foldable layer normalizations to RMSNorm for exact equivalence and faster inference in deep neural networks.

Improving the Performance and Learning Stability of Parallelizable RNNs Designed for Ultra-Low Power Applications

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Cumulative state updates in CMRU restore gradient flow through time in quantized bistable RNNs, yielding more stable convergence and competitive or superior performance versus LRUs and minGRUs on long-range sequence tasks.

ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

cs.LG · 2026-05-01 · unverdicted · novelty 7.0

ResRL decouples shared semantics between positive and negative responses in LLM reinforcement learning via SVD-based projection residuals, outperforming baselines including NSR by up to 9.4% on math reasoning benchmarks.

Latent Dynamics for Full Body Avatar Animation

cs.CV · 2026-05-20 · unverdicted · novelty 6.0

The work augments pose-conditioned 3D Gaussian avatars with a residual latent evolved by a transformer decoder that decomposes updates into driving, restoring, and dissipative forces to produce history-dependent, temporally coherent full-body animations.

Behavior-Consistent Deep Reinforcement Learning

cs.LG · 2026-05-20 · unverdicted · novelty 6.0 · 2 refs

QED bounds cross-run KL divergence in Boltzmann policies by setting temperature proportional to Q-disagreement and reduces return variance by two orders of magnitude on 18 continuous-control tasks without performance loss.

STELLAR: Scaling 3D Perception Large Models for Autonomous Driving

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

STELLAR trains up to 500M-parameter multi-modal models on 50M driving scenes and reports empirical scaling trends plus new state-of-the-art results on the Waymo Open Dataset.

Double Metric Learning for Building Directed Graphs with Chain Connections for the ATLAS ITk Detector

physics.data-an · 2026-05-13 · unverdicted · novelty 6.0

Double metric learning learns two embeddings per node to build directed graphs with chain connections, yielding better performance than single metric learning for high-pT particles and accurate edge direction prediction in ATLAS ITk simulations.

RT-Transformer: The Transformer Block as a Spherical State Estimator

cs.LG · 2026-05-10 · unverdicted · novelty 6.0

Transformer components arise as the natural solution to precision-weighted directional state estimation on the hypersphere.

mlr3torch: A Deep Learning Framework in R based on mlr3 and torch

stat.ML · 2026-04-20 · unverdicted · novelty 6.0

mlr3torch introduces an extensible deep learning framework in R that integrates torch models into the mlr3 ecosystem via graph-based architectures for classification, regression, and multimodal tasks.

citing papers explorer

Showing 9 of 9 citing papers.

Enjoy Your Layer Normalization with the Computational Efficiency of RMSNorm cs.LG · 2026-05-14 · conditional · none · ref 5
A framework to identify and convert foldable layer normalizations to RMSNorm for exact equivalence and faster inference in deep neural networks.
Improving the Performance and Learning Stability of Parallelizable RNNs Designed for Ultra-Low Power Applications cs.LG · 2026-05-12 · unverdicted · none · ref 9
Cumulative state updates in CMRU restore gradient flow through time in quantized bistable RNNs, yielding more stable convergence and competitive or superior performance versus LRUs and minGRUs on long-range sequence tasks.
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning cs.LG · 2026-05-01 · unverdicted · none · ref 33
ResRL decouples shared semantics between positive and negative responses in LLM reinforcement learning via SVD-based projection residuals, outperforming baselines including NSR by up to 9.4% on math reasoning benchmarks.
Latent Dynamics for Full Body Avatar Animation cs.CV · 2026-05-20 · unverdicted · none · ref 69
The work augments pose-conditioned 3D Gaussian avatars with a residual latent evolved by a transformer decoder that decomposes updates into driving, restoring, and dissipative forces to produce history-dependent, temporally coherent full-body animations.
Behavior-Consistent Deep Reinforcement Learning cs.LG · 2026-05-20 · unverdicted · none · ref 35 · 2 links
QED bounds cross-run KL divergence in Boltzmann policies by setting temperature proportional to Q-disagreement and reduces return variance by two orders of magnitude on 18 continuous-control tasks without performance loss.
STELLAR: Scaling 3D Perception Large Models for Autonomous Driving cs.CV · 2026-05-19 · unverdicted · none · ref 43
STELLAR trains up to 500M-parameter multi-modal models on 50M driving scenes and reports empirical scaling trends plus new state-of-the-art results on the Waymo Open Dataset.
Double Metric Learning for Building Directed Graphs with Chain Connections for the ATLAS ITk Detector physics.data-an · 2026-05-13 · unverdicted · none · ref 21
Double metric learning learns two embeddings per node to build directed graphs with chain connections, yielding better performance than single metric learning for high-pT particles and accurate edge direction prediction in ATLAS ITk simulations.
RT-Transformer: The Transformer Block as a Spherical State Estimator cs.LG · 2026-05-10 · unverdicted · none · ref 162
Transformer components arise as the natural solution to precision-weighted directional state estimation on the hypersphere.
mlr3torch: A Deep Learning Framework in R based on mlr3 and torch stat.ML · 2026-04-20 · unverdicted · none · ref 69
mlr3torch introduces an extensible deep learning framework in R that integrates torch models into the mlr3 ecosystem via graph-based architectures for classification, regression, and multimodal tasks.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer