Ini- tialization and regularization of factorized neural layers

Initialization, regularization of factorized neural layers , author= · arXiv 2105.01029

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models

cs.LG · 2025-12-13 · unverdicted · novelty 6.0

BOOST delivers 1.46-2.27x end-to-end speedups for low-rank bottleneck LLMs by redesigning tensor parallelism around the bottleneck structure plus supporting optimizations.

DLR: Zero-Inference-Cost Latent Residuals for Low-Rank Pre-Training

cs.LG · 2026-06-27 · unverdicted · novelty 5.0

DLR augments low-rank factorization with a fixed structured residual during training that is absorbed post-training, improving C4 perplexity for LLaMA models from 60M to 7B while preserving exact low-rank inference cost.

citing papers explorer

Showing 1 of 1 citing paper after filters.

BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models cs.LG · 2025-12-13 · unverdicted · none · ref 9
BOOST delivers 1.46-2.27x end-to-end speedups for low-rank bottleneck LLMs by redesigning tensor parallelism around the bottleneck structure plus supporting optimizations.

Ini- tialization and regularization of factorized neural layers

fields

years

verdicts

representative citing papers

citing papers explorer