Title resolution pending

Deep Learning , author= · 2016

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

The Global Empirical NTK: Self-Referential Bias and Dimensionality of Gradient Descent Learning

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

The global empirical NTK for finite-width networks has a universal Kronecker-core form that makes it structurally low-rank and biases gradient descent toward dominant modes of joint input-hidden activity.

Structured Neural Marked Point Processes for Interpretable Event Interaction Modeling

cs.LG · 2026-05-17 · unverdicted · novelty 6.0 · 2 refs

SNMPP builds a product-form neural influence kernel from a signed interaction network over event classes and a delay-aware monotonic temporal network to enable explicit discovery of inter-event relationships alongside strong prediction.

Entropy-Based Characterisation of the Polarised Regime in Latent Variable Models

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

An entropy criterion on mean representations characterises the polarised regime in VAEs and related models, with theoretical links to KL minimisation and empirical tests across several architectures.

State-Space NTK Collapse Near Bifurcations

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

Bifurcations cause sNTK to reduce to a dominant rank-one channel matching normal forms, collapsing effective rank and funneling gradient descent into critical dynamical directions.

SPHERE: Mitigating the Loss of Spectral Plasticity in Mixture-of-Experts for Deep Reinforcement Learning

cs.LG · 2026-05-06 · unverdicted · novelty 6.0 · 2 refs

SPHERE applies a Parseval penalty to MoE policies in continual RL to maintain spectral plasticity, yielding 133% and 50% higher average success on MetaWorld and HumanoidBench versus unregularized MoE baselines.

EmoMM: Benchmarking and Steering MLLM for Multimodal Emotion Recognition under Conflict and Missingness

cs.CV · 2026-05-01 · unverdicted · novelty 6.0

EmoMM benchmark reveals Video Contribution Collapse in MLLMs for emotion recognition under modality conflict and missingness, mitigated by CHASE head-level attention steering.

Rethinking Intrinsic Dimension Estimation in Neural Representations

cs.LG · 2026-04-22 · unverdicted · novelty 6.0

Common ID estimators fail to track the true intrinsic dimension of neural representations and are instead driven by other factors.

Cross-Stock Predictability via LLM-Augmented Semantic Networks

q-fin.PM · 2026-04-21 · unverdicted · novelty 6.0

LLM filtering of embedding-based stock networks raises long-short Sharpe ratio from 0.742 to 0.820 and cuts max drawdown from -10.47% to -7.85% in 2011-2019 S&P 500 backtests.

Adaptive Norm-Based Regularization for Neural Networks

stat.ML · 2026-04-30 · unverdicted · novelty 5.0

Covariance-aware ridge and combined l1-l2 regularizers for neural networks yield better predictive performance and complexity control than standard penalties in simulations and applications to cooling-load prediction and leukemia classification.

Revitalizing the Beginning: Avoiding Storage Dependency for Model Merging in Continual Learning

cs.LG · 2026-05-08 · unverdicted · novelty 4.0

The paper proposes Trajectory Regularized Merging (TRM) to enable storage-free model merging in continual learning by optimizing in an augmented trajectory subspace with task alignment, prediction consistency, and gradient responsiveness objectives, claiming SOTA results.

Hybrid TF--IDF Logistic Regression and MLP Neural Baseline for Indonesian Three-Class Sentiment Analysis on Social Media Text

cs.CL · 2026-05-08 · unverdicted · novelty 2.0

Logistic regression using TF-IDF features and three metadata attributes achieves 0.8028 accuracy, 0.8003 weighted F1, and 0.7276 macro F1 on three-class Indonesian sentiment classification from a 707-sample imbalanced dataset.

citing papers explorer

Showing 11 of 11 citing papers.

The Global Empirical NTK: Self-Referential Bias and Dimensionality of Gradient Descent Learning cs.LG · 2026-05-09 · unverdicted · none · ref 33
The global empirical NTK for finite-width networks has a universal Kronecker-core form that makes it structurally low-rank and biases gradient descent toward dominant modes of joint input-hidden activity.
Structured Neural Marked Point Processes for Interpretable Event Interaction Modeling cs.LG · 2026-05-17 · unverdicted · none · ref 12 · 2 links
SNMPP builds a product-form neural influence kernel from a signed interaction network over event classes and a delay-aware monotonic temporal network to enable explicit discovery of inter-event relationships alongside strong prediction.
Entropy-Based Characterisation of the Polarised Regime in Latent Variable Models cs.LG · 2026-05-15 · unverdicted · none · ref 61
An entropy criterion on mean representations characterises the polarised regime in VAEs and related models, with theoretical links to KL minimisation and empirical tests across several architectures.
State-Space NTK Collapse Near Bifurcations cs.LG · 2026-05-12 · unverdicted · none · ref 33
Bifurcations cause sNTK to reduce to a dominant rank-one channel matching normal forms, collapsing effective rank and funneling gradient descent into critical dynamical directions.
SPHERE: Mitigating the Loss of Spectral Plasticity in Mixture-of-Experts for Deep Reinforcement Learning cs.LG · 2026-05-06 · unverdicted · none · ref 43 · 2 links
SPHERE applies a Parseval penalty to MoE policies in continual RL to maintain spectral plasticity, yielding 133% and 50% higher average success on MetaWorld and HumanoidBench versus unregularized MoE baselines.
EmoMM: Benchmarking and Steering MLLM for Multimodal Emotion Recognition under Conflict and Missingness cs.CV · 2026-05-01 · unverdicted · none · ref 109
EmoMM benchmark reveals Video Contribution Collapse in MLLMs for emotion recognition under modality conflict and missingness, mitigated by CHASE head-level attention steering.
Rethinking Intrinsic Dimension Estimation in Neural Representations cs.LG · 2026-04-22 · unverdicted · none · ref 3
Common ID estimators fail to track the true intrinsic dimension of neural representations and are instead driven by other factors.
Cross-Stock Predictability via LLM-Augmented Semantic Networks q-fin.PM · 2026-04-21 · unverdicted · none · ref 69
LLM filtering of embedding-based stock networks raises long-short Sharpe ratio from 0.742 to 0.820 and cuts max drawdown from -10.47% to -7.85% in 2011-2019 S&P 500 backtests.
Adaptive Norm-Based Regularization for Neural Networks stat.ML · 2026-04-30 · unverdicted · none · ref 4
Covariance-aware ridge and combined l1-l2 regularizers for neural networks yield better predictive performance and complexity control than standard penalties in simulations and applications to cooling-load prediction and leukemia classification.
Revitalizing the Beginning: Avoiding Storage Dependency for Model Merging in Continual Learning cs.LG · 2026-05-08 · unverdicted · none · ref 48
The paper proposes Trajectory Regularized Merging (TRM) to enable storage-free model merging in continual learning by optimizing in an augmented trajectory subspace with task alignment, prediction consistency, and gradient responsiveness objectives, claiming SOTA results.
Hybrid TF--IDF Logistic Regression and MLP Neural Baseline for Indonesian Three-Class Sentiment Analysis on Social Media Text cs.CL · 2026-05-08 · unverdicted · none · ref 12
Logistic regression using TF-IDF features and three metadata attributes achieves 0.8028 accuracy, 0.8003 weighted F1, and 0.7276 macro F1 on three-class Indonesian sentiment classification from a 707-sample imbalanced dataset.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer