International Conference on Learning Representations

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

Mean-field perturbation theory of dropout at the edge of chaos yields distinct universality classes for smooth versus kinked activations, critical scaling laws for correlation decay, and front-loaded dropout schedules that reduce test loss.

CHASM: Cross-frequency Harmonized Axis-Separable Mixing for Spectral Token Operators

cs.CV · 2026-05-14 · unverdicted · novelty 7.0

CHASM introduces a cross-frequency harmonized axis-separable spectral mixer using a shared channel eigenbasis plus per-frequency positive gains, yielding consistent gains over same-backbone baselines in medical and natural image tasks.

Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers

stat.ML · 2026-05-08 · unverdicted · novelty 6.0

Spectrum-adaptive post-hoc generalization bounds for multi-layer Transformers are derived using layerwise Schatten quantities whose indices are chosen after training based on singular-value profiles.

VC-FeS: Viewpoint-Conditioned Feature Selection for Vehicle Re-identification in Thermal Vision

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

Viewpoint-conditioned feature selection improves thermal vehicle re-identification mAP by 19.7% on RGBNT100 and 12.8% on a new maritime dataset by adapting RGB ViT extractors.

Refresh-Scaling the Memory of Balanced Adam

cs.LG · 2026-05-11 · unverdicted · novelty 5.0

Setting β in balanced Adam to reach a refresh count R_β ≈ 1000, based on the effective learning horizon T_ES, improves validation robustness over fixed-β baselines across 11 vision and language experiments.

citing papers explorer

Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos · cs.LG · 2026-05-20 · unverdicted · none · ref 18
CHASM: Cross-frequency Harmonized Axis-Separable Mixing for Spectral Token Operators · cs.CV · 2026-05-14 · unverdicted · none · ref 8
Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers · stat.ML · 2026-05-08 · unverdicted · none · ref 8
VC-FeS: Viewpoint-Conditioned Feature Selection for Vehicle Re-identification in Thermal Vision · cs.CV · 2026-05-06 · unverdicted · none · ref 1
Refresh-Scaling the Memory of Balanced Adam · cs.LG · 2026-05-11 · unverdicted · none · ref 9
