International Conference on Learning Representations (ICLR) , year=

Decoupled Weight Decay Regularization , author=

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

browse 7 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Positional LSH: Binary Block Matrix Approximation for Attention with Linear Biases

cs.LG · 2026-05-10 · unverdicted · novelty 7.0

ALiBi bias is the expectation of positional LSH-induced block masks, yielding spectral and max-norm approximation bounds that reduce long-context biased attention to randomized short-context unbiased attention.

Convergence Rate Analysis of SOAP with Arbitrary Orthogonal Projection Matrices

math.OC · 2026-04-23 · unverdicted · novelty 7.0

SOAP and its generalizations with arbitrary orthogonal projections converge at a provable rate when the projections are conditionally independent of the current gradient.

DANCE: Detect and Classify Events in EEG

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

DANCE frames EEG event identification as a set-prediction problem to jointly detect and classify events directly from raw, unaligned signals, outperforming existing methods on seizure monitoring and matching onset-informed models on BCI tasks across ten datasets.

BoostAPR: Boosting Automated Program Repair via Execution-Grounded Reinforcement Learning with Dual Reward Models

cs.AI · 2026-05-09 · unverdicted · novelty 6.0 · 2 refs

BoostAPR boosts automated program repair by training a sequence-level assessor and line-level credit allocator from execution outcomes, then applying them in PPO to reach 40.7% on SWE-bench Verified.

Enhancing Consistency Models for Multi-Agent Trajectory Prediction

cs.CV · 2026-05-09 · unverdicted · novelty 6.0

ECTraj enhances consistency models for multi-agent trajectory prediction via improved student-teacher supervision and conditional top-K generation, yielding faster inference and competitive accuracy on Argoverse 2.

bispectrum: Selective $G$-Bispectra Made Practical

cs.LG · 2026-05-08 · conditional · novelty 6.0

bispectrum library delivers selective G-bispectra for seven groups with reduced costs (O(|G|) for finite groups, O(L^2) for spheres), sub-millisecond GPU times, and superior benchmark performance versus standard pooling in low-data regimes.

Personalizing LLMs with Binary Feedback: A Preference-Corrected Optimization Framework

cs.CL · 2026-05-11 · unverdicted · novelty 5.0

C-BPO personalizes LLMs via preference-calibrated binary signals and PU learning theory to isolate inter-user differences from shared task knowledge.

citing papers explorer

Showing 7 of 7 citing papers.

Positional LSH: Binary Block Matrix Approximation for Attention with Linear Biases cs.LG · 2026-05-10 · unverdicted · none · ref 56
ALiBi bias is the expectation of positional LSH-induced block masks, yielding spectral and max-norm approximation bounds that reduce long-context biased attention to randomized short-context unbiased attention.
Convergence Rate Analysis of SOAP with Arbitrary Orthogonal Projection Matrices math.OC · 2026-04-23 · unverdicted · none · ref 5
SOAP and its generalizations with arbitrary orthogonal projections converge at a provable rate when the projections are conditionally independent of the current gradient.
DANCE: Detect and Classify Events in EEG cs.LG · 2026-05-11 · unverdicted · none · ref 47
DANCE frames EEG event identification as a set-prediction problem to jointly detect and classify events directly from raw, unaligned signals, outperforming existing methods on seizure monitoring and matching onset-informed models on BCI tasks across ten datasets.
BoostAPR: Boosting Automated Program Repair via Execution-Grounded Reinforcement Learning with Dual Reward Models cs.AI · 2026-05-09 · unverdicted · none · ref 74 · 2 links
BoostAPR boosts automated program repair by training a sequence-level assessor and line-level credit allocator from execution outcomes, then applying them in PPO to reach 40.7% on SWE-bench Verified.
Enhancing Consistency Models for Multi-Agent Trajectory Prediction cs.CV · 2026-05-09 · unverdicted · none · ref 35
ECTraj enhances consistency models for multi-agent trajectory prediction via improved student-teacher supervision and conditional top-K generation, yielding faster inference and competitive accuracy on Argoverse 2.
bispectrum: Selective $G$-Bispectra Made Practical cs.LG · 2026-05-08 · conditional · none · ref 25
bispectrum library delivers selective G-bispectra for seven groups with reduced costs (O(|G|) for finite groups, O(L^2) for spheres), sub-millisecond GPU times, and superior benchmark performance versus standard pooling in low-data regimes.
Personalizing LLMs with Binary Feedback: A Preference-Corrected Optimization Framework cs.CL · 2026-05-11 · unverdicted · none · ref 34
C-BPO personalizes LLMs via preference-calibrated binary signals and PU learning theory to isolate inter-user differences from shared task knowledge.

International Conference on Learning Representations (ICLR) , year=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer