A new RTU grid method models the lensing source as a Gaussian process on a ray-transformed uniform grid, achieving comparable fits with roughly half the pixels per dimension and higher ELBOs on mock data.
AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients
3 Pith papers cite this work. Polarity classification is still indexing.
Verdicts: 3 unverdicted
Representative citing papers
Muon-MVR2 attains the optimal anytime convergence rate of ~O(T^{-1/3}) in stochastic non-convex settings under horizon-free schedules.
Ligandformer is a self-attention graph neural network framework that predicts compound properties, outputs attention maps for local structural interpretation, and claims improved robustness and generalization over prior methods.
citing papers explorer
- On the Convergence of Muon and Beyond