Deep learning via hessian-free optimization

James Martens et al · 2010

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

FlashSinkhorn: IO-Aware Entropic Optimal Transport on GPU

cs.LG · 2026-02-03 · conditional · novelty 7.0

FlashSinkhorn delivers up to 32x forward and 161x end-to-end speedups for entropic OT on A100 GPUs via IO-aware Triton kernels that fuse log-domain updates and streaming transport application.

Power Reinforcement Post-Training of Text-to-Image Models with Super-Linear Advantage Shaping

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

Super-Linear Advantage Shaping (SLAS) introduces a non-linear geometric policy update for RL post-training of text-to-image models that reshapes the local policy space via advantage-dependent Fisher-Rao weighting to reduce reward hacking and improve performance over GRPO baselines.

A Regularized Hessian-Free Inexact Newton-Type Method with Global $\mathcal{O}(k^{-2})$ Convergence

math.OC · 2026-04-30 · unverdicted · novelty 6.0

A new regularized Hessian-free Newton-type method for smooth convex optimization achieves global O(k^{-2}) convergence and local quadratic convergence in a variant, with practical speedups over prior methods.

citing papers explorer

Showing 3 of 3 citing papers.

FlashSinkhorn: IO-Aware Entropic Optimal Transport on GPU cs.LG · 2026-02-03 · conditional · none · ref 34
FlashSinkhorn delivers up to 32x forward and 161x end-to-end speedups for entropic OT on A100 GPUs via IO-aware Triton kernels that fuse log-domain updates and streaming transport application.
Power Reinforcement Post-Training of Text-to-Image Models with Super-Linear Advantage Shaping cs.CV · 2026-05-11 · unverdicted · none · ref 30
Super-Linear Advantage Shaping (SLAS) introduces a non-linear geometric policy update for RL post-training of text-to-image models that reshapes the local policy space via advantage-dependent Fisher-Rao weighting to reduce reward hacking and improve performance over GRPO baselines.
A Regularized Hessian-Free Inexact Newton-Type Method with Global $\mathcal{O}(k^{-2})$ Convergence math.OC · 2026-04-30 · unverdicted · none · ref 18
A new regularized Hessian-free Newton-type method for smooth convex optimization achieves global O(k^{-2}) convergence and local quadratic convergence in a variant, with practical speedups over prior methods.

Deep learning via hessian-free optimization

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer