On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization

Anoop Cherian; Basura Fernando; Edison Guo; Peter Anderson; Rodrigo Santa Cruz; Stephen Gould

arxiv: 1607.05447 · v2 · pith:CR3XVVP7new · submitted 2016-07-19 · 💻 cs.CV · math.OC

On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization

Stephen Gould , Basura Fernando , Anoop Cherian , Peter Anderson , Rodrigo Santa Cruz , Edison Guo This is my paper

classification 💻 cs.CV math.OC

keywords optimizationproblemargmaxargminbi-levelproblemssomedifferentiating

0 comments

read the original abstract

Some recent works in machine learning and computer vision involve the solution of a bi-level optimization problem. Here the solution of a parameterized lower-level problem binds variables that appear in the objective of an upper-level problem. The lower-level problem typically appears as an argmin or argmax optimization problem. Many techniques have been proposed to solve bi-level optimization problems, including gradient descent, which is popular with current end-to-end learning approaches. In this technical report we collect some results on differentiating argmin and argmax optimization problems with and without constraints and provide some insightful motivating examples.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 9 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Constraint-Aware Flow Matching: Decision Aligned End-to-End Training for Constrained Sampling
cs.LG 2026-05 unverdicted novelty 7.0

Constraint-Aware Flow Matching integrates constraint projections into the flow matching training objective to align model dynamics with constrained sampling and reduce distributional shift.
Decision-Focused Learning via Tangent-Space Projection of Prediction Error
cs.LG 2026-05 unverdicted novelty 7.0

Regret gradients in DFL are the tangent-space projection of prediction error scaled by curvature, enabling efficient direct computation without differentiating through solvers.
Learning Latent Trees with Stochastic Perturbations and Differentiable Dynamic Programming
cs.CL 2019-06 unverdicted novelty 7.0

A fully differentiable parser that stochastically samples projective dependency trees using Gumbel perturbations and dynamic programming to boost downstream task performance without direct supervision.
Decision-Focused Learning via Tangent-Space Projection of Prediction Error
cs.LG 2026-05 unverdicted novelty 6.0

PEAR computes regret gradients via tangent-space projection of prediction error, delivering top decision quality and efficiency on LP and QP tasks without solver differentiation.
Representation-Guided Parameter-Efficient LLM Unlearning
cs.CL 2026-04 unverdicted novelty 6.0

REGLU guides LoRA-based unlearning via representation subspaces and orthogonal regularization to outperform prior methods on forget-retain trade-off in LLM benchmarks.
SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
cs.LG 2023-10 conditional novelty 6.0

SalUn uses gradient-based weight saliency to achieve effective machine unlearning of data, classes, or concepts in image classification and generation, narrowing the gap to exact retraining.
InfoGeo: Information-Theoretic Object-Centric Learning for Cross-View Generalizable UAV Geo-Localization
cs.CV 2026-05 unverdicted novelty 5.0

InfoGeo reformulates cross-view geo-localization as an information bottleneck that aligns object-centric structural relations across views while minimizing view-specific noise.
InfoGeo: Information-Theoretic Object-Centric Learning for Cross-View Generalizable UAV Geo-Localization
cs.CV 2026-05 unverdicted novelty 5.0

InfoGeo applies an information bottleneck to object-centric learning for improved cross-view generalization in UAV geo-localization.
InfoGeo: Information-Theoretic Object-Centric Learning for Cross-View Generalizable UAV Geo-Localization
cs.CV 2026-05 unverdicted novelty 5.0

InfoGeo reformulates cross-view geo-localization as an information bottleneck that aligns object-centric structural relations while suppressing view-specific noise, outperforming prior methods on benchmarks.