Amortizing Maximum Inner Product Search with Learned Support Functions

João Monteiro; Marco Cuturi; Michal Klein; Theo X. Olausson

arXiv: 2603.08001 · v2 · pith:NVRQ4KAGnew · submitted 2026-03-09 · cs.LG · stat.ML

Theo X. Olausson, João Monteiro, Michal Klein, Marco Cuturi

classification: cs.LG, stat.ML

keywords: mips, function, database, directly, keynet, support, supportnet, amortized

Maximum inner product search (MIPS) is a crucial subroutine in machine learning, requiring the identification of a vector taken within a database (the keys) that best aligns with a given query. We propose amortized MIPS: a regression-based approach that trains neural networks to directly predict MIPS solutions, amortizing the cost of repeatedly solving MIPS for queries drawn from a known distribution over a fixed key database. Our key insight is that the MIPS value function is the support function of the set of keys, a well-studied convex function whose gradient yields the optimal key. This motivates two complementary amortized models: SupportNet, an input-convex neural network trained to regress the support function, and KeyNet, a vector-valued network that directly regresses the optimal key. SupportNet can serve as a cluster router, steering queries toward relevant database partitions, while KeyNet can be used as a drop-in replacement for the original query, fed directly to off-the-shelf indexing pipelines. Our experiments on the BEIR benchmark show that, for document embeddings, learned SupportNets and KeyNets significantly improve IVF match rates when accounting for compute effort, whether measured in FLOPs, number of probes, or wall-clock time. Our code is available at: https://github.com/apple/ml-amips.
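The abstract's central observation, that the MIPS value is the support function of the key set and that its gradient is the optimal key, can be checked on a toy example. The sketch below is not taken from the paper or its repository: the function names (`dot`, `support`, `mips`, `numerical_gradient`), the four 2-D keys, and the query are all made up for illustration, and the gradient is approximated by finite differences rather than computed by a trained network.

```python
def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def support(query, keys):
    """Support function of the key set: h(q) = max over keys k of <q, k>."""
    return max(dot(query, k) for k in keys)

def mips(query, keys):
    """Exact MIPS by brute force: the key with the largest inner product."""
    return max(keys, key=lambda k: dot(query, k))

def numerical_gradient(f, q, eps=1e-6):
    """Central finite-difference approximation of the gradient of f at q."""
    grad = []
    for i in range(len(q)):
        up = list(q)
        up[i] += eps
        down = list(q)
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad

# Hypothetical toy database and query (illustrative values only).
keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0], [0.6, 0.6]]
query = [2.0, 0.5]

value = support(query, keys)      # 2.0, attained by the key [1.0, 0.0]
best_key = mips(query, keys)      # [1.0, 0.0]
grad = numerical_gradient(lambda q: support(q, keys), query)

# Where the maximizer is unique, the gradient of h coincides with the optimal key.
max_gap = max(abs(g - k) for g, k in zip(grad, best_key))
print(value, best_key, max_gap < 1e-6)
```

Read against the abstract, a SupportNet would stand in for `support` (with the key recovered from its gradient) and a KeyNet would stand in for `mips`. Both replace the brute-force scan over `keys` with a single forward pass, at the price of an approximate answer.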

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Nectar: Neural Estimation of Cached-Token Attention via Regression
cs.LG 2026-05 unverdicted novelty 6.0

Nectar fits small per-layer per-head neural networks via regression to predict attention outputs and normalizers, enabling constant-time inference independent of context length while preserving semantic generation quality.