pith. sign in

Practicalandoptimal lsh for angular distance

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it
abstract

We show the existence of a Locality-Sensitive Hashing (LSH) family for the angular distance that yields an approximate Near Neighbor Search algorithm with the asymptotically optimal running time exponent. Unlike earlier algorithms with this property (e.g., Spherical LSH [Andoni, Indyk, Nguyen, Razenshteyn 2014], [Andoni, Razenshteyn 2015]), our algorithm is also practical, improving upon the well-studied hyperplane LSH [Charikar, 2002] in practice. We also introduce a multiprobe version of this algorithm, and conduct experimental evaluation on real and synthetic data sets. We complement the above positive results with a fine-grained lower bound for the quality of any LSH family for angular distance. Our lower bound implies that the above LSH family exhibits a trade-off between evaluation time and quality that is close to optimal for a natural class of LSH functions.

fields

cs.LG 2 cs.CG 1

years

2026 2 2020 1

representative citing papers

Reformer: The Efficient Transformer

cs.LG · 2020-01-13 · accept · novelty 8.0

Reformer matches standard Transformer accuracy on long sequences while using far less memory and running faster via LSH attention and reversible residual layers.

citing papers explorer

Showing 3 of 3 citing papers.

  • Reformer: The Efficient Transformer cs.LG · 2020-01-13 · accept · none · ref 3

    Reformer matches standard Transformer accuracy on long sequences while using far less memory and running faster via LSH attention and reversible residual layers.

  • Dynamic Query Modification for Binary Locality Sensitive Hashing cs.CG · 2026-05-22 · unverdicted · none · ref 1 · internal anchor

    Dynamic query modification in binary LSH raises collision probability with near neighbors and lowers miss rates, with MQ-Forest achieving up to 40% faster build and query times on benchmarks.

  • Using predefined vector systems to speed up neural network multimillion class classification cs.LG · 2026-04-01 · unverdicted · none · ref 10

    Predefined vector systems structure neural network latent spaces to allow O(1) label prediction via index searches on embedding vectors, delivering up to 11.6x speedup on multimillion-class tasks while preserving accuracy and enabling new-class detection.