pith. sign in

arxiv: cs/0211018 · v2 · submitted 2002-11-14 · 💻 cs.DS

Indexing schemes for similarity search: an illustrated paradigm

classification 💻 cs.DS
keywords similarityschemesindexingequippedgeometryillustratedmeasuresearch
0
0 comments X
read the original abstract

We suggest a variation of the Hellerstein--Koutsoupias--Papadimitriou indexability model for datasets equipped with a similarity measure, with the aim of better understanding the structure of indexing schemes for similarity-based search and the geometry of similarity workloads. This in particular provides a unified approach to a great variety of schemes used to index into metric spaces and facilitates their transfer to more general similarity measures such as quasi-metrics. We discuss links between performance of indexing schemes and high-dimensional geometry. The concepts and results are illustrated on a very large concrete dataset of peptide fragments equipped with a biologically significant similarity measure.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.