Zoom: SSD-based Vector Search for Optimizing Accuracy, Latency and Memory

Minjia Zhang; Yuxiong He

arxiv: 1809.04067 · v1 · pith:35E646GFnew · submitted 2018-09-11 · 💻 cs.CV · cs.LG· cs.PF

Zoom: SSD-based Vector Search for Optimizing Accuracy, Latency and Memory

Minjia Zhang , Yuxiong He This is my paper

classification 💻 cs.CV cs.LGcs.PF

keywords accuracysearchmemoryvectorlatencyzoomcandidatesfootprint

0 comments

read the original abstract

With the advancement of machine learning and deep learning, vector search becomes instrumental to many information retrieval systems, to search and find best matches to user queries based on their semantic similarities.These online services require the search architecture to be both effective with high accuracy and efficient with low latency and memory footprint, which existing work fails to offer. We develop, Zoom, a new vector search solution that collaboratively optimizes accuracy, latency and memory based on a multiview approach. (1) A "preview" step generates a small set of good candidates, leveraging compressed vectors in memory for reduced footprint and fast lookup. (2) A "fullview" step on SSDs reranks those candidates with their full-length vector, striking high accuracy. Our evaluation shows that, Zoom achieves an order of magnitude improvements on efficiency while attaining equal or higher accuracy, comparing with the state-of-the-art.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

LSM-VEC: A Large-Scale Disk-Based System for Dynamic Vector Search
cs.DB 2025-05 unverdicted novelty 6.0

LSM-VEC integrates hierarchical graphs with LSM-tree levels for out-of-place dynamic updates, sampling-based search, and connectivity-aware reordering, outperforming prior disk-based ANN systems on billion-scale data ...