pith. sign in

arxiv: 2606.26373 · v1 · pith:DWLZ6U2Jnew · submitted 2026-06-24 · 💻 cs.CR · cs.AI· cs.IR

Hybrid privacy-aware semantic search: SVD-truncated document geometry and CKKS-encrypted query reranking under a restricted threat model

Pith reviewed 2026-06-26 01:13 UTC · model grok-4.3

classification 💻 cs.CR cs.AIcs.IR
keywords semantic searchprivacy-preserving retrievalhomomorphic encryptionSVD truncationembedding inversionCKKSthreat modelvector databases
0
0 comments X

The pith

Truncating document embeddings onto a secret SVD subspace and rotating them, while reranking queries under CKKS encryption, establishes a tight lower bound on reconstruction error and maintains ranking quality at million-document scale.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a hybrid defense for semantic search over dense embeddings that exploits the difference between a static document collection and dynamic queries. Documents receive geometric protection through SVD subspace truncation followed by a secret orthogonal rotation known only to the owner. Queries receive cryptographic protection by performing reranking inside CKKS homomorphic encryption so the server never sees them. Under this restricted threat model the authors prove a lower bound on any attacker's reconstruction error when limited to the protected subspace. Experiments across one million documents and five encoders show that ranking quality stays intact or improves slightly while latency remains sub-second and standard inversion attacks drop to the noise floor.

Core claim

Protecting the static collection geometrically by projecting each embedding onto a lower-dimensional SVD subspace and applying a secret orthogonal rotation, while protecting the dynamic query by reranking it under CKKS homomorphic encryption, yields a tight lower bound on the reconstruction error of any attacker confined to the protected subspace; the scheme preserves ranking quality at million-document scale and sub-second latency, with document protection treated as empirical obfuscation rather than a cryptographic primitive.

What carries the argument

SVD-truncated document geometry combined with a secret orthogonal rotation for the collection and CKKS-encrypted reranking for the query.

If this is right

  • Ranking quality is preserved or slightly improved on strong encoders, functioning as a linear denoiser.
  • Off-the-shelf inversion attacks on the protected space fall to the noise floor.
  • A known-plaintext attacker recovers the rotation via orthogonal Procrustes once the number of leaked pairs approaches the retained dimension.
  • Product-quantization codes retain most nearest-neighbor structure after protection.
  • Truncation accuracy cost depends on the encoder rather than acting as a universal denoiser.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The separation of static geometric protection from dynamic cryptographic protection could be tested on other retrieval-augmented generation pipelines to measure leakage reduction without full encryption cost.
  • Varying the retained SVD dimension as a function of encoder strength offers a practical knob for tuning the accuracy-privacy trade-off in deployed systems.
  • The approach suggests checking whether the same subspace truncation plus rotation can be combined with lighter query protections in settings where full CKKS reranking is too expensive.
  • Public product-quantization codes surviving the protection layer points to a possible need for joint design of quantization and geometric masking in future vector databases.

Load-bearing premise

The secret orthogonal rotation stays unknown to the attacker and the SVD truncation supplies enough obfuscation for the empirical protection layer to work.

What would settle it

An attacker who only sees the protected subspace and the truncation dimension recovers source documents with error materially below the stated lower bound.

Figures

Figures reproduced from arXiv: 2606.26373 by Sergey Kurilenko.

Figure 1
Figure 1. Figure 1: Online query flow of the construction under study. Steps 1–7 realise the two-stage protocol: client transformation T(·) and local PQ filtering produce a short-list of Kcands candidate IDs; step 4 encrypts the rotated query under CKKS; step 5 runs ct-pt reranking on the server; steps 6–7 decrypt the scores and sort. The secret keys (µ, Vk, R, skCKKS) and the rotated database Erot are produced offline by the… view at source ↗
Figure 2
Figure 2. Figure 2: ct-pt vs. ct-ct: distribution of per-batch latencies over 50 trials. Removing relinearisa￾tion moves the median by ∼ 6 s and tightens the upper tail. 7.1 Experiment 1: ciphertext-plaintext speed-up We benchmark a batched dot-product of an encrypted 192-dimensional query with N = 10 000 plaintext documents. The CKKS parameters are N = 8192, coeff_mod_bit_sizes = [60, 40, 40, 60], ∆ = 240 (security level ≥ 1… view at source ↗
Figure 3
Figure 3. Figure 3: Vec2Text BLEU as a function of k/d under three threat models (left); zoom on k/d = 1.0 comparing no-rotation, known-R and unknown-R (right). Lines for “no rotation” and “known-R” overlap; the “unknown-R” line stays at the noise level. budget ( [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Per-seed Acc@10 of the proposed pipeline relative to baseline_proj on five rotation seeds; the spread is below the 5-seed CI of the baseline [PITH_FULL_IMAGE:figures/full_fig_p014_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Left: per-stage latency decomposition at k = 192 (the server-side ct-pt rerank dominates). Right: CDF of the server-side rerank latency over 500 queries × 5 seeds (p95 ≈ 283 ms); the full end-to-end p95 ≈ 370 ms ( [PITH_FULL_IMAGE:figures/full_fig_p015_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Known-plaintext alignment destroys the unknown-rotation assumption: with about k leaked pairs the rotation is recovered. The x axis uses a symlog scale to include m = 0 [PITH_FULL_IMAGE:figures/full_fig_p016_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Reference-corpus lookup after alignment. When the reference overlaps the protected collection, alignment turns the protected vector into an exact paragraph lookup; with a disjoint reference this token-overlap proxy stays low. 16 [PITH_FULL_IMAGE:figures/full_fig_p016_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Utility/leakage diagnostic for SVD truncation and calibrated Gaussian noise (dashed: matched-σrec noise baselines). 18 [PITH_FULL_IMAGE:figures/full_fig_p018_8.png] view at source ↗
read the original abstract

Dense embeddings power semantic search and retrieval-augmented generation, but embedding-inversion attacks can reconstruct source text from a vector: when a vector database leaks, the documents behind it leak too. The textbook defences are extremes - encrypting the whole search homomorphically is sound but too slow at million-document scale, while privacy noise degrades ranking long before it protects. We study a middle path exploiting the asymmetry between the static collection and the dynamic query. The collection is protected geometrically: each vector is truncated onto a lower-dimensional SVD subspace and rotated by a secret orthogonal transform known only to the owner. The query is protected cryptographically: it is reranked under CKKS homomorphic encryption, so an honest-but-curious server never sees the query or the scores. CKKS parameters come from a small offline benchmark. We prove a tight lower bound on the reconstruction error of any attacker confined to the protected subspace. On one million documents and five encoders the scheme preserves ranking quality (slightly improving it on strong encoders, as a linear denoiser) at sub-second latency, and an off-the-shelf inversion attack on the protected space collapses to the noise floor. We then test stronger adversaries: a known-plaintext attacker recovers the rotation by orthogonal Procrustes from about as many leaked pairs as the retained dimension; the public product-quantization codes preserve most nearest-neighbour structure; and random-projection, calibrated-noise and BEIR baselines show the truncation is an encoder-dependent accuracy cost, not a free denoiser. We state the limits: query confidentiality is cryptographic, but document protection is an empirical obfuscation layer (SVD truncation plus a secret rotation), not a cryptographic primitive, and we delimit the threat model for each claim.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The manuscript proposes a hybrid privacy-preserving semantic search system. Static document embeddings are protected geometrically via SVD subspace truncation followed by a secret orthogonal rotation known only to the owner; dynamic queries are protected via CKKS homomorphic encryption during reranking on an honest-but-curious server. The authors claim a tight lower bound on reconstruction error for any attacker confined to the protected subspace, report that ranking quality is preserved (and slightly improved on strong encoders) across 1M documents and five encoders at sub-second latency, and show that an off-the-shelf inversion attack falls to the noise floor. Stronger adversaries (known-plaintext Procrustes recovery, public PQ codes) are tested, and the document-protection component is explicitly labeled an empirical obfuscation layer rather than a cryptographic primitive, with the threat model delimited accordingly.

Significance. If the lower-bound derivation holds and the 1M-document experiments are reproducible, the work supplies a concrete middle path between full homomorphic search (impractical at scale) and additive noise (utility loss). Strengths include the explicit scoping of the threat model, the use of standard primitives (SVD, orthogonal transforms, CKKS), the observation that truncation can function as a linear denoiser for strong encoders, and the honest delimitation that document protection is empirical. These elements make the contribution falsifiable within its stated model and potentially useful for practical retrieval systems.

major comments (2)
  1. [Proof of the tight lower bound] Proof of the tight lower bound on reconstruction error (abstract and limits/threat-model section): the claim that the bound is tight for attackers confined to the SVD-truncated rotated subspace is load-bearing for the privacy guarantee, yet the derivation steps relating the retained dimension, the secret rotation, and the resulting error floor are not visible; without an explicit inequality or reduction showing how the bound is obtained and why it remains tight against the tested inversion attacks, it is unclear whether the bound is achieved or merely asserted.
  2. [Experimental ranking results] Experimental ranking results on 1M documents (results section): the statements that ranking quality is preserved and 'slightly improving it on strong encoders, as a linear denoiser' are central to the utility claim, but the manuscript does not report effect sizes, variance across encoders, or statistical tests; without these, the improvement cannot be distinguished from experimental noise and the denoiser interpretation remains unsupported.
minor comments (3)
  1. [Abstract] The abstract refers to 'five encoders' without naming them; listing the specific models and their dimensionalities in the experimental setup would improve reproducibility.
  2. [CKKS parameters] CKKS parameters are stated to come from 'a small offline benchmark'; the exact selection criteria, security level, and resulting latency/accuracy trade-offs should be tabulated or described in the methods.
  3. Notation for the SVD truncation dimension k and the secret rotation matrix R is introduced late; defining these symbols at first use would aid readability of the geometric-protection description.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed report. The two major comments identify areas where the manuscript's presentation of the lower-bound derivation and experimental statistics can be strengthened for greater clarity and rigor. We address each point below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: [Proof of the tight lower bound] Proof of the tight lower bound on reconstruction error (abstract and limits/threat-model section): the claim that the bound is tight for attackers confined to the SVD-truncated rotated subspace is load-bearing for the privacy guarantee, yet the derivation steps relating the retained dimension, the secret rotation, and the resulting error floor are not visible; without an explicit inequality or reduction showing how the bound is obtained and why it remains tight against the tested inversion attacks, it is unclear whether the bound is achieved or merely asserted.

    Authors: We acknowledge that while the manuscript states a tight lower bound on reconstruction error for attackers limited to the protected subspace, the explicit derivation steps, inequality, and reduction relating retained dimension, secret rotation, and error floor are not fully detailed in the main text. We will revise the limits/threat-model section to include a complete step-by-step derivation demonstrating how the bound is obtained and its tightness against the inversion attacks evaluated in the experiments. revision: yes

  2. Referee: [Experimental ranking results] Experimental ranking results on 1M documents (results section): the statements that ranking quality is preserved and 'slightly improving it on strong encoders, as a linear denoiser' are central to the utility claim, but the manuscript does not report effect sizes, variance across encoders, or statistical tests; without these, the improvement cannot be distinguished from experimental noise and the denoiser interpretation remains unsupported.

    Authors: We agree that the results section does not report effect sizes, per-encoder variance, or statistical tests to support the claims of preserved (and slightly improved) ranking quality or the linear denoiser interpretation. In the revision we will add these elements, including standard deviations across runs, appropriate effect-size measures, and statistical significance tests, to allow readers to evaluate the strength of the observed improvements. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's core claims—a mathematical lower bound on reconstruction error within the SVD-truncated rotated subspace, ranking preservation under CKKS reranking, and explicit scoping to a restricted threat model—rely on standard external linear-algebra and cryptographic primitives rather than any self-referential fitting, self-citation chain, or redefinition of inputs as outputs. Document protection is labeled an empirical layer, not a primitive, and no equations or results reduce by construction to parameters fitted from the same data. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The central claims rest on the restricted threat model (honest-but-curious server plus specific stronger adversaries) and on the empirical effectiveness of the secret rotation remaining unknown; CKKS parameters are obtained from an offline benchmark and the SVD truncation dimension is chosen to balance privacy and utility.

free parameters (2)
  • SVD truncation dimension
    Retained dimension controls the privacy-utility trade-off and is selected per encoder; appears chosen experimentally rather than derived from first principles.
  • CKKS encryption parameters
    Derived from a small offline benchmark; these control noise growth and latency but are not part of the core geometric protection claim.
axioms (2)
  • domain assumption The attacker is confined to the protected subspace and does not obtain the secret rotation matrix except via known-plaintext attacks with limited pairs.
    The paper explicitly delimits the threat model and states that document protection is empirical obfuscation rather than a cryptographic primitive.
  • domain assumption Orthogonal Procrustes attack success depends on the number of leaked pairs relative to retained dimension.
    Used to bound stronger adversaries; stated in the abstract's adversary testing section.

pith-pipeline@v0.9.1-grok · 5853 in / 1682 out tokens · 21418 ms · 2026-06-26T01:13:26.338938+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

50 extracted references · 1 canonical work pages

  1. [1]

    Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks,

    N. Reimers and I. Gurevych, “Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks,” inProc. EMNLP, 2019, pp. 3982–3992. 21

  2. [2]

    Text Embeddings by Weakly-Supervised Contrastive Pre- training,

    L. Wang, N. Yang, X. Huang, et al., “Text Embeddings by Weakly-Supervised Contrastive Pre- training,”arXiv:2212.03533, 2022

  3. [3]

    Large Dual Encoders Are Generalizable Retrievers,

    J. Ni et al., “Large Dual Encoders Are Generalizable Retrievers,” inProc. EMNLP, 2022, pp. 9844–9855

  4. [4]

    BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation,

    J. Chen, S. Xiao, P. Zhang, K. Luo, D. Lian, and Z. Liu, “BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation,” arXiv:2402.03216, 2024

  5. [5]

    Dense Passage Retrieval for Open-Domain Question Answering,

    V. Karpukhin et al., “Dense Passage Retrieval for Open-Domain Question Answering,” inProc. EMNLP, 2020, pp. 6769–6781

  6. [6]

    ColBERT: Efficient and Effective Passage Search via Contextualised Late Interaction over BERT,

    O. Khattab and M. Zaharia, “ColBERT: Efficient and Effective Passage Search via Contextualised Late Interaction over BERT,” inProc. ACM SIGIR, 2020, pp. 39–48

  7. [7]

    Text Embeddings Reveal (Almost) As Much As Text,

    J. X. Morris, V. Kuleshov, V. Shmatikov, and A. M. Rush, “Text Embeddings Reveal (Almost) As Much As Text,” inProc. EMNLP, 2023

  8. [8]

    Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence,

    H. Li, M. Xu, and Y. Song, “Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence,” inFindings of ACL, 2023, pp. 14022–14040

  9. [9]

    Transferable Embedding Inversion Attack: Uncovering Privacy Risks in Text Embeddings without Model Queries,

    Y.-H. Huang, Y. Tsai, H. Hsiao, H.-Y. Lin, and S.-D. Lin, “Transferable Embedding Inversion Attack: Uncovering Privacy Risks in Text Embeddings without Model Queries,” inProc. ACL (Long), 2024, pp. 4193–4205

  10. [10]

    ALGEN: Few-shot Inversion Attacks on Textual Embeddings via Cross-Model Alignment and Generation,

    Y. Chen, Q. Xu, and J. Bjerva, “ALGEN: Few-shot Inversion Attacks on Textual Embeddings via Cross-Model Alignment and Generation,” inProc. ACL (Long), 2025, pp. 24330–24348. arXiv:2502.11308

  11. [11]

    Universal Zero-shot Embedding Inversion,

    C. Zhang, J. X. Morris, and V. Shmatikov, “Universal Zero-shot Embedding Inversion,” arXiv:2504.00147, 2025

  12. [12]

    Harnessing the Universal Geometry of Embeddings (vec2vec),

    R. Jha, C. Zhang, V. Shmatikov, and J. X. Morris, “Harnessing the Universal Geometry of Embeddings (vec2vec),”arXiv:2505.12540, 2025

  13. [13]

    Zero2Text: Zero-Training Cross-Domain Inversion Attacks on Textual Embeddings,

    D. Kim, D. Kang, K. Lee, H. Baek, and B. B. Kang, “Zero2Text: Zero-Training Cross-Domain Inversion Attacks on Textual Embeddings,”arXiv:2602.01757, 2026

  14. [14]

    The Good and the Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG),

    S. Zeng et al., “The Good and the Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG),” inFindings of ACL, 2024, pp. 4505–4524

  15. [15]

    Information Leakage in Embedding Models,

    C. Song and A. Raghunathan, “Information Leakage in Embedding Models,” inProc. ACM CCS, 2020, pp. 377–390

  16. [16]

    Membership Inference Attacks Against Machine Learning Models,

    R. Shokri et al., “Membership Inference Attacks Against Machine Learning Models,” inProc. IEEE S&P, 2017, pp. 3–18

  17. [17]

    Extracting Training Data from Large Language Models,

    N. Carlini et al., “Extracting Training Data from Large Language Models,” inProc. USENIX Security, 2021, pp. 2633–2650

  18. [18]

    Differential Privacy,

    C. Dwork, “Differential Privacy,” inProc. ICALP, 2006, pp. 1–12

  19. [19]

    Deep Learning with Differential Privacy,

    M. Abadi et al., “Deep Learning with Differential Privacy,” inProc. ACM CCS, 2016, pp. 308–318

  20. [20]

    Differentially Private Representation for NLP,

    L. Lyu, X. He, and Y. Li, “Differentially Private Representation for NLP,” inFindings of EMNLP, 2020, pp. 2355–2365

  21. [21]

    Privacy via the Johnson-Lindenstrauss Transform,

    K. Kenthapadi, A. Korolova, I. Mironov, and N. Mishra, “Privacy via the Johnson-Lindenstrauss Transform,”J. Privacy and Confidentiality, vol. 5, no. 1, 2013

  22. [22]

    Random Projection-Based Multiplicative Data Perturbation for Privacy Preserving Distributed Data Mining,

    K. Liu, H. Kargupta, and J. Ryan, “Random Projection-Based Multiplicative Data Perturbation for Privacy Preserving Distributed Data Mining,”IEEE TKDE, vol. 18, no. 1, 2006, pp. 92–106. 22

  23. [23]

    Homomorphic Encryption for Arithmetic of Approximate Numbers,

    J. H. Cheon, A. Kim, M. Kim, and Y. Song, “Homomorphic Encryption for Arithmetic of Approximate Numbers,” inAdvances in Cryptology—ASIACRYPT 2017, LNCS 10624, pp. 409–437

  24. [24]

    Bootstrapping for Approximate Homomorphic Encryption,

    J. H. Cheon, K. Han, A. Kim et al., “Bootstrapping for Approximate Homomorphic Encryption,” in Advances in Cryptology—EUROCRYPT 2018, LNCS 10820, pp. 360–384

  25. [25]

    Homomorphic Encryption Security Standard,

    M. Albrecht et al., “Homomorphic Encryption Security Standard,” HomomorphicEncryption.org, 2018

  26. [26]

    On the Concrete Hardness of Learning with Errors,

    M. R. Albrecht, R. Player, and S. Scott, “On the Concrete Hardness of Learning with Errors,”Journal of Mathematical Cryptology, vol. 9, no. 3, 2015, pp. 169–203 (lattice-estimator methodology)

  27. [27]

    A Generalized Solution of the Orthogonal Procrustes Problem,

    P. H. Schönemann, “A Generalized Solution of the Orthogonal Procrustes Problem,”Psychometrika, vol. 31, no. 1, 1966, pp. 1–10

  28. [28]

    OpenFHE: Open-Source Fully Homomorphic Encryption Library,

    A. Al Badawi et al., “OpenFHE: Open-Source Fully Homomorphic Encryption Library,” inProc. WAHC ’22, 2022, pp. 53–63

  29. [29]

    EVA: An Encrypted Vector Arithmetic Language and Compiler for Efficient Homomorphic Computation,

    R. Dathathri et al., “EVA: An Encrypted Vector Arithmetic Language and Compiler for Efficient Homomorphic Computation,” inProc. ACM PLDI, 2020, pp. 546–561

  30. [30]

    CHET: An Optimizing Compiler for Fully-Homomorphic Neural-Network Inferencing,

    R. Dathathri et al., “CHET: An Optimizing Compiler for Fully-Homomorphic Neural-Network Inferencing,” inProc. ACM PLDI, 2019, pp. 142–156

  31. [31]

    Over 100x Faster Bootstrapping in Fully Homomorphic Encryption through Memory-centric Optimisation with GPUs,

    W. Jung, S. Kim, J. H. Ahn et al., “Over 100x Faster Bootstrapping in Fully Homomorphic Encryption through Memory-centric Optimisation with GPUs,”IACR TCHES, vol. 2021, no. 4, pp. 114–148

  32. [32]

    Intel HEXL: Accelerating Homomorphic Encryption with Intel AVX512-IFMA52,

    F. Boemer et al., “Intel HEXL: Accelerating Homomorphic Encryption with Intel AVX512-IFMA52,” inProc. WAHC ’21, 2021, pp. 57–62

  33. [33]

    CryptoNets: Applying Neural Networks to Encrypted Data with High Throughput and Accuracy,

    R. Gilad-Bachrach et al., “CryptoNets: Applying Neural Networks to Encrypted Data with High Throughput and Accuracy,” inProc. ICML, 2016, vol. 48, pp. 201–210

  34. [34]

    Private Web Search with Tiptoe,

    A. Henzinger, E. Dauterman, H. Corrigan-Gibbs, and N. Zeldovich, “Private Web Search with Tiptoe,” inProc. ACM SOSP, 2023

  35. [35]

    PIR with Compressed Queries and Amortised Query Processing,

    S. Angel, H. Chen, K. Laine, and S. Setty, “PIR with Compressed Queries and Amortised Query Processing,” inProc. IEEE S&P, 2018, pp. 962–979

  36. [36]

    One Server for the Price of Two: Simple and Fast Single-Server Private Information Retrieval (SimplePIR),

    A. Henzinger, M. M. Hong, H. Corrigan-Gibbs, S. Meiklejohn, and V. Vaikuntanathan, “One Server for the Price of Two: Simple and Fast Single-Server Private Information Retrieval (SimplePIR),” in Proc. USENIX Security, 2023

  37. [37]

    OnionPIR: Response Efficient Single-Server PIR,

    M. H. Mughees, H. Chen, and L. Ren, “OnionPIR: Response Efficient Single-Server PIR,” inProc. ACM CCS, 2021, pp. 2292–2306

  38. [38]

    Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions,

    N. Halko, P.-G. Martinsson, and J. A. Tropp, “Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions,”SIAM Review, vol. 53, no. 2, 2011, pp. 217–288

  39. [39]

    Product Quantisation for Nearest Neighbor Search,

    H. Jegou, M. Douze, and C. Schmid, “Product Quantisation for Nearest Neighbor Search,”IEEE TPAMI, vol. 33, no. 1, 2011, pp. 117–128

  40. [40]

    Efficient and Robust Approximate Nearest Neighbour Search Using Hierarchical Navigable Small World Graphs,

    Y. A. Malkov and D. A. Yashunin, “Efficient and Robust Approximate Nearest Neighbour Search Using Hierarchical Navigable Small World Graphs,”IEEE TPAMI, vol. 42, no. 4, 2020, pp. 824–836

  41. [41]

    Milvus: A Purpose-Built Vector Data Management System,

    J. Wang, X. Yi, R. Guo et al., “Milvus: A Purpose-Built Vector Data Management System,” inProc. ACM SIGMOD, 2021, pp. 2614–2627

  42. [42]

    The Approximation of One Matrix by Another of Lower Rank,

    C. Eckart and G. Young, “The Approximation of One Matrix by Another of Lower Rank,”Psychome- trika, vol. 1, no. 3, 1936, pp. 211–218

  43. [43]

    Extensions of Lipschitz Mappings into a Hilbert Space,

    W. B. Johnson and J. Lindenstrauss, “Extensions of Lipschitz Mappings into a Hilbert Space,” Contemporary Mathematics, vol. 26, 1984, pp. 189–206. 23

  44. [44]

    BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models,

    N. Thakur, N. Reimers, A. Rücklé, A. Srivastava, and I. Gurevych, “BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models,” inProc. NeurIPS Datasets and Benchmarks Track, 2021

  45. [45]

    BLEU: a Method for Automatic Evaluation of Machine Translation,

    K. Papineni et al., “BLEU: a Method for Automatic Evaluation of Machine Translation,” inProc. ACL, 2002, pp. 311–318

  46. [46]

    Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks,

    P. Lewis et al., “Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks,” inProc. NeurIPS, 2020

  47. [47]

    Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection,

    A. Asai et al., “Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection,” in Proc. ICLR, 2024

  48. [48]

    Corrective Retrieval Augmented Generation,

    S.-Q. Yan et al., “Corrective Retrieval Augmented Generation,”arXiv:2401.15884, 2024

  49. [49]

    Retrieval-Augmented Generation for Large Language Models: A Survey,

    Y. Gao et al., “Retrieval-Augmented Generation for Large Language Models: A Survey,” arXiv:2312.10997, 2024

  50. [50]

    Hybrid Method for Privacy-Preserving Semantic Search Based on Homomorphic Encryption and Random Projections,

    S. M. Kurilenko, “Hybrid Method for Privacy-Preserving Semantic Search Based on Homomorphic Encryption and Random Projections,”Vestnik Komp’yuternykh i Informatsionnykh Tekhnologiy, no. 3, 2026, pp. 44–49. doi:10.14489/vkit.2026.03.pp.044-049. 24