首页|Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search

Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search

来源：

英文摘要

Vector similarity search plays a pivotal role in modern information retrieval systems, especially when powered by transformer-based embeddings. However, the scalability and efficiency of such systems are often hindered by the high dimensionality of latent representations. In this paper, we propose a novel game-theoretic framework for optimizing latent-space compression to enhance both the efficiency and semantic utility of vector search. By modeling the compression strategy as a zero-sum game between retrieval accuracy and storage efficiency, we derive a latent transformation that preserves semantic similarity while reducing redundancy. We benchmark our method against FAISS, a widely-used vector search library, and demonstrate that our approach achieves a significantly higher average similarity (0.9981 vs. 0.5517) and utility (0.8873 vs. 0.5194), albeit with a modest increase in query time. This trade-off highlights the practical value of game-theoretic latent compression in high-utility, transformer-based search applications. The proposed system can be seamlessly integrated into existing LLM pipelines to yield more semantically accurate and computationally efficient retrieval.

作者：Kushagra Agrawal、Nisharg Nargund、Oishani Banerjee

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Kushagra Agrawal,Nisharg Nargund,Oishani Banerjee.Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search[EB/OL].(2025-08-26)[2025-09-06].https://arxiv.org/abs/2508.18877.点此复制

Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search

Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search

评论