Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search
Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search
Vector similarity search plays a pivotal role in modern information retrieval systems, especially when powered by transformer-based embeddings. However, the scalability and efficiency of such systems are often hindered by the high dimensionality of latent representations. In this paper, we propose a novel game-theoretic framework for optimizing latent-space compression to enhance both the efficiency and semantic utility of vector search. By modeling the compression strategy as a zero-sum game between retrieval accuracy and storage efficiency, we derive a latent transformation that preserves semantic similarity while reducing redundancy. We benchmark our method against FAISS, a widely-used vector search library, and demonstrate that our approach achieves a significantly higher average similarity (0.9981 vs. 0.5517) and utility (0.8873 vs. 0.5194), albeit with a modest increase in query time. This trade-off highlights the practical value of game-theoretic latent compression in high-utility, transformer-based search applications. The proposed system can be seamlessly integrated into existing LLM pipelines to yield more semantically accurate and computationally efficient retrieval.
Kushagra Agrawal、Nisharg Nargund、Oishani Banerjee
计算技术、计算机技术
Kushagra Agrawal,Nisharg Nargund,Oishani Banerjee.Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search[EB/OL].(2025-08-26)[2025-09-06].https://arxiv.org/abs/2508.18877.点此复制
评论