Reranking with Compressed Document Representation

Source: arXiv

Abstract

Reranking, the process of refining the output of a first-stage retriever, is often considered computationally expensive, especially with Large Language Models. Borrowing from recent advances in document compression for RAG, we reduce the input size by compressing documents into fixed-size embedding representations. We then teach a reranker to use compressed inputs by distillation. Although based on a billion-size model, our trained reranker using this compressed input can challenge smaller rerankers in terms of both effectiveness and efficiency, especially for long documents. Given that text compressors are still in their early development stages, we view this approach as promising.

Authors: Hervé Déjean, Stéphane Clinchant

Subject classification: Computing technology, computer technology

Recommended citation: Hervé Déjean, Stéphane Clinchant. Reranking with Compressed Document Representation [EB/OL]. (2025-05-21) [2025-06-09]. https://arxiv.org/abs/2505.15394.
