首页|A Comprehensive Taxonomy of Negation for NLP and Neural Retrievers

A Comprehensive Taxonomy of Negation for NLP and Neural Retrievers

来源：

英文摘要

Understanding and solving complex reasoning tasks is vital for addressing the information needs of a user. Although dense neural models learn contextualised embeddings, they still underperform on queries containing negation. To understand this phenomenon, we study negation in both traditional neural information retrieval and LLM-based models. We (1) introduce a taxonomy of negation that derives from philosophical, linguistic, and logical definitions; (2) generate two benchmark datasets that can be used to evaluate the performance of neural information retrieval models and to fine-tune models for a more robust performance on negation; and (3) propose a logic-based classification mechanism that can be used to analyze the performance of retrieval models on existing datasets. Our taxonomy produces a balanced data distribution over negation types, providing a better training setup that leads to faster convergence on the NevIR dataset. Moreover, we propose a classification schema that reveals the coverage of negation types in existing datasets, offering insights into the factors that might affect the generalization of fine-tuned models on negation.

作者：Roxana Petcu、Samarth Bhargav、Maarten de Rijke、Evangelos Kanoulas

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Roxana Petcu,Samarth Bhargav,Maarten de Rijke,Evangelos Kanoulas.A Comprehensive Taxonomy of Negation for NLP and Neural Retrievers[EB/OL].(2025-07-30)[2025-08-06].https://arxiv.org/abs/2507.22337.点此复制

A Comprehensive Taxonomy of Negation for NLP and Neural Retrievers

A Comprehensive Taxonomy of Negation for NLP and Neural Retrievers

评论