Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation

Source: arXiv

Abstract

Recent decoding methods improve the factuality of large language models (LLMs) by refining how the next token is selected during generation. These methods typically operate at the token level, leveraging internal representations to suppress superficial patterns. Nevertheless, LLMs remain prone to hallucinations, especially over longer contexts. In this paper, we propose Active Layer-Contrastive Decoding (ActLCD), a novel decoding strategy that actively decides when to apply contrasting layers during generation. By casting decoding as a sequential decision-making problem, ActLCD employs a reinforcement learning policy guided by a reward-aware classifier to optimize factuality beyond the token level. Our experiments demonstrate that ActLCD surpasses state-of-the-art methods across five benchmarks, showcasing its effectiveness in mitigating hallucinations in diverse generation scenarios.
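
The full method is specified in the arXiv paper linked below; as a rough illustration of the idea summarized in the abstract, the sketch that follows contrasts the output distributions of a final and an earlier transformer layer (DoLa-style layer contrast) and uses a simple per-step gate as a stand-in for ActLCD's learned, reward-aware policy. The random logit tensors, the `gate` heuristic, and the `alpha` threshold are illustrative assumptions, not the paper's implementation.

```python
# Minimal sketch of layer-contrastive decoding with a per-step gate
# (a hypothetical stand-in for ActLCD's learned policy).
# Assumption: in a real model, `final_logits` and `early_logits` would come
# from the last and an earlier transformer layer projected through the LM head;
# here they are random tensors so the example runs standalone.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
vocab_size = 32

def contrastive_logits(final_logits, early_logits, alpha=0.1):
    """DoLa-style contrast: keep only tokens the final layer finds plausible,
    then subtract the early layer's log-probabilities from the final layer's."""
    log_p_final = F.log_softmax(final_logits, dim=-1)
    log_p_early = F.log_softmax(early_logits, dim=-1)
    # Adaptive plausibility mask: tokens within a factor alpha of the top final-layer prob.
    cutoff = torch.log(torch.tensor(alpha)) + log_p_final.max(dim=-1, keepdim=True).values
    scores = log_p_final - log_p_early
    return scores.masked_fill(log_p_final < cutoff, float("-inf"))

def gate(step_features):
    """Placeholder for a policy deciding *when* to contrast at each step.
    Here: a fixed entropy threshold (assumption, not the learned RL policy)."""
    return step_features["entropy"] > 2.0

generated = []
for step in range(5):
    # Stand-ins for the per-layer logits at this decoding step.
    final_logits = torch.randn(vocab_size)
    early_logits = torch.randn(vocab_size)
    probs = F.softmax(final_logits, dim=-1)
    entropy = -(probs * probs.clamp_min(1e-12).log()).sum()
    if gate({"entropy": entropy.item()}):
        step_logits = contrastive_logits(final_logits, early_logits)
    else:
        step_logits = final_logits  # fall back to standard decoding
    generated.append(int(step_logits.argmax()))

print("greedy token ids:", generated)
```

The gate is the part ActLCD learns: casting the choice of when to contrast as a sequential decision lets a reward-aware policy skip the contrast where it would hurt fluency and apply it where hallucination risk is higher.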

Hongxiang Zhang, Hao Chen, Muhao Chen, Tianyi Zhang

Subjects: Computing Technology; Computer Technology

Hongxiang Zhang, Hao Chen, Muhao Chen, Tianyi Zhang. Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation [EB/OL]. (2025-05-29) [2025-06-29]. https://arxiv.org/abs/2505.23657