UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making

Source: arXiv
Abstract

As Large Language Models (LLMs) are integrated into safety-critical applications involving sequential decision-making in the real world, it is essential to know when to trust LLM decisions. Existing LLM Uncertainty Quantification (UQ) methods are primarily designed for single-turn question-answering formats, leaving multi-step decision-making scenarios, e.g., LLM agentic systems, underexplored. In this paper, we introduce a principled, information-theoretic framework that decomposes LLM sequential decision uncertainty into two parts: (i) internal uncertainty intrinsic to the current decision, which is the focus of existing UQ methods, and (ii) extrinsic uncertainty, a Mutual-Information (MI) quantity describing how much uncertainty should be inherited from preceding decisions. We then propose UProp, an efficient and effective extrinsic uncertainty estimator that converts the direct estimation of MI into the estimation of Pointwise Mutual Information (PMI) over multiple Trajectory-Dependent Decision Processes (TDPs). UProp is evaluated on extensive multi-step decision-making benchmarks, e.g., AgentBench and HotpotQA, with state-of-the-art LLMs, e.g., GPT-4.1 and DeepSeek-V3. Experimental results demonstrate that UProp significantly outperforms existing single-turn UQ baselines equipped with thoughtful aggregation strategies. Moreover, we provide a comprehensive analysis of UProp, including sampling efficiency, potential applications, and intermediate uncertainty propagation, to demonstrate its effectiveness. Code will be available at https://github.com/jinhaoduan/UProp.
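
The abstract describes splitting decision uncertainty into a part intrinsic to the current decision and an MI part inherited from earlier decisions, with MI estimated through PMI. As a rough illustration only, the sketch below checks two textbook facts with that shape on a toy joint distribution: H(X) = H(X|Y) + I(X;Y), and MI as the expectation of PMI under the joint. The distribution, the decision labels, and all function names are hypothetical, and this is not the UProp estimator; the abstract does not state whether the paper's decomposition takes exactly this form.

```python
import math

# Hypothetical toy joint distribution p(prev, cur) over a preceding decision
# and the current decision. The numbers are illustrative, not from the paper.
joint = {
    ("search", "answer_A"): 0.40,
    ("search", "answer_B"): 0.10,
    ("lookup", "answer_A"): 0.10,
    ("lookup", "answer_B"): 0.40,
}

def marginal(joint, axis):
    """Sum the joint over the other variable (axis 0 = prev, 1 = cur)."""
    out = {}
    for key, p in joint.items():
        out[key[axis]] = out.get(key[axis], 0.0) + p
    return out

def entropy(dist):
    """Shannon entropy in nats."""
    return -sum(p * math.log(p) for p in dist.values() if p > 0)

def pmi(joint, prev, cur):
    """Pointwise mutual information: log p(prev, cur) / (p(prev) p(cur))."""
    p_prev = marginal(joint, 0)[prev]
    p_cur = marginal(joint, 1)[cur]
    return math.log(joint[(prev, cur)] / (p_prev * p_cur))

def mutual_information(joint):
    """MI computed as the expectation of PMI under the joint."""
    return sum(p * pmi(joint, prev, cur)
               for (prev, cur), p in joint.items() if p > 0)

def conditional_entropy(joint):
    """H(cur | prev) = -sum p(prev, cur) log p(cur | prev)."""
    p_prev = marginal(joint, 0)
    return -sum(p * math.log(p / p_prev[prev])
                for (prev, _cur), p in joint.items() if p > 0)

total = entropy(marginal(joint, 1))       # H(cur)
conditional = conditional_entropy(joint)  # H(cur | prev)
mi = mutual_information(joint)            # I(cur; prev)
```

In this toy case `total` equals `conditional + mi` up to floating-point error. An actual LLM setting has no tabulated joint distribution, so these quantities would have to be estimated from sampled trajectories.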

Jinhao Duan, James Diffenderfer, Sandeep Madireddy, Tianlong Chen, Bhavya Kailkhura, Kaidi Xu

Computing technology, computer technology

Jinhao Duan, James Diffenderfer, Sandeep Madireddy, Tianlong Chen, Bhavya Kailkhura, Kaidi Xu. UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making [EB/OL]. (2025-06-20) [2025-07-16]. https://arxiv.org/abs/2506.17419.
