|国家预印本平台
首页|Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions

Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions

Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions

来源:Arxiv_logoArxiv
英文摘要

Heuristic search solvers like RTDP-Bel and LAO* have proven effective for computing optimal and bounded sub-optimal solutions for Partially Observable Markov Decision Processes (POMDPs), which are typically formulated as belief MDPs. A belief represents a probability distribution over possible system states. Given a parent belief and an action, computing belief state transitions involves Bayesian updates that combine the transition and observation models of the POMDP to determine successor beliefs and their transition probabilities. However, there is a class of problems, specifically in robotics, where computing these transitions can be prohibitively expensive due to costly physics simulations, raycasting, or expensive collision checks required by the underlying transition and observation models, leading to long planning times. To address this challenge, we propose Lazy RTDP-Bel and Lazy LAO*, which defer computing expensive belief state transitions by leveraging Q-value estimation, significantly reducing planning time. We demonstrate the superior performance of the proposed lazy planners in domains such as contact-rich manipulation for pose estimation, outdoor navigation in rough terrain, and indoor navigation with a 1-D LiDAR sensor. Additionally, we discuss practical Q-value estimation techniques for commonly encountered problem classes that our lazy planners can leverage. Our results show that lazy heuristic search methods dramatically improve planning speed by postponing expensive belief transition evaluations while maintaining solution quality.

Muhammad Suhail Saleem、Rishi Veerapaneni、Maxim Likhachev

计算技术、计算机技术

Muhammad Suhail Saleem,Rishi Veerapaneni,Maxim Likhachev.Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions[EB/OL].(2025-05-30)[2025-06-27].https://arxiv.org/abs/2506.00285.点此复制

评论