|国家预印本平台
首页|Novel RL approach for efficient Elevator Group Control Systems

Novel RL approach for efficient Elevator Group Control Systems

Novel RL approach for efficient Elevator Group Control Systems

来源:Arxiv_logoArxiv
英文摘要

Efficient elevator traffic management in large buildings is critical for minimizing passenger travel times and energy consumption. Because heuristic- or pattern-detection-based controllers struggle with the stochastic and combinatorial nature of dispatching, we model the six-elevator, fifteen-floor system at Vrije Universiteit Amsterdam as a Markov Decision Process and train an end-to-end Reinforcement Learning (RL) Elevator Group Control System (EGCS). Key innovations include a novel action space encoding to handle the combinatorial complexity of elevator dispatching, the introduction of infra-steps to model continuous passenger arrivals, and a tailored reward signal to improve learning efficiency. In addition, we explore various ways to adapt the discounting factor to the infra-step formulation. We investigate RL architectures based on Dueling Double Deep Q-learning, showing that the proposed RL-based EGCS adapts to fluctuating traffic patterns, learns from a highly stochastic environment, and thereby outperforms a traditional rule-based algorithm.

Nathan Vaartjes、Vincent Francois-Lavet

自动化技术、自动化技术设备计算技术、计算机技术

Nathan Vaartjes,Vincent Francois-Lavet.Novel RL approach for efficient Elevator Group Control Systems[EB/OL].(2025-06-12)[2025-07-17].https://arxiv.org/abs/2507.00011.点此复制

评论