Novel RL approach for efficient Elevator Group Control Systems
Novel RL approach for efficient Elevator Group Control Systems
Efficient elevator traffic management in large buildings is critical for minimizing passenger travel times and energy consumption. Because heuristic- or pattern-detection-based controllers struggle with the stochastic and combinatorial nature of dispatching, we model the six-elevator, fifteen-floor system at Vrije Universiteit Amsterdam as a Markov Decision Process and train an end-to-end Reinforcement Learning (RL) Elevator Group Control System (EGCS). Key innovations include a novel action space encoding to handle the combinatorial complexity of elevator dispatching, the introduction of infra-steps to model continuous passenger arrivals, and a tailored reward signal to improve learning efficiency. In addition, we explore various ways to adapt the discounting factor to the infra-step formulation. We investigate RL architectures based on Dueling Double Deep Q-learning, showing that the proposed RL-based EGCS adapts to fluctuating traffic patterns, learns from a highly stochastic environment, and thereby outperforms a traditional rule-based algorithm.
Nathan Vaartjes、Vincent Francois-Lavet
自动化技术、自动化技术设备计算技术、计算机技术
Nathan Vaartjes,Vincent Francois-Lavet.Novel RL approach for efficient Elevator Group Control Systems[EB/OL].(2025-06-12)[2025-07-17].https://arxiv.org/abs/2507.00011.点此复制
评论