RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations

Abstract

Reinforcement learning (RL) can transform power grid operations by providing adaptive and scalable controllers essential for grid decarbonization. However, existing methods struggle with the complex dynamics, aleatoric uncertainty, long-horizon goals, and hard physical constraints that occur in real-world systems. This paper presents RL2Grid, a benchmark designed in collaboration with power system operators to accelerate progress in grid control and foster RL maturity. Built on a power simulation framework developed by RTE France, RL2Grid standardizes tasks, state and action spaces, and reward structures within a unified interface for a systematic evaluation and comparison of RL approaches. Moreover, we integrate real control heuristics and safety constraints informed by the operators' expertise to ensure RL2Grid aligns with grid operation requirements. We benchmark popular RL baselines on the grid control tasks represented within RL2Grid, establishing reference performance metrics. Our results and discussion highlight the challenges that power grids pose for RL methods, emphasizing the need for novel algorithms capable of handling real-world physical systems.

Authors: Enrico Marchesini, Benjamin Donnot, Constance Crozier, Ian Dytham, Christian Merz, Lars Schewe, Nico Westerbeck, Cathy Wu, Antoine Marot, Priya L. Donti

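Subject classification: power transmission and distribution engineering; automation technology and automation equipment; power generation and power plants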

Recommended citation: Enrico Marchesini, Benjamin Donnot, Constance Crozier, Ian Dytham, Christian Merz, Lars Schewe, Nico Westerbeck, Cathy Wu, Antoine Marot, Priya L. Donti. RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations [EB/OL]. (2025-03-29) [2025-04-24]. https://arxiv.org/abs/2503.23101.
