Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents
Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents
Recent advances in reinforcement learning (RL) have demonstrated impressive capabilities in complex decision-making tasks. This progress raises a natural question: how do these artificial systems compare to biological agents, which have been shaped by millions of years of evolution? To help answer this question, we undertake a comparative study of biological mice and RL agents in a predator-avoidance maze environment. Through this analysis, we identify a striking disparity: RL agents consistently demonstrate a lack of self-preservation instinct, readily risking ``death'' for marginal efficiency gains. These risk-taking strategies are in contrast to biological agents, which exhibit sophisticated risk-assessment and avoidance behaviors. Towards bridging this gap between the biological and artificial, we propose two novel mechanisms that encourage more naturalistic risk-avoidance behaviors in RL agents. Our approach leads to the emergence of naturalistic behaviors, including strategic environment assessment, cautious path planning, and predator avoidance patterns that closely mirror those observed in biological systems.
Shuo Han、German Espinosa、Junda Huang、Daniel A. Dombeck、Malcolm A. MacIver、Bradly C. Stadie
生物科学现状、生物科学发展生物科学研究方法、生物科学研究技术计算技术、计算机技术
Shuo Han,German Espinosa,Junda Huang,Daniel A. Dombeck,Malcolm A. MacIver,Bradly C. Stadie.Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents[EB/OL].(2025-05-17)[2025-07-21].https://arxiv.org/abs/2505.12204.点此复制
评论