Ella: Embodied Social Agents with Lifelong Memory

Source: arXiv
Abstract

We introduce Ella, an embodied social agent capable of lifelong learning within a community in a 3D open world, where agents accumulate experiences and acquire knowledge through everyday visual observations and social interactions. At the core of Ella's capabilities is a structured, long-term multimodal memory system that stores, updates, and retrieves information effectively. It consists of a name-centric semantic memory for organizing acquired knowledge and a spatiotemporal episodic memory for capturing multimodal experiences. By integrating this lifelong memory system with foundation models, Ella retrieves relevant information for decision-making, plans daily activities, builds social relationships, and evolves autonomously while coexisting with other intelligent beings in the open world. We conduct capability-oriented evaluations in a dynamic 3D open world where 15 agents engage in social activities for days and are assessed with a suite of unseen controlled evaluations. Experimental results show that Ella can influence, lead, and cooperate with other agents well to achieve goals, showcasing its ability to learn effectively through observation and social interaction. Our findings highlight the transformative potential of combining structured memory systems with foundation models for advancing embodied intelligence. More videos can be found at https://umass-embodied-agi.github.io/Ella/.
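
The two memory components named in the abstract can be pictured with a small sketch. This is only an illustration under stated assumptions, not the authors' implementation: the class `LifelongMemory`, the `Episode` record, the text-only observations, and the distance and time-window retrieval rule are all hypothetical choices made here for clarity.

```python
import math
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Episode:
    """One experience, indexed by when and where it happened (hypothetical schema)."""
    time: float                           # simulation timestamp, arbitrary units
    position: Tuple[float, float, float]  # 3D location of the observation
    description: str                      # text standing in for a multimodal observation


class LifelongMemory:
    """Toy memory with a name-keyed semantic store and a spatiotemporal episodic store."""

    def __init__(self) -> None:
        # Semantic memory: entity name -> list of facts known about it.
        self.semantic: Dict[str, List[str]] = defaultdict(list)
        # Episodic memory: flat list of experiences, filtered at query time.
        self.episodic: List[Episode] = []

    def add_fact(self, name: str, fact: str) -> None:
        # Skip exact duplicates so repeated observations do not bloat the store.
        if fact not in self.semantic[name]:
            self.semantic[name].append(fact)

    def add_episode(self, episode: Episode) -> None:
        self.episodic.append(episode)

    def facts_about(self, name: str) -> List[str]:
        # .get avoids creating an empty entry for unknown names.
        return list(self.semantic.get(name, []))

    def episodes_near(self, position: Tuple[float, float, float], time: float,
                      radius: float, window: float) -> List[Episode]:
        # Assumed retrieval rule: keep episodes within `radius` of the query
        # position and within `window` time units of the query time.
        return [
            e for e in self.episodic
            if math.dist(e.position, position) <= radius
            and abs(e.time - time) <= window
        ]
```

In this sketch, an agent would call `add_fact("Alice", "works at the cafe")` after a conversation and `episodes_near(...)` when planning around a place and time; the name "Alice" and the fact are made-up examples. A real system would presumably store images and learned embeddings rather than plain strings.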

Hongxin Zhang, Zheyuan Zhang, Zeyuan Wang, Zunzhe Zhang, Lixing Fang, Qinhong Zhou, Chuang Gan

Computing Technology, Computer Technology

Hongxin Zhang, Zheyuan Zhang, Zeyuan Wang, Zunzhe Zhang, Lixing Fang, Qinhong Zhou, Chuang Gan. Ella: Embodied Social Agents with Lifelong Memory [EB/OL]. (2025-06-30) [2025-07-19]. https://arxiv.org/abs/2506.24019.
