MLLM-CL: Continual Learning for Multimodal Large Language Models
Recent Multimodal Large Language Models (MLLMs) excel in vision-language understanding but struggle to adapt to dynamic real-world scenarios that require the continuous integration of new knowledge and skills. While continual learning (CL) offers a potential solution, existing benchmarks and methods suffer from critical limitations. In this paper, we introduce MLLM-CL, a novel benchmark encompassing domain and ability continual learning: the former focuses on independently and identically distributed (IID) evaluation across evolving mainstream domains, whereas the latter evaluates non-IID scenarios with emerging model abilities. Methodologically, we propose preventing catastrophic interference through parameter isolation, along with an MLLM-based routing mechanism. Extensive experiments demonstrate that our approach can integrate domain-specific knowledge and functional abilities with minimal forgetting, significantly outperforming existing methods.
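The abstract pairs parameter isolation (one frozen parameter set per domain or ability, so new training cannot overwrite old knowledge) with a router that dispatches each query to the right parameter set. A minimal sketch of that idea follows; all names here are hypothetical, and the toy keyword-matching router merely stands in for the paper's MLLM-based routing mechanism, whose details are not given in this abstract.

```python
class ExpertPool:
    """One isolated parameter set ("expert") per domain/ability.
    Training a new expert never touches earlier ones, which is the
    parameter-isolation defense against catastrophic interference."""
    def __init__(self):
        self.experts = {}  # task name -> frozen parameters

    def add_expert(self, task, params):
        self.experts[task] = dict(params)  # copy, so the caller cannot mutate it

    def forward(self, task, x):
        w = self.experts[task]
        return w["scale"] * x + w["bias"]


def route(query, keywords):
    """Toy stand-in for the MLLM-based router: pick the task whose
    keyword list best matches the query text."""
    scores = {t: sum(k in query for k in kws) for t, kws in keywords.items()}
    return max(scores, key=scores.get)


pool = ExpertPool()
pool.add_expert("medical", {"scale": 2.0, "bias": 1.0})
pool.add_expert("driving", {"scale": 0.5, "bias": 0.0})

task = route("segment the tumor in this scan",
             {"medical": ["tumor", "scan"], "driving": ["lane", "traffic"]})
print(task, pool.forward(task, 3.0))  # the medical expert handles this query
```

Because each expert's parameters are stored and frozen independently, adding a "driving" expert after "medical" leaves the medical parameters bit-for-bit unchanged; only the router decides at inference time which isolated set is used.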
Hongbo Zhao, Fei Zhu, Rundong Wang, Gaofeng Meng, Zhaoxiang Zhang
Computing Technology, Computer Technology
Hongbo Zhao, Fei Zhu, Rundong Wang, Gaofeng Meng, Zhaoxiang Zhang. MLLM-CL: Continual Learning for Multimodal Large Language Models [EB/OL]. (2025-06-05) [2025-07-25]. https://arxiv.org/abs/2506.05453.