首页|Continual Learning for Large Language Models: A Survey

Continual Learning for Large Language Models: A Survey

来源：

英文摘要

Large language models (LLMs) are not amenable to frequent re-training, due to high training costs arising from their massive scale. However, updates are necessary to endow LLMs with new skills and keep them up-to-date with rapidly evolving human knowledge. This paper surveys recent works on continual learning for LLMs. Due to the unique nature of LLMs, we catalog continue learning techniques in a novel multi-staged categorization scheme, involving continual pretraining, instruction tuning, and alignment. We contrast continual learning for LLMs with simpler adaptation methods used in smaller models, as well as with other enhancement strategies like retrieval-augmented generation and model editing. Moreover, informed by a discussion of benchmarks and evaluation, we identify several challenges and future work directions for this crucial task.

作者：Gholamreza Haffari、Yuan-Fang Li、Shirui Pan、Linhao Luo、Tongtong Wu、Thuy-Trang Vu

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Gholamreza Haffari,Yuan-Fang Li,Shirui Pan,Linhao Luo,Tongtong Wu,Thuy-Trang Vu.Continual Learning for Large Language Models: A Survey[EB/OL].(2024-02-02)[2025-08-02].https://arxiv.org/abs/2402.01364.点此复制

Continual Learning for Large Language Models: A Survey

Continual Learning for Large Language Models: A Survey

评论