ALAS: Autonomous Learning Agent for Self-Updating Language Models


Source: arXiv
Abstract

Large language models (LLMs) often have a fixed knowledge cutoff, limiting their accuracy on emerging information. We present ALAS (Autonomous Learning Agent System), a modular pipeline that continuously updates an LLM's knowledge with minimal human intervention. ALAS autonomously generates a learning curriculum for a target domain, retrieves up-to-date information from the web (with citations), distills this into question-answer training data, and fine-tunes the model through supervised fine-tuning (SFT) and direct preference optimization (DPO). It iteratively evaluates performance and revises the curriculum, enabling long-term continual learning. We demonstrate ALAS's ability to self-improve a model on rapidly evolving domains (e.g., new Python releases, latest security CVEs, academic trends), significantly boosting post-cutoff question answering accuracy (from 15% to 90% on average) without manual dataset curation. The system emphasizes modularity and reproducibility: each component (planning, retrieval, distillation, memory, fine-tuning) is interchangeable and built on standard APIs. We discuss comparative baselines (e.g., retrieval-augmented generation vs. fine-tuning) and show that ALAS achieves 90% accuracy on knowledge-updated queries with minimal engineering overhead. Finally, we outline limitations (cost, dependency on source quality) and future directions for autonomous lifelong learning in LLMs.
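The loop described in the abstract (curriculum, retrieval, distillation into question-answer pairs, fine-tuning, evaluation, curriculum revision) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: `ToyModel`, `QAPair`, `fine_tune`, `evaluate`, `run_update_loop`, the placeholder topics, and the `example.com` URLs are all hypothetical names invented here, and the "fine-tuning" step only memorizes answers instead of running SFT or DPO.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class QAPair:
    topic: str
    question: str
    answer: str
    source: str  # citation carried over from the retrieval step


@dataclass
class ToyModel:
    """Hypothetical stand-in for an LLM; it only memorizes answers."""
    memory: Dict[str, str] = field(default_factory=dict)

    def answer(self, question: str) -> str:
        return self.memory.get(question, "unknown")


def fine_tune(model: ToyModel, pairs: List[QAPair]) -> None:
    # Placeholder for the SFT + DPO stage; no gradient updates happen here.
    for pair in pairs:
        model.memory[pair.question] = pair.answer


def evaluate(model: ToyModel, eval_set: List[QAPair]) -> Tuple[float, List[str]]:
    """Return accuracy and the topics that still have wrong answers."""
    missed = [qa.topic for qa in eval_set if model.answer(qa.question) != qa.answer]
    accuracy = 1.0 - len(missed) / len(eval_set)
    return accuracy, sorted(set(missed))


def run_update_loop(
    model: ToyModel,
    curriculum: List[str],
    retrieve_and_distill: Callable[[str], List[QAPair]],
    eval_set: List[QAPair],
    target: float = 0.9,
    max_rounds: int = 3,
) -> List[float]:
    """Train, evaluate, then refocus the curriculum on topics still missed."""
    history: List[float] = []
    pending = list(curriculum)
    for _ in range(max_rounds):
        pairs = [qa for topic in pending for qa in retrieve_and_distill(topic)]
        fine_tune(model, pairs)
        accuracy, missed_topics = evaluate(model, eval_set)
        history.append(accuracy)
        if accuracy >= target:
            break
        pending = missed_topics  # revise the curriculum toward weak topics
    return history


# Toy data: placeholder questions only, no real facts or sources.
corpus = {
    "topic-a": [QAPair("topic-a", "question-a?", "answer-a", "https://example.com/a")],
    "topic-b": [QAPair("topic-b", "question-b?", "answer-b", "https://example.com/b")],
}
eval_set = corpus["topic-a"] + corpus["topic-b"]
model = ToyModel()
history = run_update_loop(model, ["topic-a"], lambda t: corpus.get(t, []), eval_set)
print(history)  # [0.5, 1.0]
```

The first round covers only the initial curriculum, so accuracy is 0.5; evaluation then surfaces the missed topic, and the second round closes the gap. Passing each stage in as a callable mirrors the abstract's claim that components are interchangeable, though the real interfaces may differ.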

Dhruv Atreja

Subject: Computing and computer technology

Dhruv Atreja. ALAS: Autonomous Learning Agent for Self-Updating Language Models [EB/OL]. (2025-08-14) [2025-09-06]. https://arxiv.org/abs/2508.15805.
