|国家预印本平台
首页|From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM

From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM

From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM

来源:Arxiv_logoArxiv
英文摘要

In AI-facilitated teaching, leveraging various query styles to interpret abstract educational content is crucial for delivering effective and accessible learning experiences. However, existing retrieval systems predominantly focus on natural text-image matching and lack the capacity to address the diversity and ambiguity inherent in real-world educational scenarios. To address this limitation, we develop a lightweight and efficient multi-modal retrieval module, named Uni-Retrieval, which extracts query-style prototypes and dynamically matches them with tokens from a continually updated Prompt Bank. This Prompt Bank encodes and stores domain-specific knowledge by leveraging a Mixture-of-Expert Low-Rank Adaptation (MoE-LoRA) module and can be adapted to enhance Uni-Retrieval's capability to accommodate unseen query types at test time. To enable natural language educational content generation, we integrate the original Uni-Retrieval with a compact instruction-tuned language model, forming a complete retrieval-augmented generation pipeline named Uni-RAG. Given a style-conditioned query, Uni-RAG first retrieves relevant educational materials and then generates human-readable explanations, feedback, or instructional content aligned with the learning objective. Experimental results on SER and other multi-modal benchmarks show that Uni-RAG outperforms baseline retrieval and RAG systems in both retrieval accuracy and generation quality, while maintaining low computational cost. Our framework provides a scalable, pedagogically grounded solution for intelligent educational systems, bridging retrieval and generation to support personalized, explainable, and efficient learning assistance across diverse STEM scenarios.

Xinyi Wu、Yanhao Jia、Luwei Xiao、Shuai Zhao、Fengkuang Chiang、Erik Cambria

教育信息传播、知识传播计算技术、计算机技术

Xinyi Wu,Yanhao Jia,Luwei Xiao,Shuai Zhao,Fengkuang Chiang,Erik Cambria.From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM[EB/OL].(2025-07-05)[2025-07-17].https://arxiv.org/abs/2507.03868.点此复制

评论