National Preprint Platform

The Design and Development of Multimodal Ancient Chinese Large Language Model AI Jiusi 2.0

Abstract

[Objective/Significance] With the rapid development of generative artificial intelligence (AIGC), large models have evolved from large language models that handle only the text modality into models that can process multimodal data such as text, images, speech, and video. Domestic large language models for the specialized field of ancient Chinese, however, still focus mainly on improving performance on ancient Chinese information processing tasks and work primarily with text alone, leaving considerable room for development in knowledge understanding, question-answering interaction, and multimodal information processing. In response, Huazhong University of Science and Technology has launched AI Jiusi 2.0, a multimodal large language model for ancient Chinese that combines specialized knowledge of ancient Chinese with the ability to apply it and supports multimodal data processing, as an initial contribution intended to spur further work on multimodal ancient Chinese large language models. [Method/Process] This paper describes in detail the dataset construction, computing power upgrade, model training, and interface optimization of AI Jiusi 2.0, and presents the new version's performance in ancient Chinese linguistic knowledge and language ability. [Result/Conclusion] The upgraded AI Jiusi 2.0 shows significant advantages in ancient Chinese text understanding and ancient Chinese knowledge question answering, and it has acquired some ability to understand images of ancient scripts (oracle bone inscriptions and bronze inscriptions), thereby contributing to the development of large language models for ancient Chinese.

郑诗铭、刘金柱、张润哲、贺心雨、吕佳源、李志芳、余乐妍、余锁湘、谢佳延、杨纯、王锦绣、汪靓、郑苏楠、陈旷心、王金柳、张曼丽、谢雨霏、刘根辉、袁方、吴翊嘉、罗捷春、吕萍、夏婉婷、罗婉滢、刘艺溶、龚丹、余静静

Subject areas: Linguistics; Chinese language

Keywords: AI Jiusi 2.0; Ancient Chinese; Large Language Model; Multimodal

郑诗铭,刘金柱,张润哲,贺心雨,吕佳源,李志芳,余乐妍,余锁湘,谢佳延,杨纯,王锦绣,汪靓,郑苏楠,陈旷心,王金柳,张曼丽,谢雨霏,刘根辉,袁方,吴翊嘉,罗捷春,吕萍,夏婉婷,罗婉滢,刘艺溶,龚丹,余静静. The Design and Development of Multimodal Ancient Chinese Large Language Model AI Jiusi 2.0 [EB/OL]. (2025-01-26) [2025-05-28]. https://chinaxiv.org/abs/202501.00233.
