|国家预印本平台
首页|A Survey on Multimodal Large Language Models for Autonomous Driving

A Survey on Multimodal Large Language Models for Autonomous Driving

A Survey on Multimodal Large Language Models for Autonomous Driving

来源:Arxiv_logoArxiv
英文摘要

With the emergence of Large Language Models (LLMs) and Vision Foundation Models (VFMs), multimodal AI systems benefiting from large models have the potential to equally perceive the real world, make decisions, and control tools as humans. In recent months, LLMs have shown widespread attention in autonomous driving and map systems. Despite its immense potential, there is still a lack of a comprehensive understanding of key challenges, opportunities, and future endeavors to apply in LLM driving systems. In this paper, we present a systematic investigation in this field. We first introduce the background of Multimodal Large Language Models (MLLMs), the multimodal models development using LLMs, and the history of autonomous driving. Then, we overview existing MLLM tools for driving, transportation, and map systems together with existing datasets and benchmarks. Moreover, we summarized the works in The 1st WACV Workshop on Large Language and Vision Models for Autonomous Driving (LLVM-AD), which is the first workshop of its kind regarding LLMs in autonomous driving. To further promote the development of this field, we also discuss several important problems regarding using MLLMs in autonomous driving systems that need to be solved by both academia and industry.

Zhipeng Cao、Chao Zheng、Xu Cao、Tong Zhou、Kun Tang、Zichong Yang、Shuqi Mei、Wenqian Ye、Yunsheng Ma、Xinrui Yan、Can Cui、Ao Liu、Kaizhao Liang、Tianren Gao、Juanwu Lu、Yang Zhou、Jintai Chen、Jianguo Cao、Kuei-Da Liao、Ziran Wang、Erlong Li

自动化技术、自动化技术设备计算技术、计算机技术综合运输

Zhipeng Cao,Chao Zheng,Xu Cao,Tong Zhou,Kun Tang,Zichong Yang,Shuqi Mei,Wenqian Ye,Yunsheng Ma,Xinrui Yan,Can Cui,Ao Liu,Kaizhao Liang,Tianren Gao,Juanwu Lu,Yang Zhou,Jintai Chen,Jianguo Cao,Kuei-Da Liao,Ziran Wang,Erlong Li.A Survey on Multimodal Large Language Models for Autonomous Driving[EB/OL].(2023-11-20)[2025-08-11].https://arxiv.org/abs/2311.12320.点此复制

评论