Chart-to-Experience: Benchmarking Multimodal LLMs for Predicting Experiential Impact of Charts

Abstract

The field of Multimodal Large Language Models (MLLMs) has made remarkable progress in visual understanding tasks, presenting a vast opportunity to predict the perceptual and emotional impact of charts. However, it also raises concerns, as many applications of LLMs are based on overgeneralized assumptions from a few examples, lacking sufficient validation of their performance and effectiveness. We introduce Chart-to-Experience, a benchmark dataset comprising 36 charts, evaluated by crowdsourced workers for their impact on seven experiential factors. Using the dataset as ground truth, we evaluated capabilities of state-of-the-art MLLMs on two tasks: direct prediction and pairwise comparison of charts. Our findings imply that MLLMs are not as sensitive as human evaluators when assessing individual charts, but are accurate and reliable in pairwise comparisons.

Authors: Seon Gyeom Kim, Jae Young Choi, Ryan Rossi, Eunyee Koh, Tak Yeon Lee

Subject classification: Computing technology, computer technology

Recommended citation: Seon Gyeom Kim, Jae Young Choi, Ryan Rossi, Eunyee Koh, Tak Yeon Lee. Chart-to-Experience: Benchmarking Multimodal LLMs for Predicting Experiential Impact of Charts [EB/OL]. (2025-05-22) [2025-06-06]. https://arxiv.org/abs/2505.17374.
