Machine Psychophysics: Cognitive Control in Vision-Language Models
Machine Psychophysics: Cognitive Control in Vision-Language Models
Cognitive control refers to the ability to flexibly coordinate thought and action in pursuit of internal goals. A standard method for assessing cognitive control involves conflict tasks that contrast congruent and incongruent trials, measuring the ability to prioritize relevant information while suppressing interference. We evaluate 108 vision-language models on three classic conflict tasks and their more demanding "squared" variants across 2,220 trials. Model performance corresponds closely to human behavior under resource constraints and reveals individual differences. These results indicate that some form of human-like executive function have emerged in current multi-modal foundational models.
Hokin Deng、Yijiang Li、Dezhi Luo、Maijunxian Wang、Bingyang Wang、Tianwei Zhao
计算技术、计算机技术
Hokin Deng,Yijiang Li,Dezhi Luo,Maijunxian Wang,Bingyang Wang,Tianwei Zhao.Machine Psychophysics: Cognitive Control in Vision-Language Models[EB/OL].(2025-05-25)[2025-07-16].https://arxiv.org/abs/2505.18969.点此复制
评论