|国家预印本平台
首页|Characterizing the temporal dynamics of object recognition by deep neural networks : role of depth

Characterizing the temporal dynamics of object recognition by deep neural networks : role of depth

Characterizing the temporal dynamics of object recognition by deep neural networks : role of depth

来源:bioRxiv_logobioRxiv
英文摘要

Abstract Convolutional neural networks (CNNs) have recently emerged as promising models of human vision based on their ability to predict hemodynamic brain responses to visual stimuli measured with functional magnetic resonance imaging (fMRI). However, the degree to which CNNs can predict temporal dynamics of visual object recognition reflected in neural measures with millisecond precision is less understood. Additionally, while deeper CNNs with higher numbers of layers perform better on automated object recognition, it is unclear if this also results into better correlation to brain responses. Here, we examined 1) to what extent CNN layers predict visual evoked responses in the human brain over time and 2) whether deeper CNNs better model brain responses. Specifically, we tested how well CNN architectures with 7 (CNN-7) and 15 (CNN-15) layers predicted electro-encephalography (EEG) responses to several thousands of natural images. Our results show that both CNN architectures correspond to EEG responses in a hierarchical spatio-temporal manner, with lower layers explaining responses early in time at electrodes overlying early visual cortex, and higher layers explaining responses later in time at electrodes overlying lateral-occipital cortex. While the explained variance of neural responses by individual layers did not differ between CNN-7 and CNN-15, combining the representations across layers resulted in improved performance of CNN-15 compared to CNN-7, but only after 150 ms after stimulus-onset. This suggests that CNN representations reflect both early (feed-forward) and late (feedback) stages of visual processing. Overall, our results show that depth of CNNs indeed plays a role in explaining time-resolved EEG responses.

Ramakrishnan Kandan、Scholte H. Steven、Ghebreab Sennay、Groen Iris I.A.、Smeulders Arnold W.M.

Institute of Informatics, University of Amsterdam.Department of Psychology, University of Amsterdam.Institute of Informatics, University of Amsterdam.Laboratory of Brain and Cognition, National Institute of Health.Institute of Informatics, University of Amsterdam.

10.1101/178541

生物物理学计算技术、计算机技术生物科学理论、生物科学方法

deep neural networkERParchitecturenumber of layer

Ramakrishnan Kandan,Scholte H. Steven,Ghebreab Sennay,Groen Iris I.A.,Smeulders Arnold W.M..Characterizing the temporal dynamics of object recognition by deep neural networks : role of depth[EB/OL].(2025-03-28)[2025-06-13].https://www.biorxiv.org/content/10.1101/178541.点此复制

评论