|国家预印本平台
首页|MindShot: Multi-Shot Video Reconstruction from fMRI with LLM Decoding

MindShot: Multi-Shot Video Reconstruction from fMRI with LLM Decoding

MindShot: Multi-Shot Video Reconstruction from fMRI with LLM Decoding

来源:Arxiv_logoArxiv
英文摘要

Reconstructing dynamic videos from fMRI is important for understanding visual cognition and enabling vivid brain-computer interfaces. However, current methods are critically limited to single-shot clips, failing to address the multi-shot nature of real-world experiences. Multi-shot reconstruction faces fundamental challenges: fMRI signal mixing across shots, the temporal resolution mismatch between fMRI and video obscuring rapid scene changes, and the lack of dedicated multi-shot fMRI-video datasets. To overcome these limitations, we propose a novel divide-and-decode framework for multi-shot fMRI video reconstruction. Our core innovations are: (1) A shot boundary predictor module explicitly decomposing mixed fMRI signals into shot-specific segments. (2) Generative keyframe captioning using LLMs, which decodes robust textual descriptions from each segment, overcoming temporal blur by leveraging high-level semantics. (3) Novel large-scale data synthesis (20k samples) from existing datasets. Experimental results demonstrate our framework outperforms state-of-the-art methods in multi-shot reconstruction fidelity. Ablation studies confirm the critical role of fMRI decomposition and semantic captioning, with decomposition significantly improving decoded caption CLIP similarity by 71.8%. This work establishes a new paradigm for multi-shot fMRI reconstruction, enabling accurate recovery of complex visual narratives through explicit decomposition and semantic prompting.

Wenwen Zeng、Yonghuang Wu、Yifan Chen、Xuan Xie、Chengqian Zhao、Feiyu Yin、Guoqing Wu、Jinhua Yu

生物科学研究方法、生物科学研究技术

Wenwen Zeng,Yonghuang Wu,Yifan Chen,Xuan Xie,Chengqian Zhao,Feiyu Yin,Guoqing Wu,Jinhua Yu.MindShot: Multi-Shot Video Reconstruction from fMRI with LLM Decoding[EB/OL].(2025-08-04)[2025-08-19].https://arxiv.org/abs/2508.02480.点此复制

评论