|国家预印本平台
首页|Visual Image Reconstruction from Brain Activity via Latent Representation

Visual Image Reconstruction from Brain Activity via Latent Representation

Visual Image Reconstruction from Brain Activity via Latent Representation

来源:Arxiv_logoArxiv
英文摘要

Visual image reconstruction, the decoding of perceptual content from brain activity into images, has advanced significantly with the integration of deep neural networks (DNNs) and generative models. This review traces the field's evolution from early classification approaches to sophisticated reconstructions that capture detailed, subjective visual experiences, emphasizing the roles of hierarchical latent representations, compositional strategies, and modular architectures. Despite notable progress, challenges remain, such as achieving true zero-shot generalization for unseen images and accurately modeling the complex, subjective aspects of perception. We discuss the need for diverse datasets, refined evaluation metrics aligned with human perceptual judgments, and compositional representations that strengthen model robustness and generalizability. Ethical issues, including privacy, consent, and potential misuse, are underscored as critical considerations for responsible development. Visual image reconstruction offers promising insights into neural coding and enables new psychological measurements of visual experiences, with applications spanning clinical diagnostics and brain-machine interfaces.

Yukiyasu Kamitani、Misato Tanaka、Ken Shirakawa

计算技术、计算机技术

Yukiyasu Kamitani,Misato Tanaka,Ken Shirakawa.Visual Image Reconstruction from Brain Activity via Latent Representation[EB/OL].(2025-06-19)[2025-06-27].https://arxiv.org/abs/2505.08429.点此复制

评论