Versatile Framework with Semantic and Structural guidance for Image Reconstruction from Brain Activity 文章

ArXiv CS.CV2026-06-02NEWSen作者: Yizhuo Lu, Changde Du, Qiongyi Zhou, Liuyun Jiang, Huiguang He

详细信息

来源站点: ArXiv CS.CV
作者: Yizhuo Lu, Changde Du, Qiongyi Zhou, Liuyun Jiang, Huiguang He
文章类型: NEWS
语言: en
发布日期: 2026-06-02

摘要

arXiv:2606.00121v1 Announce Type: new Abstract: Reconstructing visual stimuli from brain recordings has been a meaningful and challenging task in brain decoding. Especially, the achievement of precise and controllable image reconstruction bears great significance in propelling the progress and utilization of brain-computer interfaces. Recent methods, leveraging advances in the power of text-to-image generation models, have reconstructed images that closely approximate complex natural stimuli in terms of semantics (e.g., concepts and objects). However, they struggle to maintain consistency with the original stimuli in fine-grained structural information (e.g., position, orientation and size), which undermines both the controllability and interpretability of the models. To address the aforementioned issues, we propose a two-stage image reconstruction framework, termed MindDiffuser.

Versatile Framework with Semantic and Structural guidance for Image Reconstruction from Brain Activity 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (1)

相关技术