MindVoice: Reconstructing Intelligible Speech from Non-invasive Neural Signals with Pretrained Priors 文章

ArXiv CS.AI2026-06-01NEWSen作者: Guangyin Bao, Taiping Zeng, Jianfeng Feng, Xiangyang Xue

摘要

arXiv:2605.31173v1 Announce Type: cross Abstract: Reconstructing continuous speech from non-invasive neural recordings is a fundamental problem for probing human auditory perception and building safe, scalable speech brain-computer interfaces. Despite recent progress, intelligible reconstruction remains elusive, as non-invasive recordings are inherently noisy, spatially blurred, and only partially preserve information about perceived speech. Existing methods directly map neural activity to entangled speech representations before synthesizing waveforms with neural vocoders, resulting in spectral-similar but unintelligible results. To overcome these limitations, we introduce MindVoice, a neuro-to-speech reconstruction framework that uses pretrained models to compensate for the incomplete semantic and acoustic information in neural recordings.

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据