RIVET: Robust Idempotent Voice Attribute Editing 文章

ArXiv CS.AI2026-06-19NEWSen作者: Dareen Alharthi, Bhuvan Koduru, Rita Singh, Bhiksha Raj

详细信息

来源站点: ArXiv CS.AI
作者: Dareen Alharthi, Bhuvan Koduru, Rita Singh, Bhiksha Raj
文章类型: NEWS
语言: en
发布日期: 2026-06-19

摘要

arXiv:2606.19629v1 Announce Type: cross Abstract: Voice attribute editing models modify characteristics such as age and gender while preserving speaker identity. In large-scale speech datasets, however, attribute annotations are often noisy or inconsistent, which can cause conditional generative models to produce unstable edits. In this work, we show that idempotency provides an effective mechanism for improving robustness to noisy labels. An idempotent operator is one for which repeated application does not change the result, i.e., f(f(x)) = f(x). Enforcing this property acts as an implicit regularizer that reduces sensitivity to mislabeled examples. We introduce RIVET, a training framework that incorporates an idempotency objective to improve robustness to label noise. We evaluate RIVET under controlled label noise and on the GLOBE dataset with naturally noisy annotations.

RIVET: Robust Idempotent Voice Attribute Editing 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (3)

相关技术查看全部 (2)