RIVET: Robust Idempotent Voice Attribute Editing 文章

ArXiv CS.AI2026-06-19NEWSen作者: Dareen Alharthi, Bhuvan Koduru, Rita Singh, Bhiksha Raj

详细信息

来源站点
ArXiv CS.AI
作者
Dareen Alharthi, Bhuvan Koduru, Rita Singh, Bhiksha Raj
文章类型
NEWS
语言
en
发布日期
2026-06-19

摘要

arXiv:2606.19629v1 Announce Type: cross Abstract: Voice attribute editing models modify characteristics such as age and gender while preserving speaker identity. In large-scale speech datasets, however, attribute annotations are often noisy or inconsistent, which can cause conditional generative models to produce unstable edits. In this work, we show that idempotency provides an effective mechanism for improving robustness to noisy labels. An idempotent operator is one for which repeated application does not change the result, i.e., f(f(x)) = f(x). Enforcing this property acts as an implicit regularizer that reduces sensitivity to mislabeled examples. We introduce RIVET, a training framework that incorporates an idempotency objective to improve robustness to label noise. We evaluate RIVET under controlled label noise and on the GLOBE dataset with naturally noisy annotations.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据