Geometric Second-Order Feature Correlation Learning for Self-Supervised Speech Emotion Recognition 文章

ArXiv CS.AI2026-06-08NEWSen作者: Shuanglin Li, Ruxiao Qian, Siyang Song

摘要

arXiv:2606.06550v1 Announce Type: cross Abstract: Self-supervised learning (SSL) yields powerful, context-rich representations for speech emotion recognition (SER), yet aggregating these representations into holistic descriptors remains a bottleneck. Conventional first-order aggregation implicitly assumes feature independence, which overlooks the latent Riemannian geometry and discards higher-order relationships essential to the representational power of the backbone. To address this problem, this paper proposes a novel Second-Order Correlation (SOC) layer. Instead of treating features in isolation, SOC models feature correlations as covariance descriptors to capture synergistic co-occurrence patterns, which serve as discriminative signatures for robust emotion recognition.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据