Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space 文章

ArXiv CS.AI2026-06-02NEWSen作者: Louis Mouchon

Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space · 相关事件