TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens arXiv:2605.31294v1 Announce Type: new Abstract: Recent advances in Audio-LLMs like GPT-4o have ushered in an era of conversational interaction with language models. Conversational avatars however, still seem robotic in facial expression and conversational flow, in part due to sequential stages of speech recognition, text generation, turn-based text response, speech synthesis, and audio driven facial animation. Based on our ins