TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens 事件
PRODUCT_LAUNCH2026-06-01影响: MEDIUM
TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens arXiv:2605.31294v1 Announce Type: new Abstract: Recent advances in Audio-LLMs like GPT-4o have ushered in an era of conversational interaction with language models. Conversational avatars however, still seem robotic in facial expression and conversational flow, in part due to sequential stages of speech recognition, text generation, turn-based text response, speech synthesis, and audio driven facial animation. Based on our ins
TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens · 相关报道
相关报道
TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens
ArXiv CS.CV2026-06-01