Speech gesture generation from the trimodal context of text, audio, and speaker identity 论文

2020ACM Transactions on Graphics引用 298

Human Pose and Action RecognitionMultimodal Machine Learning ApplicationsHuman Motion and Animation

Speech gesture generation from the trimodal context of text, audio, and speaker identity · 相关事件

暂无数据