Talking Slide Avatars: Open-Source Multimodal Communication Approach for Teaching 文章

ArXiv CS.AI2026-05-26NEWSen作者: Xinxing Wu

详细信息

来源站点: ArXiv CS.AI
作者: Xinxing Wu
文章类型: NEWS
语言: en
发布日期: 2026-05-26

摘要

arXiv:2604.23703v2 Announce Type: replace-cross Abstract: Slide-based teaching is widely used in higher education, yet in online, hybrid, and asynchronous contexts, slides often lose instructor presence, narrative continuity, and expressive framing that help learners connect with course content. Full lecture video can partly restore these qualities, but it is time-consuming to record, revise, and reuse. This study presents a practice-based implementation and analytic reflection of an open-source workflow for creating talking slide avatars. The workflow integrates OpenVoice for text-to-speech and authorized voice-style conversion with Ditto-TalkingHead for audio-driven talking-image synthesis, enabling instructors to transform a short script and an authorized or synthetic portrait image into a narrated video for slide decks or HTML-based lecture materials.

Talking Slide Avatars: Open-Source Multimodal Communication Approach for Teaching 文章

详细信息

摘要

相关事件

相关公司查看全部 (3)

相关人物

相关产品查看全部 (9)

相关技术查看全部 (22)