Beyond End-to-End Video Models: An LLM-Based Multi-Agent System for Educational Video Generation 文章

ArXiv CS.CL2026-06-02NEWSen作者: Lingyong Yan, Jiulong Wu, Dong Xie, Weixian Shi, Deguo Xia, Jizhou Huang

摘要

arXiv:2602.11790v2 Announce Type: replace-cross Abstract: Although recent end-to-end video generation models demonstrate impressive performance in visually oriented content creation, they remain limited in scenarios that require strict logical rigor and precise knowledge representation, such as instructional and educational media. To address this problem, we propose LASEV, a hierarchical LLM-based multi-agent system for generating high-quality instructional videos from educational problems. LASEV formulates educational video generation as a multi-objective task that simultaneously demands correct step-by-step reasoning, pedagogically coherent narration, semantically faithful visual demonstrations, and precise audio--visual alignment.

相关公司

暂无数据

相关人物

暂无数据