PaperVoyager : Building Interactive Web with Visual Language Models 文章

ArXiv CS.CL2026-06-02NEWSen作者: Dasen Dai, Biao Wu, Meng Fang, Wenhao Wang

摘要

arXiv:2603.22999v3 Announce Type: replace Abstract: Recent advances in visual language models have enabled autonomous agents for complex reasoning, tool use, and document understanding. However, existing document agents mainly transform papers into static artifacts such as summaries, webpages, or slides, which are insufficient for technical papers involving dynamic mechanisms and state transitions. In this work, we propose a Paper-to-Interactive-System Agent that converts research papers into executable interactive web systems. Given a PDF paper, the agent performs end-to-end processing without human intervention, including paper understanding, system modeling, and interactive webpage synthesis, enabling users to manipulate inputs and observe dynamic behaviors. To evaluate this task, we introduce a benchmark of 19 research papers paired with expert-built interactive systems as ground truth.

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据