VISTA: An End-to-End Benchmark for Visual Spec-to-Web-App Coding Agents

Event: PRODUCT_LAUNCH | 2026-05-27 | Impact: MEDIUM

arXiv:2605.26144v1 | Announce Type: cross

Abstract: We present VISTA (VIsual Spec-To-App Benchmark), a benchmark for evaluating the end-to-end web-app generation capabilities of LLM-based agents. Unlike prior code generation benchmarks that focus on algorithmic tasks, VISTA targets realistic UI-centric development, where agents must produce functional, visually coherent applications from underspecified inputs. We define five