VISTA: An End-to-End Benchmark for Visual Spec-to-Web-App Coding Agents 事件
PRODUCT_LAUNCH2026-05-27影响: MEDIUM
VISTA: An End-to-End Benchmark for Visual Spec-to-Web-App Coding Agents arXiv:2605.26144v1 Announce Type: cross Abstract: We present VISTA (VIsual Spec-To-App Benchmark), a benchmark for evaluating the end-to-end web-app generation capabilities of LLM-based agents. Unlike prior code generation benchmarks that focus on algorithmic tasks, VISTA targets realistic UI-centric development, where agents must produce functional, visually coherent applications from underspecified inputs. We define five
相关产品查看全部 (10)
相关报道查看全部 (1)
VISTA: An End-to-End Benchmark for Visual Spec-to-Web-App Coding Agents
ArXiv CS.CV2026-05-27