VideoFDB: Evaluating Full-Duplex Vision-Speech Capabilities in Conversational Agents 文章

ArXiv CS.CV2026-05-29NEWSen作者: Amrita Mazumdar, Seonwook Park, Rajarshi Roy, Nikhil Srihari, Shengze Wang, Yuhao Zhou, Julia Wang, Koki Nagano, Shalini De Mello

VideoFDB: Evaluating Full-Duplex Vision-Speech Capabilities in Conversational Agents · 相关技术

相关技术