Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks 文章

ArXiv CS.CV2026-05-26NEWSen作者: Tajamul Ashraf, Amal Saqib, Hanan Ghani, Muhra AlMahri, Yuhao Li, Noor Ahsan, Umair Nawaz, Jean Lahoud, Hisham Cholakkal, Mubarak Shah, Philip Torr, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman Khan

Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks · 相关技术

暂无数据