MedCTA: A Benchmark for Clinical Tool Agents 事件
ACQUISITION2026-06-11影响: HIGH
MedCTA: A Benchmark for Clinical Tool Agents arXiv:2606.11702v1 Announce Type: new Abstract: To make clinically grounded decisions, medical AI agents are expected to go beyond simple recognition and be capable of tool retrieval, evidence acquisition, and integration. Existing benchmarks largely evaluate isolated perception or single-turn question answering, and therefore provide limited visibility into failures of planning, tool recruitment, and rollout reliability. We introduce MedCTA, a bench