RescueBench: Can Embodied Agents Save Lives in the Wild? (Event)

PRODUCT_LAUNCH | 2026-06-02 | Impact: MEDIUM

arXiv:2606.01848v1. Announce Type: new. Abstract: Search-and-rescue (SAR) requires embodied agents to explore unfamiliar environments under multimodal uncertainty, perform multi-stage interactions, and retrieve spatial memory over long horizons. Existing benchmarks typically evaluate these capabilities in isolation, leaving it unclear how failures compound when they must be composed in realistic workflows. We introduce RescueBench, a photo-re