DisasterBench: Benchmarking LLM Planning under Typed Tool Interface Constraints 文章

ArXiv CS.CL2026-05-28NEWSen作者: Zhitong Chen, Kai Yin, Weifeng Zhang, Zhiyuan Wang, Xiangjue Dong, Chengkai Liu, Zhewei Liu, Yiming Xiao, Ali Mostafavi, James Caverlee

DisasterBench: Benchmarking LLM Planning under Typed Tool Interface Constraints · 相关技术