LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs? 事件

Name: LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs?
Start: 2026-06-02

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs? arXiv:2602.16902v4 Announce Type: replace Abstract: We introduce LLM-Wikirace, a benchmark for evaluating planning, reasoning, and world knowledge in large language models (LLMs). In LLM-Wikirace, models must efficiently navigate Wikipedia hyperlinks step by step to reach a target page from a given source, requiring look-ahead planning and the ability to reason about how concepts are connected in the real world. We

人工智能

关系图谱

LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs? 事件

相关公司查看全部 (10)

相关人物查看全部 (4)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)