Core-based Hierarchies for Efficient GraphRAG 文章

ArXiv CS.CL2026-06-03NEWSen作者: Jakir Hossain, Ahmet Erdem Sar{\i}y\"uce

摘要

arXiv:2603.05207v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) enhances large language models by incorporating external knowledge. However, existing vector-based methods often fail on global sensemaking tasks that require reasoning across many documents. GraphRAG addresses this by organizing documents into a knowledge graph with hierarchical communities that can be recursively summarized. Current GraphRAG approaches rely on Leiden clustering for community detection, but we prove that on sparse knowledge graphs, where average degree is constant and most nodes have low degree, modularity optimization admits exponentially many near-optimal partitions, making Leiden-based communities inherently non-reproducible. To address this, we propose replacing Leiden with k-core decomposition, which yields a deterministic, density-aware hierarchy in linear time.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据