Query Symbolically or Retrieve Semantically? A Dataset and Method for Semi-Structured Question Answering 文章

ArXiv CS.AI2026-05-27NEWSen作者: Mateusz Czy\.znikiewicz, Ryszard Tuora, Adam Kozakiewicz, Tomasz Zi\k{e}tkiewicz, Mateusz Gali\'nski, Micha{\l} Godziszewski, Micha{\l} Karpowicz, Timothy Hospedales, Cristina Cornelio

摘要

arXiv:2605.27164v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems for question answering typically retrieve evidence by semantic similarity between the query and document chunks. While effective for unstructured text, this approach is less reliable on semi-structured corpora where answering may require exact filtering, aggregation, or exhaustive retrieval over structured attributes across multiple documents. Symbolic approaches support such operations, but they are often brittle on noisy natural-language corpora. We address this gap with DualGraph, a RAG framework that represents documents through two complementary views: a Textual Knowledge Graph for semantic retrieval and a Symbolic Knowledge Graph for symbolic querying over typed subject--predicate--object triples. Building on these two components, we provide multiple strategies for selecting or combining semantic and symbolic evidence.