CORE-T: COherent REtrieval of Tables for Text-to-SQL 文章

ArXiv CS.CL2026-05-29NEWSen作者: Hassan Soliman, Vivek Gupta, Dan Roth, Iryna Gurevych

摘要

arXiv:2601.13111v2 Announce Type: replace Abstract: Realistic text-to-SQL workflows often require joining multiple tables. As a result, accurately retrieving the relevant set of tables becomes a key bottleneck for end-to-end performance. We study an open-book setting where queries must be answered over large, heterogeneous table collections pooled from many sources, without clean scoping signals such as database identifiers. Here, dense retrieval (DR) achieves high recall but returns many distractors, while join-aware alternatives often rely on extra assumptions and/or incur high inference overhead. We propose CORE-T, a scalable, training-free framework that enriches tables with LLM-generated purpose metadata and pre-computes a lightweight table-compatibility cache. At inference time, DR returns top-K candidates; a single LLM call selects a coherent, joinable subset, and a two-step additive adjustment stage restores strongly compatible tables.

相关事件查看全部 (1)

CORE-T: COherent REtrieval of Tables for Text-to-SQL
2026-05-29PRODUCT_LAUNCH影响: MEDIUM

相关公司

暂无数据

相关人物

暂无数据