CiteCheck: Retrieval-Grounded Detection of LLM Citation Hallucinations in Scientific Text 文章

ArXiv CS.AI2026-05-28NEWSen作者: Khashayar Khajavi, Shaghayegh Sadeghi, Rise Adhikari, Alexander Tessier

摘要

arXiv:2605.27700v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used to generate scientific reports, but they can produce references that appear plausible while containing corrupted metadata or pointing to papers that do not exist. We introduce CiteCheck, a hybrid framework for citation hallucination detection that verifies whether a citation corresponds to a real scholarly work and whether its metadata is faithful to that work. CiteCheck retrieves candidate publications from external scholarly sources, compares the citation against the retrieved candidate using a structured LLM verifier, and maps verifier scores into three labels: Exact, Minor, and Major. We also construct a 982-citation physics benchmark with controlled corruptions that capture both subtle metadata drift and fully fabricated references. On the held-out test set, CiteCheck achieves 88.7 macro-F1 and 88.

CiteCheck: Retrieval-Grounded Detection of LLM Citation Hallucinations in Scientific Text 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (3)

相关技术查看全部 (2)