详细信息
- 来源站点
- ArXiv CS.CL
- 作者
- Jingkun Luo, Yifan Sun, Da-Tian Peng, Guanxiong Pei
- 文章类 型
- NEWS
- 语言
- en
- 发布日期
- 2026-06-18
摘要
arXiv:2606.18946v1 Announce Type: new Abstract: Sentence-level AI-generated text detection (S-AGTD) for hybrid documents, where humans and LLMs co-author one text, faces two gaps: existing methods classify each sentence in isolation, discarding inter-sentence dependencies, and existing benchmarks omit the newest generation of generators. We construct MOSAIC, a benchmark of 16,000 hybrid documents over PubMed and XSum, generated by DeepSeek-V3.2 and Kimi K2 under stringent quality controls including a perplexity-consistency filter absent from prior benchmarks. We recast S-AGTD as structured prediction over the document sentence sequence and instantiate it as SenFlow, integrating graph-based inter-sentence propagation with linear-chain CRF decoding in a single document-level pass over a sentence graph. SenFlow reaches state-of-the-art performance on MOSAIC, with a +4.15 pp average Macro-F1 margin on cross-domain transfer, the hardest of three protocols of increasing difficulty.
相关事件
暂无数据
相关公司
暂无数据
相关人物
暂无数据