A Benchmark Construction and Evaluation Framework for Specialist Domains: Case Study on Defense-related Documents 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

A Benchmark Construction and Evaluation Framework for Specialist Domains: Case Study on Defense-related Documents arXiv:2604.17943v2 Announce Type: replace Abstract: RAG-based question-answering (QA) in specialist domains faces a cold-start problem: lack of evaluative benchmarks and absence of labeled data for post-training. We present DoRA (Domain-oriented RAG Assessment), a novel benchmark construction and evaluation framework using only a small set of specialist domain documents. DoRA system