SAGE: Scalable AI Governance & Evaluation (Event)

REGULATION | 2026-06-06 | Impact: MEDIUM

arXiv:2602.07840v3 (announce type: replace-cross)

Abstract: Evaluating relevance in large-scale search systems is fundamentally constrained by the governance gap between nuanced, resource-constrained human oversight and the high-throughput requirements of production systems. While traditional approaches rely on engagement proxies or sparse manual review, these methods often fail to capture the full scope of high-impact relevance failures. We present \text