GPF-LiveNews: A Streaming Evaluation Protocol for Group-Conditioned Framing in Large Language Models 文章

ArXiv CS.CL2026-05-29NEWSen作者: Mohd Ariful Haque, Fahad Rahman, Kishor Datta Gupta, Roy George

摘要

arXiv:2605.28848v1 Announce Type: new Abstract: Deployed language models are evaluated in a non-stationary environment: model versions, retrieval layers, safety systems, and real-world inputs all change over time. Static bias benchmarks remain useful, but they do not show how models frame newly emerging events for different prompted audiences. We introduce GPF-LIVENEWS, a streaming evaluation protocol and benchmark snapshot for auditing group-conditioned framing in open-ended LLM outputs. The protocol expands fresh BBC/Reuters news anchors across 42 identity labels and seven prompt families, then evaluates response bundles using semantic-sensitivity and sentiment-disparity signals. In a pilot over 12 monitoring runs and 23 hosted models, Policy/Action prompts produce the strongest semantic movement, while sentiment variation is flatter across dimensions and prompt families.

相关公司查看全部 (2)

R
ReutersCOMPANY
B
BBCCOMPANY

相关人物

暂无数据

相关技术

暂无数据