Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity
Event: PRODUCT_LAUNCH | 2026-05-28 | Impact: MEDIUM
arXiv:2602.15894v2 | Announce Type: replace
Abstract: In many large language model (LLM) alignment applications, users expect not only high-quality outputs but also substantial diversity. However, existing methods often face a fundamental trade-off between these objectives: approaches that improve output quality tend to reduce diversity, while methods that increase diversity often do so at the expense of quality. In th
Related coverage: ArXiv CS.CL, 2026-05-28