Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity arXiv:2602.15894v2 Announce Type: replace Abstract: In many large language model (LLM) alignment applications, users expect not only high-quality outputs but also substantial diversity. However, existing methods often face a fundamental trade-off between these objectives: approaches that improve output quality tend to reduce diversity, while methods that increase diversity often do so at the expense of quality. In th

Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity · 相关人物

暂无数据