Cross-Entropy Games and Frost Training 事件

Name: Cross-Entropy Games and Frost Training
Start: 2026-05-28

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Cross-Entropy Games and Frost Training arXiv:2605.27701v1 Announce Type: new Abstract: We present Frost Training, a method for improving Monte Carlo-based policy optimization for a large family of LLM-as-a-judge tasks called Cross-Entropy Games. The key idea is to exploit the gradient of the reward function in embedding space. This signal is used in the Greedy Coordinate Gradient (GCG) jailbreaking technique; we demonstrate for the first time that it can also be used to boost model training. We

人工智能

关系图谱

Cross-Entropy Games and Frost Training 事件

相关公司查看全部 (6)

相关人物查看全部 (1)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)