Trait-Aware Policy Optimization for Autoregressive Multi-Trait Essay Scoring
Event: PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM
arXiv:2605.25731v1 (announce type: new)
Abstract: Multi-trait essay scoring aims to provide fine-grained evaluation of writing quality across multiple dimensions. However, how to effectively post-train autoregressive scoring models remains underexplored. In this paper, we propose Trait-Aware Policy Optimization (TAPO), a post-training framework tailored to autoregressive multi-trait scoring. Our method decomposes rewards
arXiv cs.CL, 2026-05-26