Flow-OPD: On-Policy Distillation for Flow Matching Models 事件
PRODUCT_LAUNCH2026-05-26影响: MEDIUM
Flow-OPD: On-Policy Distillation for Flow Matching Models arXiv:2605.08063v5 Announce Type: replace Abstract: Existing Flow Matching (FM) text-to-image models suffer from two critical bottlenecks under multi-task alignment: the reward sparsity induced by scalar-valued rewards, and the gradient interference arising from jointly optimizing heterogeneous objectives, which together give rise to a 'seesaw effect' of competing metrics and pervasive reward hacking. Inspired by the success of On-Policy
相关人物
暂无数据
相关产品查看全部 (10)
相关报道查看全部 (1)
Flow-OPD: On-Policy Distillation for Flow Matching Models
ArXiv CS.CV2026-05-26