Flow-OPD: On-Policy Distillation for Flow Matching Models (Event)

Name: Flow-OPD: On-Policy Distillation for Flow Matching Models
Start: 2026-05-26

Type: PRODUCT_LAUNCH
Date: 2026-05-26
Impact: MEDIUM

arXiv:2605.08063v5 (announce type: replace)

Abstract: Existing Flow Matching (FM) text-to-image models suffer from two critical bottlenecks under multi-task alignment: the reward sparsity induced by scalar-valued rewards, and the gradient interference arising from jointly optimizing heterogeneous objectives, which together give rise to a 'seesaw effect' of competing metrics and pervasive reward hacking. Inspired by the success of On-Policy

Category: Artificial Intelligence
