InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models arXiv:2505.13878v3 Announce Type: replace-cross Abstract: Model fusion combines multiple Large Language Models (LLMs) with different strengths into a more powerful, integrated model through lightweight training methods. Existing works on model fusion focus primarily on supervised fine-tuning (SFT), leaving preference alignment (PA) --a critical phase for enhancing LLM performance--largely unexplored. The current