InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models 事件
PRODUCT_LAUNCH2026-05-26影响: MEDIUM
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models arXiv:2505.13878v3 Announce Type: replace-cross Abstract: Model fusion combines multiple Large Language Models (LLMs) with different strengths into a more powerful, integrated model through lightweight training methods. Existing works on model fusion focus primarily on supervised fine-tuning (SFT), leaving preference alignment (PA) --a critical phase for enhancing LLM performance--largely unexplored. The current
相关产品查看全部 (10)
相关报道查看全部 (1)
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
ArXiv CS.CL2026-05-26