Preference Tuning LLMs with Direct Preference Optimization Methods 文章

Hugging Face Blog2024-01-18BLOGen