Enhancing Paraphrase Type Generation: The Impact of DPO and RLHF Evaluated with Human-Ranked Data 文章

ArXiv CS.CL2026-06-03NEWSen作者: Christopher Lee L\"ubbers

Enhancing Paraphrase Type Generation: The Impact of DPO and RLHF Evaluated with Human-Ranked Data · 相关技术