A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
4747
Stars
485
Forks
2
Tech stack
0
Alternatives
Related events