🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.
1686
Stars
86
Forks
2
技术栈
0
替代方案
相关事件