PEARL: Training Socratic Tutors with Pedagogically Aligned Reinforcement Learning 事件

PRODUCT_LAUNCH2026-05-29影响: MEDIUM

PEARL: Training Socratic Tutors with Pedagogically Aligned Reinforcement Learning arXiv:2605.29582v1 Announce Type: cross Abstract: Large Language Models (LLMs) have shown promise as educational tutors, yet effective tutoring requires more than solving problems: it must provide progressive Socratic guidance and balance multiple pedagogical objectives across multi-turn interactions. However, training such tutors remains challenging due to limited-fidelity and weakly controllable student simulati