GeneralThinker: Domain-General Reasoning through Likelihood-Guided Answer-Conditioned Optimization

ArXiv CS.CL · 2026-05-28 · Authors: Shengmin Piao, Sanghyun Park
