MechELK: A Mechanistic Interpretability Framework for Eliciting Latent Knowledge in Large Language Models 文章

ArXiv CS.CL2026-05-29NEWSen作者: Ji-jun Park, Soo-joon Choi, Jiwon Jeong, Taeyang Yoon, Ju-Wan Lee

MechELK: A Mechanistic Interpretability Framework for Eliciting Latent Knowledge in Large Language Models · 相关人物

暂无数据