Identifying High-Confidence Social Biases in LLMs for Trustworthy Conversational Tutoring Agents 文章

ArXiv CS.CL2026-06-02NEWSen作者: Aitor Arronte Alvarez, Naiyi Xie Fincham

摘要

arXiv:2606.01584v1 Announce Type: new Abstract: Conversational tutoring agents have been shown to improve learning engagement and student outcomes, and large language models (LLMs) are increasingly used in these systems to provide scalable, personalized feedback. However, LLMs may perpetuate or amplify stereotypical social biases, posing particular risks in educational settings. In this study, we evaluate LLMs in conversational tutoring scenarios to identify high-confidence social biases, instances where models are unable to identify biased judgments in tutoring conversations while maintaining strong confidence in their assessments, potentially affecting their reasoning and the feedback they provide to learners. We present a new dataset generation method that enables bias evaluation under naturalistic instructional conditions by regenerating student-AI tutor interactions and introducing turns with controlled bias derived from a benchmark dataset.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据