BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali arXiv:2605.31483v1 Announce Type: new Abstract: Despite Bengali being the sixth most spoken language in the world, no prior work has systematically evaluated hallucination in large language models (LLMs) for Bengali. We introduce BenHalluEval, a fine-grained hallucination evaluation framework for Bengali covering four tasks: Generative Question Answering (GQA), Bangla-English Code-Mixed QA, Summar