Towards Lightweight Reliability: Using Soft Prompts for Hallucination Mitigation in Large Language Models 文章

ArXiv CS.CL2026-06-02NEWSen作者: S M Tahmid Siddiqui, Akib Jawad Ononto, Anoop Singhal, Latifur Khan

摘要

arXiv:2606.00919v1 Announce Type: new Abstract: Large language models (LLMs) have seen widespread adoption across various domains, yet their reliability is frequently undermined by hallucinations - responses that are plausible-sounding but factually incorrect. In high-stakes domains, these errors can reduce trust and introduce real-world risk. To address this challenge, we present a parameter-efficient approach that uses soft prompts to mitigate hallucinated content and promote responsible abstention in generative question-answering (QA) tasks. Our method, called Responsible Contrastive Soft Prompting (RCSP), uses a composite loss to train soft prompts that balance three goals: suppressing hallucinatory content, encouraging abstention under uncertainty, and preserving or improving factual recall. To achieve these goals, we incorporate contrastive loss, curriculum learning, and KL regularization into our training mechanism.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据