REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations (Event)

Name: REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations
Start: 2026-06-02

PRODUCT_LAUNCH · 2026-06-02 · Impact: MEDIUM

arXiv:2605.12813v2 Announce Type: replace

Abstract: Large language models (LLMs) achieve strong performance across many tasks but remain vulnerable to hallucinations, making it important to systematically evaluate their reliability under realistic adversarial inputs. We formulate hallucination elicitation as a constrained optimization problem, where the goal is to find semantically coherent adversarial prompts that ar

Artificial Intelligence
