The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions 文章

OpenAI Blog2024-04-19BLOGen

摘要

Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model's original instructions with their own malicious prompts.

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions 文章

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (3)