What is Prompt Injection?
Prompt Injection
A. What is Prompt Injection
An attack that makes an AI misbehave using malicious instructions hidden in external data or web pages. It is a key security concern in the age of AI agents.
Prompt injection is an attack that plants instructions for the AI inside data it reads, such as web pages or files, getting it to override or ignore its original instructions. It can lead to information leaks or unintended actions.
The risk is especially high for AI agents that access external tools and data. Countermeasures such as validating input, minimizing privileges, and designing the system not to execute instructions from untrusted data are important.