When exploiting AI applications, I find myself using this technique really often so I figured I’d write a quick blog about it. I call it the “Metanarrative Prompt Injection.” You might have already used this before, and it might already have another name. It’s basically like breaking the fourth wall, so to speak, by directly addressing the top level AI or a specific processing step in a way that influences its behavior. And it’s pretty effective.