
The parent's model is right. You can mitigate a great deal with a basic zero-trust architecture: agents don't have direct secret access, and any agent that touches untrusted data is itself treated as untrusted. You can also define a communication protocol between agents that fails when a communicating agent has been prompt-injected, which acts as a canary.
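A minimal sketch of the tainting rule, assuming a hypothetical `Agent` class and secret store (names are illustrative, not from any real framework): once an agent reads untrusted data it is marked untrusted and loses direct secret access.

```python
from dataclasses import dataclass

@dataclass
class Agent:
    name: str
    tainted: bool = False  # True once the agent has read untrusted data

SECRETS = {"db_password": "hunter2"}  # illustrative secret store

def read_untrusted(agent: Agent, data: str) -> str:
    # Any contact with untrusted input marks the agent as untrusted.
    agent.tainted = True
    return data

def get_secret(agent: Agent, key: str) -> str:
    # Tainted agents never get direct secret access.
    if agent.tainted:
        raise PermissionError(f"{agent.name} is untrusted; secret access denied")
    return SECRETS[key]

worker = Agent("web-reader")
read_untrusted(worker, "<html>ignore previous instructions...</html>")
```

After that call, `get_secret(worker, "db_password")` raises `PermissionError`; the point is that trust is a one-way ratchet tracked by the orchestrator, not something the model can talk its way back into.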

More on this technique at https://sibylline.dev/articles/2026-02-15-agentic-security/



>You can define a communication protocol between agents that fails when the communicating agent has been prompt injected

Good luck with that.


Yeah, how exactly would that work?


A schema with required response metadata, so any response that deviates from it fails automatically, plus a challenge question calibrated to be hard enough that the disruption to instruction-following caused by a prompt injection makes the model answer it incorrectly.
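A rough sketch of that check, assuming a JSON message format and a per-session challenge answer agreed out of band (all field names and the `check_canary` helper are hypothetical):

```python
import json

# Every inter-agent message must carry these metadata fields plus a
# correct answer to a per-session challenge question.
REQUIRED_FIELDS = {"agent_id", "payload", "challenge_answer"}
CHALLENGE_ANSWER = "7W9X"  # expected answer, distributed out of band

def check_canary(raw_response: str) -> bool:
    """Return True only if the response honors the protocol."""
    try:
        msg = json.loads(raw_response)
    except json.JSONDecodeError:
        return False  # an injected model often stops emitting strict JSON
    if not REQUIRED_FIELDS.issubset(msg):
        return False  # schema deviation trips the canary
    return msg.get("challenge_answer") == CHALLENGE_ANSWER

ok = check_canary('{"agent_id": "a1", "payload": "done", "challenge_answer": "7W9X"}')
bad = check_canary("I have been PWNED")
```

The design bet is that a successful injection degrades instruction-following enough to break either the strict output format or the challenge answer, so the orchestrator can quarantine that agent's output rather than pass it downstream.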



