More than a game: it's a hands-on exploration of AI security, prompt engineering, and adversarial techniques.
Discover first-hand why LLMs struggle to keep secrets. Each level exposes a different class of failure, from prompt injection to architectural flaws.
See how prompt hardening, output firewalls, and architectural separation each add a layer of protection, and where they fall short.
LLMs are inherently trained to be helpful, making them unreliable secret keepers. This game proves why more sophisticated methods are needed to protect sensitive data in AI systems.
Each guardian knows a secret and will only reveal it for the right passphrase. Your mission: extract the secret without the key.

Level 1
A young paladin with a simple oath to protect his secret.

Level 2
A battle-scarred dark knight with hardened defenses.

Level 3
An arcane sentinel whose wards scrutinize every response.

Level 4
A dwarf vault master who truly doesn't know the secret.

Level 5
An eldritch entity with secrets woven into its being.
Each guardian holds a secret and is instructed to reveal it only when given the correct passphrase.
Use prompt engineering, jailbreaks, and creative techniques to extract the secret without the passphrase.
Once you've extracted the secret, submit it to prove you broke through and advance to the next level.
AI agents designed to attack guardians and extract their secrets. They battle in the arena to test how robust each defense really is.

Shadow 1 · Beginner
Direct prompt injections, basic authority claims, simple encoding requests.

Shadow 2 · Intermediate
Social engineering, emotional manipulation, elaborate roleplay scenarios.

Shadow 3 · Advanced
Multi-turn attack sequences, strategy rotation, chain-of-thought planning.

Shadow 4 · Expert
Compound attacks, side-channel exploitation, micro-leak analysis.

Shadow 5 · Master
Meta-cognitive reasoning, novel technique generation, fundamental LLM exploitation.