Get poetic in prompts and AI will break its guardrails
25 frontier proprietary and open-weight models yielded high attack success rates when prompted in verse, indicating a deeper, underlying problems in their ability to process ambiguity veiled in poetry.
Dec 2, 2025 7 mins
Artificial Intelligence
Generative AI
Vulnerabilities