Malwarebytes

Whispering poetry at AI can make it break its own rules

Malicious prompts rewritten as poems have been found to bypass AI guardrails. Which models resisted and which failed the poetic jailbreak test?
malwarebytes.com