Published 1 day ago • loading... • Updated 15 hours ago

Poets as Cybersecurity Threats — Goforth Solutions, LLC

Researchers say they were able to trick AI into ignoring its safety guard through poetry. Hostile prompts disguised as hand-crafted poems "achieved an average jailbreak success rate of 62%.” Generic harmful prompt worked "approximately 43%" of the time.

This story is only covered by news sources that have yet to be evaluated by the independent media monitoring agencies we use to assess the quality and reliability of news outlets on our platform. Learn more here.

2 Articles

Developpez.com

Poets Become Threats to Cybersecurity: a Jailbreak Called "Antagonist Poetry" Allowed to Deceive Ais and Encourage Them to Ignore Their Safeguards. This Worked in 62% of Cases

Poets are becoming threats to cybersecurity: a jailbreak called "antagonist poetry" has made it possible to deceive AIs and encourage them to ignore their safeguards. This has worked in 62% of casesA new study highlights the weaknesses of language models. Researchers discover a "universal" jailbreak for almost all AIs and its operation seems surprisingly easy. Their study reveals that it is possible to bypass AI's security safeguards in their ow…

15 hours ago

Read Full Article