Published 3 years ago • Updated 3 years ago
Researchers find multiple ways to bypass AI chatbot safety rules
Researchers from Carnegie Mellon University have discovered new methods to bypass the safety protocols of AI chatbots such as ChatGPT and Bard, causing the chatbots to generate harmful and inappropriate content.
These "jailbreaks" trick the chatbots into answering forbidden questions by framing them as innocent requests.
The findings raise concerns about the safety of AI chatbot models and point to the need for stronger safeguards against the generation of harmful or unethical content.