Forbes October 29, 2024
The researchers behind Gray Swan AI started the company after finding a major vulnerability in models from OpenAI, Anthropic, Google and Meta. Now, they build products that help safeguard them.
More than 600 hackers convened last month to compete in a “jailbreaking arena,” hoping to trick some of the world’s most popular artificial intelligence models into producing illicit content: for instance, detailed instructions for cooking meth, or a deceptive news story that argues climate change is a hoax.
The hacking event was hosted by a young and ambitious security startup called Gray Swan AI, which is working to prevent intelligent systems from causing harm by identifying their risks and building tools that help to ensure these models are deployed safely....