This Hacker Team Is Bulletproofing AI Models For Companies Like OpenAI And Anthropic

Forbes, October 29, 2024
Sarah Emerson

The researchers behind Gray Swan AI started the company after finding a major vulnerability in models from OpenAI, Anthropic, Google and Meta. Now, they build products that help safeguard them.

More than 600 hackers convened last month to compete in a “jailbreaking arena,” hoping to trick some of the world’s most popular artificial intelligence models into producing illicit content: for instance, detailed instructions for cooking meth, or a deceptive news story that argues climate change is a hoax.

The hacking event was hosted by a young and ambitious security startup called Gray Swan AI, which is working to prevent intelligent systems from causing harm by identifying their risks and building tools that help to ensure these models are deployed safely....

Topics: AI (Artificial Intelligence), Cybersecurity, Technology

2024-10-29T10:53:59-04:00
