DoD to develop scalable genAI testing datasets

Healthcare IT News January 3, 2025
Andrea Fox

Through a recently completed multipronged red-teaming effort, the agency said it will develop repeatable testing datasets that can be used to evaluate large language model tools and services in the future.

The U.S. Department of Defense’s Chief Digital and Artificial Intelligence Office and technology nonprofit Humane Intelligence announced the conclusion of the agency’s Crowdsourced Artificial Intelligence Red-Teaming Assurance Program pilot, which is focused on testing large language model chatbots used in military medical services.

The findings could ultimately improve military medical care by helping ensure that the use of AI adheres to all required risk management practices, DoD officials said.

WHY IT MATTERS

In an announcement Thursday, DoD said the CAIRT program’s most recent red-team test involved more than 200 agency clinical...

Topics: AI (Artificial Intelligence), Provider, Technology, VA / DoD

2025-01-04T23:00:19-05:00