Healthcare IT News January 3, 2025
Through a recently completed multipronged red-teaming effort, the agency said it will develop repeatable testing datasets that can be used to evaluate large language model tools and services in the future.
The U.S. Department of Defense’s Chief Digital and Artificial Intelligence Office and technology nonprofit Humane Intelligence announced the conclusion of the agency’s Crowdsourced Artificial Intelligence Red-Teaming Assurance Program pilot, which is focused on testing large language model chatbots used in military medical services.
The findings could ultimately improve military medical care by adhering to all required risk management practices for the use of AI, DoD officials said.
WHY IT MATTERS
In an announcement Thursday, DoD said the CAIRT program’s most recent red-team test involved more than 200 agency clinical...