National Institutes of Health July 23, 2024
AI model scored well on medical diagnostic quiz, but made mistakes explaining answers.
Researchers at the National Institutes of Health (NIH) found that an artificial intelligence (AI) model solved medical quiz questions—designed to test health professionals’ ability to diagnose patients based on clinical images and a brief text summary—with high accuracy. However, physician-graders found the AI model made mistakes when describing images and explaining how its decision-making led to the correct answer. The findings, which shed light on AI’s potential in the clinical setting, were published in npj Digital Medicine(link is external). The study was led by researchers from NIH’s National Library of Medicine (NLM) and Weill Cornell Medicine, New York City.
“Integration of AI into health care holds great...