Forbes February 11, 2024
Version 3.5 of ChatGPT could not formulate a correct diagnosis in 83 of 100 pediatric cases, according to recent research published in JAMA Pediatrics.
According to the authors of the study, 72 of the incorrect diagnoses were completely incorrect and 11 of the incorrect diagnoses were clinically related but too broad to be considered a correct diagnosis.
A caveat of this study was the large language model used represented an older version of ChatGPT. Despite this, what do these results mean for healthcare and the use of AI?
The aforementioned study underscores the importance of physician oversight when implementing AI tools and large language models in clinical medicine. AI tools are only beginning to be developed, and much more research...