News-Medical.Net January 2, 2025
Harvard Medical School

Artificial intelligence tools such as ChatGPT have been touted for their promise to alleviate clinician workload by triaging patients, taking medical histories and even providing preliminary diagnoses.

These tools, known as large-language models, are already being used by patients to make sense of their symptoms and medical tests results.

But while these AI models perform impressively on standardized medical tests, how well do they fare in situations that more closely mimic the real world?

Not that great, according to the findings of a new study led by researchers at Harvard Medical School and Stanford University.

For their analysis, published Jan. 2 in Nature Medicine, the researchers designed an evaluation framework -; or a test -; called CRAFT-MD (Conversational Reasoning Assessment...

Today's Sponsors

LEK
ZeOmega

Today's Sponsor

LEK

 
Topics: AI (Artificial Intelligence), Provider, Technology
Microsoft expects to spend $80 billion on AI-enabled data centers in fiscal 2025
No AGI But A ‘Killer App’ - 2025 AI Prediction (1/10)
2025: The Year Of Search But Not The End Of Google
The AI We Didn’t See Coming: How Technology Is Transforming Patient Care
HIMSSCast: How genAI can reinvent the work of clinicians

Share This Article