Op-ed: How well can AI chatbots mimic doctors in a treatment setting? We put 5 to the test

CNBC July 18, 2024
Dr. Scott Gottlieb and Shani Benezra

Key Points

– Many consumers and medical providers are turning to chatbots, powered by large language models, to answer medical questions and inform treatment choices.

– We subjected five major large language models to parts of the U.S. Medical Licensing Examination Step 3, widely regarded as the most challenging of the three steps.

– Here’s how ChatGPT, Claude, Google Gemini, Grok and Llama performed.

Dr. Scott Gottlieb is a physician and served as the 23rd Commissioner of the U.S. Food and Drug Administration. He is a CNBC contributor and serves on the boards of Pfizer and several startups in health and tech. He is also a partner at the venture capital firm New Enterprise Associates. Shani Benezra is a senior...

Topics: AI (Artificial Intelligence), Patient / Consumer, Physician, Provider, Technology

2024-07-18T10:11:47-04:00
