CNBC July 18, 2024
Key Points
– Many consumers and medical providers are turning to chatbots, powered by large language models, to answer medical questions and inform treatment choices.
– We subjected five major large language models to parts of Step 3 of the U.S. Medical Licensing Examination, widely regarded as the most challenging of the exam’s three steps.
– Here’s how ChatGPT, Claude, Google Gemini, Grok and Llama performed.
Dr. Scott Gottlieb is a physician and served as the 23rd Commissioner of the U.S. Food and Drug Administration. He is a CNBC contributor and a member of the boards of Pfizer and several startups in health and tech. He is also a partner at the venture capital firm New Enterprise Associates. Shani Benezra is a senior...