News-Medical.Net January 23, 2026
A new expert consensus made available online on 10 October 2025 and published in Volume 5, Issue 4 of the journal Intelligent Medicine on 1 November 2025, sets out a structured framework to assess large language models (LLMs) before they are introduced into clinical workflows. The guidance responds to the rapid uptake of artificial intelligence (AI) tools for diagnostic support, medical documentation, and patient communication, and the corresponding need for consistent evaluation of safety, effectiveness, and fairness.
The consensus formalizes retrospective evaluation-testing fully trained models on real or simulated clinical data in specific care contexts, without further modifying the models-to verify performance, ethical compliance, and operational readiness prior to deployment.
Developed in line with World Health Organization guideline methods and...







