Computerworld January 21, 2025
AI models answer only 46% of the questions correctly.
Today’s AI models do a poor job of providing accurate information about world history, according to a new report from the Austrian research institute Complexity Science Hub (CSH).
In an experiment, OpenAI’s GPT-4, Meta’s Llama, and Google’s Gemini were asked to answer yes or no to historical questions — and only 46% of the answers were correct. GPT-4, for example, answered “yes” to the question of whether Ancient Egypt had...