Tests reveal that ChatGPT-5 hallucinates less than GPT-4o did, and Grok is still the king of making things up


  • ChatGPT-5 scores a low 1.4% on the Hallucination Leaderboard
  • This puts it ahead of ChatGPT-4, which scores 1.8%, and GPT-4o, which scores 1.49%
  • Grok 4 is much higher at 4.8%, with Gemini-2.5 Pro at 2.6%

When OpenAI launched ChatGPT-5 on Thursday last week, one of the major selling points that CEO Sam Altman emphasized was that ChatGPT-5 was the most “powerful, smart, fastest, reliable and robust version of Chatgpt that we have ever sent”. In the presentation, OpenAI also emphasized that ChatGPT-5 would “diminish hallucinations”.

When an AI makes something up, it is called a hallucination. While the hallucination rate is falling across all LLMs, hallucinations are still surprisingly common, and they are one of the main reasons we cannot trust AI to perform a task without human supervision.
