Top AI coding assistants fail one in four tasks, revealing serious gaps between hype and real-world reliability


  • The report finds that AI coding assistants fail one in four structured output tasks
  • Even advanced proprietary models only reach approximately 75% accuracy
  • Open-source AI models fare worse, averaging closer to 65% reliability

The promise of artificial intelligence as a tireless coding assistant has hit a significant roadblock after new research identified serious reliability problems in such tools.

A recent study from the University of Waterloo found that artificial intelligence struggles with software development, with even the most advanced models failing one in four structured output tasks.
