AIs like ChatGPT fall apart in classic ‘Stroop’ psychological test – and this could stand in the way of achieving artificial general intelligence


  • A new study tasked AIs with tackling the ‘Stroop’ test
  • GPT and Claude performed very poorly compared to humans
  • There are nuances here, but in general, researchers argue that improving this side of AIs is critical to achieving artificial general intelligence

A recently published study has pointed out a limitation of big name AI models such as ChatGPT, albeit causing some controversy as the primary piece of research uses now outdated versions of these models – but there are nuances to that and that doesn’t make the findings irrelevant.

I’ll get into that map, but first let’s look at the study itself, which was highlighted on Reddit (‘New study reveals top AI models completely fail the classic ‘Stroop’ psychological attention test’) and published via Oxford University Press in the journal PNAS Nexus.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top