GLM-5.2 overtakes Claude Fable 5 on Design Arena HTML leaderboard as China’s open model stuns global AI rankings

GLM-5.2 wins HTML design leaderboard despite minor architectural limitations
The consistency of the template yields higher preference scores across user evaluations
Open weight model challenges price norms in competition in the AI market

Z.ai’s GLM-5.2 has knocked Anthropic’s Fable 5 off the top of Design Arena’s single-turn HTML web design leaderboard – a position the Claude model family had dominated for months.

The Chinese open weight model, built on 744 billion parameters and licensed by MIT, is now in first place, five places above its predecessor, GLM-5.1.

What makes this remarkable is that Z.ai achieved this without vision capabilities and with a model the same size as the GLM-5.1, while its closest rivals are speculated to be as much as 6.7x larger.

A price advantage that matches the performance

GLM-5.2 costs $1.40/$4.40 per 1 million tokens against Fable 5’s $10/$50 per

The model doesn’t beat Fable 5 everywhere: it ranks second on Game Dev, Data Visualization and 3D design leaderboards, and fourth on UI Components, but on website generation three specific behaviors explain its lead.

GLM-5.2 uses a consistent set of high-performance base templates that avoid the anti-patterns – such as the infamous purple gradients – that plagued previous AI-generated designs.

It also handles external dependencies like chart.js and three.js more reliably than the competition, generating a 6.0 percentage point increase in win rate across 21% of sessions using these libraries.

It implements TailwindCSS in 91% of sessions and Font Awesome in 51%, compared to Opus 4.8’s TailwindCSS usage of only 57%.

It also generates 25% more characters and lines of code than the competition, with an average generation time of 304.7 seconds, roughly double that of Fable 5.

In contrast, Fabel 5 generates 38% fewer lines of code and 29% fewer characters than its competitors, reflecting a more generalized approach that trades average output quality for variety and speed.

Mythos Timeline Exchange

The model’s release has fueled a wider public debate about how quickly China can close the capability gap with US frontier AI.

Recently, Tesla CEO Elon Musk took part in a public debate about the X and suggested that China will hit Fable-class AI capability “in Q1,” meaning the first quarter of next year.

But in a confident yet cheeky response, Z.ai co-founder Jie Tang responded with just four words – “Won’t take that long.”

The exchange drew attention because it coincided with GLM-5.2 topping a ranking that Anthropic’s models had long controlled.

Design Arena’s own analysis acknowledges that GLM-5.2’s “expert template” approach, which favors consistent and high-quality output over diversity, performs better on website generation tasks, but does not necessarily indicate broader capability parity.

In agent settings, GLM-5.2 generates 11% more files and calls 17% more tools than the competition, yet produces slightly less code overall.

The open source frontier is clearly moving faster than many expected, and what was state-of-the-art months ago is now matched by models that anyone can freely build on, tweak, and implement.

However, topping a design leaderboard does not automatically mean that a model can match the deeper reasoning of most advanced AI systems.

Google logo on black background next to text reading 'Click to follow TechRadar'