I have a confession: I roll my eyes at AI benchmarks. Every other week someone on Twitter posts a chart where a brand-new model is suddenly beating Opus and GPT, the replies go crazy, and then you actually use the thing and it falls apart on the first real task. Beautiful numbers, ugly code.

Source: [Dev.to](https://dev.to/danielbergholz/testing-glm-52-on-opencode-im-impressed-1780)
