I've tried many different models and without doubt the code coming out of them differs a lot when it comes to "quality". Some of that is subjective for sure, but there are objective sides to "good" code. I wish this was a metric for the AI benchmarks so I could choose a model based on this, bec...
Source: [Hacker News](https://news.ycombinator.com/item?id=48488990)