Standard SLOs can't measure LLM correctness. Split reliability into operational, structural, and semantic layers; each needs different metrics and ownership.
Source: [HackerNoon](https://hackernoon.com/designing-slos-for-llm-powered-applications-what-breaks-when-your-service-is-probably-correct?source=rss)