Thoughtworks Insights Follow LLM benchmarks, evals and tests: A mental model https://www.thoughtworks.com/insights/blog/generative-ai/LLM-benchmarks,-evals,-and-tests thoughtworks.com