VentureBeat

Stop benchmarking in the lab: Inclusion Arena shows how LLMs perform in production

Researchers from Inclusion AI and Ant Group proposed a new LLM leaderboard that takes its data from real, in-production apps.

https://venturebeat.com/ai/stop-benchmarking-in-the-lab-inclusion-arena-shows-how-llms-perform-in-production/

AI and ML News on Bluesky (@ai-news.at.thenote.app)