TechCrunch

Debates over AI benchmarking have reached Pokémon

Not even Pokémon is safe from AI benchmarking controversy. Last week, a post on X went viral, claiming that Google’s latest Gemini model surpassed Anthropic’s flagship Claude model in the original Pokémon video game trilogy. Reportedly, Gemini had reached Lavender Town in a developer’s Twitch stream; Claude was stuck at Mount Moon as of late […]
bsky.app
AI and ML News on Bluesky @ai-news.at.thenote.app
techcrunch.com