Out of the 7 benchmarks shown by Anthropic for their new Claude Opus 4.1 model, it was already beaten by Gemini 2.5 Pro on three of them.
With Gemini 3 reportedly coming out as soon as this week and Google on an absolute tear right now, things look very interesting for LLMs.
2 months ago