#Gemini 3 #Flash is crushing #benchmarks. 78% on SWE-bench Verified. The lab boys tell me that’s good. I told them I don’t pay them to tell me things are “good,” I pay them to make things that don’t explode.
Point is: reasoning, multimodality, coding, agents. Fraction of the cost.