Gemini 2.5 Pro 🫥
In software development benchmarks, Gemini 2.5 Pro performance is mixed.
It scored 68.6% on the Aider Polyglot benchmark for code editing, outperforming most top-tier models. However, it scored 63.8% on SWE-bench Verified, placing second to Claude Sonnet 3.7 in broader programming tasks.
Despite this, Google says Gemini 2.5 Pro excels at creating visually compelling web apps and agentic code applications, as evidenced by its ability to create a video game from a single prompt.
The model supports a context window of one million tokens, meaning it can process the equivalent of a 750,000-word prompt, or the first six Harry Potter books. Google plans to increase this threshold to two million tokens in due course.
Gemini 2.5 Pro is currently available through the Gemini Advanced app, which requires a $20-a-month subscription, and to developers and enterprises through Google AI Studio.
Comments
Post a Comment
Empathy recommended