26 березня 2025 р., 09:15·2 хв читання · 289 слів·👁 14.3K↗ 16

♊️ Gemini 2.5 Pro: Google's New Champion LLM

♊️ Gemini 2.5 Pro: Google's New Champion LLM

Google DeepMind has unveiled a new "reasoning" experimental LLM, Gemini 2.5 Pro. Many were surprised by the model's name, considering that even Gemini 2.0 Pro hasn't officially graduated from experimental status yet. Stull, the performance leap justifies the jump to a new generation.

In the LMArena ranking, compiled based on blind user evaluations, Google's newcomer has scored 1443 points. The previous leaderboard champs, Grok 3 from Elon Musk's xAI and GPT-4.5 from OpenAI, hovered just above 1400 points.

Key Features:

1️⃣ In the most challenging benchmark, "Humanity's Last Exam," Gemini 2.5 Pro scores 18.8%. The thinking version of Gemini 2.0 managed only 7.2%, while o3-mini-high hit 14%.

2️⃣ The model outshines competitors in exact sciences and math while showing at least comparable results in programming.

3️⃣ Context window: 1M tokens (roughly 2-2.5 thousand PDF pages), with developers planning to expand it to 2M.

4️⃣ The model processes audio and video and understands images but can't generate pictures.

You can try the model for free in Google AI Studio.

🌐 Is Google The New Market Leader?

Once again, Google proves it's not just refusing to let the AI market slip away but is gunning for the top spot.

Before, when Google's experimental models climbed to the top of the rankings, like last December, it seemed like they were just trying to keep up with OpenAI. Now, Google's the one leading the way and setting the pace.

It's possible that the recent introduction of native image generation in Gemini 2.0 Flash prompted OpenAI to expedite the launch of a similar feature in GPT-4o. Additionally, Google's new generation of small language models, Gemma 3, provides nearly flawless results in its category.

#news #Google @hiaimediaen

#google #news

Відкрити в Telegram