[AINews] Gemini 3.1 Pro: 2x 3.0 on ARC-AGI 2

Original: Swyx · 20/02/2026

Summary

*AI News for 2/18/2026-2/19/2026. We checked 12 subreddits, 544 Twitters and 24 Discords (262 channels, and 14980 messages) for you. Estimated reading time saved (at 200wpm): 1467 minutes. AINews’ website lets

Key Insights

“It’s getting a little hard to say interesting things with all the round robin minor version updates of frontier models every week, but Gemini 3.1 Pro seems like a decent enough advance.” — Discussing the rapid evolution of AI models and the significance of Gemini 3.1 Pro’s release.

“Google shipped Gemini 3.1 Pro (generally described as a Preview for developers) and rolled it out across the Gemini app, NotebookLM, Gemini API / AI Studio, and Vertex AI.” — Detailing the rollout and application of Gemini 3.1 Pro across various platforms.

“The announcement emphasized a big reasoning jump—especially ARC-AGI-2 = 77.1%—plus strong coding and agentic-tool benchmarks.” — Highlighting the significant improvements in reasoning and coding benchmarks with the release of Gemini 3.1 Pro.

Topics

Full Article

# [AINews] Gemini 3.1 Pro: 2x 3.0 on ARC-AGI 2

Author: Swyx
Published: 2026-02-20
Source: https://www.latent.space/p/ainews-gemini-31-pro-2x-30-on-arc

AI News for 2/18/2026-2/19/2026. We checked 12 subreddits, 544 Twitters and 24 Discords (262 channels, and 14980 messages) for you. Estimated reading time saved (at 200wpm): 1467 minutes. AINews’ website lets you search all past issues. As a reminder, AINews is now a section of Latent Space. You can opt in/out of email frequencies!

It’s getting a little hard to say interesting things with all the round robin minor version updates of frontier models every week, but Gemini 3.1 Pro seems like a decent enough advance to catch up, and in some cases, supercede, the fellow frontier models (this is surely the reason that 3.1 -had- to be released, because with 5.3 and 4.6 things were seriously falling behind for Google1)

[![Image](https://substackcdn.com/image/fetch/$s_!yx8y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8564929f-b251-49ac-bc93-3564e36f2cd2_2160x2700.png "Image")](https://substackcdn.com/image/fetch/$s_!yx8y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8564929f-b251-49ac-bc93-3564e36f2cd2_2160x2700.png)

It’s better at some svg design things:

[![](https://substackcdn.com/image/fetch/$s_!ccZ4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45ca8560-c21c-4986-878f-0c6bd90275f6_1200x1078.png)](https://substackcdn.com/image/fetch/$s_!ccZ4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45ca8560-c21c-4986-878f-0c6bd90275f6_1200x1078.png)

and translating textual vibes to visual aesthetics:

[![](https://substackcdn.com/image/fetch/$s_!LpTw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcef6b7d8-c9db-4e5b-80db-4d65f9535c1a_1202x1082.png)](https://substackcdn.com/image/fetch/$s_!LpTw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcef6b7d8-c9db-4e5b-80db-4d65f9535c1a_1202x1082.png)

# **AI Twitter Recap**

Top Story: Gemini 3.1 release facts and reactions/opinions

Google shipped Gemini 3.1 Pro (generally described as a Preview for developers) and rolled it out across the Gemini app, NotebookLM, Gemini API / AI Studio, and Vertex AI, positioning it as the “core intelligence” from Gemini 3 Deep Think scaled down for practical product use. The announcement emphasized a big reasoning jump—especially ARC-AGI-2 = 77.1%—plus strong coding and agentic-tool benchmarks (e.g., SWE-Bench Verified = 80.6%) and improved hallucination behavior. Independent leaderboards and evaluators largely corroborated top-tier performance and strong cost/intelligence positioning, while reaction threads highlighted (a) excitement about practical gains (SVG/web/UI/code quality, agentic use cases), (b) skepticism about benchmark-targeting and “eval tweeting,” (c) concerns around GDPval (real-world agentic tasks) not leading despite other SOTA scores, and (d) rollout friction: users finding some products (Gemini CLI / Code Assist / Antigravity) unavailable or inconsistently updated at launch.

Facts vs. opinions

[Read more](https://www.latent.space/p/ainews-gemini-31-pro-2x-30-on-arc)

Key Takeaways

Notable Quotes

It’s getting a little hard to say interesting things with all the round robin minor version updates of frontier models every week, but Gemini 3.1 Pro seems like a decent enough advance.

Context: Discussing the rapid evolution of AI models and the significance of Gemini 3.1 Pro’s release.

Google shipped Gemini 3.1 Pro (generally described as a Preview for developers) and rolled it out across the Gemini app, NotebookLM, Gemini API / AI Studio, and Vertex AI.

Context: Detailing the rollout and application of Gemini 3.1 Pro across various platforms.

The announcement emphasized a big reasoning jump—especially ARC-AGI-2 = 77.1%—plus strong coding and agentic-tool benchmarks.

Context: Highlighting the significant improvements in reasoning and coding benchmarks with the release of Gemini 3.1 Pro.

[[topics/prompt-engineering]]
[[topics/agent-native-architecture]]
[[topics/ai-agents]]

Gemini 3.1 Pro

Simon Willison · reference · 80% similar

[AINews] new Gemini 3 Deep Think, Anthropic $30B @ $380B, GPT-5.3-Codex Spark, MiniMax M2.5

Swyx · explanation · 77% similar

Gemini 3 Deep Think

Simon Willison · explanation · 74% similar

Originally published at https://www.latent.space/p/ainews-gemini-31-pro-2x-30-on-arc.

Research

Personal

Planning

[AINews] Gemini 3.1 Pro: 2x 3.0 on ARC-AGI 2

Summary

Key Insights

Topics

Full Article

Top Story: Gemini 3.1 release facts and reactions/opinions

Facts vs. opinions

Key Takeaways

Notable Quotes

Gemini 3.1 Pro

[AINews] new Gemini 3 Deep Think, Anthropic $30B @ $380B, GPT-5.3-Codex Spark, MiniMax M2.5

Gemini 3 Deep Think

Research

Personal

Planning

​Summary

​Key Insights

​Topics

​Full Article

​Top Story: Gemini 3.1 release facts and reactions/opinions

​Facts vs. opinions

​

​Key Takeaways

​Notable Quotes

​Related Topics

​Related Articles

Gemini 3.1 Pro

[AINews] new Gemini 3 Deep Think, Anthropic $30B @ $380B, GPT-5.3-Codex Spark, MiniMax M2.5

Gemini 3 Deep Think

Summary

Key Insights

Topics

Full Article

Top Story: Gemini 3.1 release facts and reactions/opinions

Facts vs. opinions

Key Takeaways

Notable Quotes

Related Topics

Related Articles