Background

'DeepSeek-V3.1' Released Quietly, Challenging GPT-5, Grok-4, Claude 4, With An Open-Source Smirk

DeepSeek-V3

Once upon a not-so-distant past, AI was nothing more than just a word popularized by science-fiction, foretold by researchers.

But since the arrival of ChatGPT from OpenAI, the technology lit a spark that ignited an explosive arms race in Silicon Valley. Suddenly tech giants raced to outdo one another by pouring billions of dollars into developing ever-more powerful generative AIs, with ambitions framed towards achieving both AGI and dominance.

Not far behind, China quietly took up the mantle, striving to build its own answer to the disruption.

And in that charged arena emerged DeepSeek, an underdog from Hangzhou, a startup that dared challenge the incumbents with leaner resources and steely ambition.

It trembled the West, and also beyond.

And this time, after a long wait and many speculations, DeepSeek seemingly awake from its slumber by unveiling 'DeepSeek-V3.1.'

Without flashy fanfare, DeepSeek slipped its V3.1 model onto Hugging Face.

It was quite a while before someone uncovered a monster: a 685-billion parameter behemoth, freely downloadable under open-source terms, playing in the same league as models costing tens or hundreds of millions in development

Picking up the pace of its predecessor, the DeepSeek-V3, this new version is large, and also capable.

It effortlessly devours a staggering 128,000-token context window, or an equivalent of about a 400-page novel. This huge processing power enables it to maintain far more coherent and lengthy interactions than most rivals in its league.

That’s not all. DeepSeek-V3.1 is essentially a hybrid architecture that blends chat-style conversations, internal reasoning mechanisms, and coding prowess; special tokens enable "thinking" and even real-time web integration.

If that isn't enough, DeepSeek makes it speak in different formats: from BF16, to the cutting-edge FP8 variants, as well as the F32.

What this means, DeepSeek-V3.1 is efficient across diverse hardware platforms.

Within mere hours of release, testers sang its praises.

Early benchmarks placed DeepSeek V3.1 at a 71.6 % score on the Aider coding benchmark, matching OpenAI and Anthropic’s best scores while operating at a fraction of their cost. The model’s inference costs are measured in pennies per coding task, delivering results comparable to systems charging nearly seventy dollars per task.

One researcher even stated that DeepSeek-V3.1 is just "1 % more than Claude Opus 4" but "68 times cheaper."

The AI world gasped.

While it has been a while since DeepSeek released the powerful DeepSeek-R1, this V3.1 stands independently as a powerful adversary offering seamless chat, reasoning, and coding within a single elegant architecture.

This move isn’t merely about tech bragging rights. It's simply a strategic jolt.

DeepSeek’s launch comes just weeks after OpenAI’s GPT-5, roughly a month after xAI unveiled Grok-4, and following Anthropic’s Claude 4. By delivering comparable performance while remaining open-source and globally accessible, DeepSeek poses a subtle yet powerful challenge to the West’s AI order—contrasting controlled access with true democratization.

A day later, DeepSeek announced this V-3.1 in an official post.

Published: 
20/08/2025