OpenAI Launches 'GPT-OSS' Open-Weight Models To Rival Meta, Mistral, And DeepSeek

Large Language Models (LLMs) are smart, and can be made even more capable with more training data and more parameters, compute power, and fine-tuning.

When OpenAI introduced ChatGPT, it ignited a worldwide battle for AI supremacy. When tech titans like Google, Microsoft, as well as younger ones like Anthropic and Perplexity and others know that bigger is better, they all scrambled to build ever‑bigger and more powerful and sophisticated LLMs.

However, not everyone chose to compete in sheer scale.

A parallel movement embraced lightweight, open-weight models, compact, developer-friendly alternatives that can be run locally or fine-tuned with ease.

Among the notable players is China’s DeepSeek, which sparked U.S. concerns after releasing DeepSeek-R1. Also in the race: Meta with its LLaMA family, Microsoft-backed Mistral from France, and several others.

These smaller models aren't 'dumb' despite their smaller size. But they demonstrated rapid escalation and adaptability.

OpenAI realizes that the LLM was isn't just about scale, but also who could host the most capable one.

This is where the company introduces lightweight GPT-OSS models, just for that purpose.

we are going to take a little more time with our open-weights model, i.e. expect it later this summer but not june.

our research team did something unexpected and quite amazing and we think it will be very very worth the wait, but needs a bit longer.
— Sam Altman (@sama) June 10, 2025

we planned to launch our open-weight model next week.

we are delaying it; we need time to run additional safety tests and review high-risk areas. we are not yet sure how long it will take us.

while we trust the community will build great things with this model, once weights are…
— Sam Altman (@sama) July 12, 2025

In a dramatic shift OpenAI makes for the first time since GPT‑2 in 2019, the company releases two open-weight reasoning models:

GPT‑OSS‑120B: This model packs ~117 billion parameters using Mixture‑of‑Experts (MoE) architecture, with ~5.1 billion parameters activated per token. It delivers performance on par with OpenAI’s proprietary o4‑mini, and can run on a single H100 GPU or enterprise infrastructure.

GPT‑OSS‑20B: A larger model with ~21 billion parameters and ~3.6 billion active per token, targets resource‑limited setups, capable of running on consumer devices with 16 GB VRAM, like laptops or desktops.

We released two open-weight reasoning models—gpt-oss-120b and gpt-oss-20b—under an Apache 2.0 license.

Developed with open-source community feedback, these models deliver meaningful advancements in both reasoning capabilities & safety.https://t.co/PdKHqDqCPf
— OpenAI (@OpenAI) August 5, 2025

These models are lightweight, and that it allows anyone to download, run, and inspect, as well as fine-tune the weights, ideal for.

Developing on-device AI agents with no cloud dependency.
Building domain‑specific assistants with fine‑tuned weights.
Research and experimentation with total transparency.

Among the things that made them lightweight, is the fact that they are text-only models.

Regardless, the are reasoning models, meaning that they support chain-of-thought reasoning, tool use, instruction-following, code execution, and structured output. As a result, despite their relative smaller size, they come with strong performance on benchmarks like MMLU, AIME, HealthBench, and Codeforces.

OpenAI framed this release as a return to democratic AI values, aiming to empower wider innovation beyond its own APIs. CEO Sam Altman said:

"We’re excited to make this model, the result of billions of dollars of research, available to the world to get AI into the hands of the most people possible."

These models are trained for agentic workflows—supporting function calling, web search, Python execution, configurable reasoning effort, and full raw chain-of-thought access. https://t.co/hPSeiNILvv
— OpenAI (@OpenAI) August 5, 2025

What started as an AI arms race driven by size and scale is now branching into an age of open adaptability. With GPT‑OSS, OpenAI hands over powerful, customizable models to developers worldwide—flexible, transparent, and battle-tested.

To OpenAI, these models matter due to their openness, flexibility, and also safety.

As for the latter, OpenAI has internally conducted adversarial fine-tuning experiments to simulate potential misuse in domains like bioweapons or cyberwar. The researchers concluded did not enable high-risk capability under the company’s Preparedness Framework

This move positions OpenAI firmly alongside rivals like Meta and DeepSeek, which have also released open-weight models.

But there’s nuance: while these models expose weights, they are not full open-source. OpenAI still keeps the training code and system-level safeguards private, and tightly governs the usage policy around GPT‑OSS. Then, there are risks that come with open-weight models. One of which, is how these models can enable new threat vectors, including weight poisoning, backdoor insertion, or automated cyberattacks using AI tooling.

Following the announcement, the two GPT‑OSS models are made available on platforms like Hugging Face, AWS SageMaker & Bedrock, Microsoft Azure AI Foundry, and Northflank—bringing them to both enterprise and edge environments.

Both gpt-oss models are free to download on Hugging Face, with native MXFP4 quantization built in for efficient deployment.

Full list of day-one support is available on our blog.https://t.co/IHgrXLnhRT
— OpenAI (@OpenAI) August 5, 2025

Published:

05/08/2025

Dark Mode

Search form

OpenAI Launches 'GPT-OSS' Open-Weight Models To Rival Meta, Mistral, And DeepSeek