Background

OpenAI Introduces 'GPT-5.4,' A Model That Thinks Out Loud And Can Be Interrupted Mid-Thought

OpenAI GPT-5.4 Thinking

The rapid development of large language models (LLMs) has turned the AI industry into a highly competitive arena.

Since the arrival of ChatGPT in late 2022, companies across the technology sector have raced to build increasingly capable models that can write code, analyze data, and assist with complex knowledge work. What started as conversational AI has evolved into a broader ecosystem of reasoning systems, coding assistants, and AI agents that can interact with tools and software environments.

Within this landscape, OpenAI has continued iterating on its GPT series of models. Each new generation attempts to improve reliability, reasoning ability, and integration with real-world workflows.

Now, that progression continues with the release of 'GPT-5.4,' a new frontier model that introduces several upgrades, including specialized variants designed for deeper reasoning and higher-performance workloads.

In the announcement, OpenAI said that:

"GPT‑5.4 brings together the best of our recent advances in reasoning, coding, and agentic workflows into a single frontier model. It incorporates the industry-leading coding capabilities of GPT‑5.3‑Codex⁠ while improving how the model works across tools, software environments, and professional tasks involving spreadsheets, presentations, and documents. The result is a model that gets complex real work done accurately, effectively, and efficiently—delivering what you asked for with less back and forth."

GPT-5.4 is released in multiple forms, including GPT-5.4 Thinking and GPT-5.4 Pro.

The Thinking version is optimized for multi-step reasoning tasks, while the Pro variant is designed to allocate more compute resources to difficult problems and enterprise-scale workloads. Together they represent OpenAI’s attempt to combine reasoning, coding, and computer interaction within a single model family.

One of the most visible changes in the Thinking model is how it approaches problem solving.

Instead of immediately generating a final answer, the system can first outline the steps it intends to take. This allows the model to effectively "think out loud," giving users a brief preview of its reasoning before the response is completed. If the approach is incorrect or incomplete, the user can intervene and redirect the model before it finishes generating the output.

This ability to interrupt a model mid-reasoning changes the interaction pattern between users and AI systems.

Previously, users often had to wait for a full response and then refine the prompt through several follow-up messages. With the Thinking mode, feedback can happen while the reasoning process is still unfolding, which can reduce iteration cycles and help steer complex tasks more efficiently.
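To make the interaction pattern concrete, here is a minimal sketch of a steerable run, written as a local simulation rather than a call to any real OpenAI API. The function name steerable_run, the feedback callback, and the example plan are all hypothetical and exist only for illustration: the "model" announces one step at a time, and the user may replace the remaining steps after seeing each one.

```python
from typing import Callable, List, Optional

# Hypothetical callback: receives the step index and the announced step,
# and returns a replacement for the remaining plan, or None to continue.
Feedback = Callable[[int, str], Optional[List[str]]]


def steerable_run(plan: List[str], feedback: Feedback, max_steps: int = 50) -> List[str]:
    """Announce plan steps in order; feedback may replace the steps not yet announced."""
    announced: List[str] = []
    remaining = list(plan)
    while remaining and len(announced) < max_steps:
        step = remaining.pop(0)
        announced.append(step)
        replacement = feedback(len(announced) - 1, step)
        if replacement is not None:
            # The user interrupted: discard the rest of the old plan.
            remaining = list(replacement)
    return announced


def redirect_after_first(index: int, step: str) -> Optional[List[str]]:
    # Simulated user who corrects the approach right after the first step.
    if index == 0:
        return ["summarize by region"]
    return None


trace = steerable_run(["load CSV", "summarize by product"], redirect_after_first)
print(trace)
```

In this toy run the original second step is never announced, because the correction arrives while the plan is still unfolding. That is the difference from the older pattern, where the whole response would be produced first and fixed afterwards.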

Another notable capability of GPT-5.4 is its integration of computer-use functionality.

The model can interact with software environments by interpreting screenshots and issuing keyboard or mouse commands, allowing it to operate applications and navigate digital workflows. This makes it possible for the model to execute multi-step tasks across programs rather than simply generating text responses.
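The observe-then-act cycle described here can be sketched as a small loop. Everything in the example below is an assumption made for illustration: FakeApp stands in for a real desktop, its text "screenshot" stands in for pixels, and scripted_policy stands in for the model. It is not how GPT-5.4 is implemented, only the general shape of such a loop.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    kind: str         # "click", "type", or "done"
    target: str = ""  # element name for clicks (hypothetical)
    text: str = ""    # characters to type


@dataclass
class FakeApp:
    """Stand-in for a real application: one search box and one submit button."""
    focused: str = ""
    field_text: str = ""
    submitted: bool = False

    def screenshot(self) -> str:
        # A real system would return an image; a text description is enough here.
        return f"focused={self.focused!r} text={self.field_text!r} submitted={self.submitted}"

    def apply(self, action: Action) -> None:
        if action.kind == "click":
            self.focused = action.target
            if action.target == "submit":
                self.submitted = True
        elif action.kind == "type" and self.focused == "search_box":
            self.field_text += action.text


def scripted_policy(observation: str) -> Action:
    """Placeholder for the model: choose the next action from what is on screen."""
    if "submitted=True" in observation:
        return Action("done")
    if "focused=''" in observation:
        return Action("click", target="search_box")
    if "text=''" in observation:
        return Action("type", text="quarterly report")
    return Action("click", target="submit")


def run_episode(app: FakeApp, policy: Callable[[str], Action], max_steps: int = 10) -> List[Action]:
    """Repeat: take a screenshot, ask the policy for an action, apply it."""
    history: List[Action] = []
    for _ in range(max_steps):
        action = policy(app.screenshot())
        if action.kind == "done":
            break
        app.apply(action)
        history.append(action)
    return history


app = FakeApp()
history = run_episode(app, scripted_policy)
print([a.kind for a in history], app.field_text, app.submitted)
```

The max_steps cap is a deliberate design choice in the sketch: a loop that acts on a live environment needs some bound so that a confused policy cannot run indefinitely.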

The release also reflects a broader shift toward AI systems that function as agents rather than static chatbots. GPT-5.4 is designed to call tools, browse the web, and interact with external applications while working toward a goal. These capabilities allow the model to iterate on tasks, gather additional information, and carry out workflows that would previously require multiple separate tools or manual steps.
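A rough sketch of that agent pattern follows, under the assumption that an agent is a loop in which a decision function picks a tool, reads the result, and decides what to do next. The tool names, the canned search result, and the scripted decide function are all hypothetical; a real system would put a model in the place of decide and real services in the place of the tools.

```python
from typing import Callable, Dict, List, Optional, Tuple

Tool = Callable[[str], str]
Step = Optional[Tuple[str, str]]  # (tool name, argument), or None when finished


def fake_search(query: str) -> str:
    # Canned result standing in for a web search; no network access is used.
    return "up to one million tokens"


def word_count(text: str) -> str:
    return str(len(text.split()))


TOOLS: Dict[str, Tool] = {"search": fake_search, "word_count": word_count}


def scripted_decide(observations: List[str]) -> Step:
    """Placeholder for the model: search first, then count words, then stop."""
    if not observations:
        return ("search", "GPT-5.4 context window")
    if len(observations) == 1:
        return ("word_count", observations[0])
    return None


def run_agent(decide: Callable[[List[str]], Step], tools: Dict[str, Tool], max_steps: int = 8) -> List[str]:
    """Call tools one at a time until the decision function signals completion."""
    observations: List[str] = []
    for _ in range(max_steps):
        step = decide(observations)
        if step is None:
            break
        name, argument = step
        tool = tools.get(name)
        if tool is None:
            observations.append(f"error: unknown tool {name!r}")
        else:
            observations.append(tool(argument))
    return observations


results = run_agent(scripted_decide, TOOLS)
print(results)
```

Unknown tool names are reported back as observations instead of raising an exception, so the decision function has a chance to recover on the next step.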

OpenAI has also emphasized improvements in efficiency and scale. The model supports context windows of up to one million tokens, enabling it to process much larger documents, codebases, or datasets in a single session.

This larger context window lets the system keep earlier parts of a conversation or project in view while performing long, multi-step tasks.
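As a back-of-the-envelope illustration of what a one-million-token window means in practice, the sketch below checks whether a set of texts would fit. The four-characters-per-token ratio and the 8,000-token output reserve are assumptions chosen for the example, not figures from OpenAI; real token counts depend on the tokenizer and the content.

```python
import math
from typing import Iterable

CONTEXT_WINDOW = 1_000_000  # upper bound cited in the article


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough size estimate; the chars_per_token ratio is an assumption."""
    return math.ceil(len(text) / chars_per_token)


def fits_in_context(
    texts: Iterable[str],
    reserved_for_output: int = 8_000,  # assumed headroom for the reply
    window: int = CONTEXT_WINDOW,
) -> bool:
    """Return True if the estimated input plus the output reserve fits the window."""
    used = sum(estimate_tokens(t) for t in texts)
    return used + reserved_for_output <= window


small_codebase = ["x" * 400_000]      # about 100,000 estimated tokens
large_archive = ["x" * 4_000_000]     # about 1,000,000 estimated tokens
print(fits_in_context(small_codebase), fits_in_context(large_archive))
```

Under these assumptions, roughly four million characters of input already exhaust the window once room is left for the response.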

Performance benchmarks suggest incremental improvements across coding, reasoning, and academic tasks compared with earlier models in the GPT-5 series.

According to OpenAI’s internal evaluations, GPT-5.4 improves on previous models in areas such as software engineering benchmarks and complex problem-solving tests, while also reducing latency in many reasoning scenarios.

The release of GPT-5.4 illustrates how the focus of AI development has shifted from conversational ability toward task execution.

Whereas earlier language models were primarily evaluated on their ability to generate coherent text, newer systems are increasingly designed to plan, reason, and interact with digital tools in ways that resemble collaborative work.

In that sense, models like GPT-5.4 Thinking represent an incremental step toward more interactive AI systems: ones that expose part of their reasoning process, allow real-time guidance, and integrate more directly with software environments.

Published: 06/03/2026