Background

xAI Releases 'Grok 4.3': Extended Context, Strong Tool Use, and a Steady Step Forward


The race between large language models has not slowed down.

Ever since OpenAI ignited the modern AI boom with ChatGPT, every major player and a wave of ambitious newcomers have pushed aggressively to outdo one another. The competition spans everything from reasoning depth and response speed to multimodal capabilities and real-world usefulness.

Benchmarks shift constantly and new releases arrive in rapid cycles, yet amid all this noise, Grok from xAI has carved out a distinctly different path.

It was not just about climbing leaderboards with successive iterations like Grok-1, Grok-2, Grok-3, and Grok-4.

What made Grok stand out was a deeper architectural bet: instead of simply scaling a single massive model, xAI experimented with how intelligence itself could be structured.

And now, xAI has just released 'Grok 4.3,' a model that goes a step beyond what Grok 4.2 managed.

First off, Grok 4.3 arrived with a one-million-token context window.

This allowed it to handle extensive documents, long conversations, or complex datasets within a single interaction. The model accepted both text and image inputs and produced text outputs. It also featured built-in reasoning that activated by default, enabling the system to think through responses before delivering them.
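To get a feel for what a one-million-token window holds, a rough capacity check can be sketched. The ~4 characters-per-token figure is a common heuristic for English text, not an exact tokenizer, and the reserved-output budget is an arbitrary illustrative choice:

```python
CONTEXT_WINDOW = 1_000_000   # tokens, per the stated Grok 4.3 spec
CHARS_PER_TOKEN = 4          # rough heuristic for English text, not a real tokenizer

def fits_in_context(text: str, reserved_for_output: int = 8_000) -> bool:
    """Crudely estimate whether a document fits, leaving room for the reply."""
    estimated_tokens = len(text) // CHARS_PER_TOKEN
    return estimated_tokens + reserved_for_output <= CONTEXT_WINDOW

# A ~3 MB plain-text corpus (~750k estimated tokens) would fit comfortably:
print(fits_in_context("x" * 3_000_000))  # → True
```

For production use, the provider's actual tokenizer or token-counting endpoint should replace the heuristic.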

Pricing came in at $1.25 per million input tokens and $2.50 per million output tokens, lower than the rates for its immediate predecessor.
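At those published rates, the cost of a request is straightforward to estimate. The helper below is illustrative arithmetic, not an official SDK:

```python
# Published Grok 4.3 rates, converted to dollars per token.
INPUT_RATE = 1.25 / 1_000_000    # $ per input token
OUTPUT_RATE = 2.50 / 1_000_000   # $ per output token

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in dollars for a single request."""
    return input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE

# A request that fills most of the context window with a short reply:
print(f"${estimate_cost(900_000, 4_000):.4f}")  # → $1.1350
```

Even a near-window-filling request lands at a little over a dollar, which is what makes large-context applications economically plausible at these rates.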

Independent evaluations positioned the model strongly on certain leaderboards.

It reached the top spots on Artificial Analysis for agentic tool calling and instruction following.

It also performed well on enterprise-oriented benchmarks from ValsAI, especially in domains such as case law and corporate finance. Its score on the Artificial Analysis Intelligence Index hit 53, placing it competitively among frontier models while preserving relatively high speed.

Developers observed practical gains in real-world tasks.

Many reported more reliable performance in sequential tool calls and stronger adherence to detailed instructions. The release followed a period of beta testing with select subscribers before the model became widely available through the xAI API and partners including OpenRouter and Vercel.
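Both the xAI API and OpenRouter expose OpenAI-compatible chat completions endpoints, so integration typically amounts to posting a standard messages payload. The sketch below only constructs that payload rather than sending it; the model identifier `grok-4.3` and the prompt text are assumptions for illustration, and the exact model name should be checked against the provider's model list:

```python
import json

def build_chat_request(prompt: str, model: str = "grok-4.3") -> dict:
    """Build an OpenAI-compatible chat completions payload (not sent here)."""
    return {
        "model": model,  # assumed identifier; verify against the provider's model list
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
    }

payload = build_chat_request("Summarize this filing in three bullet points.")
print(json.dumps(payload, indent=2))
```

The same payload shape works against either provider; only the base URL, API key, and model identifier differ.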

Reactions in the community varied but centered on usability.

Early users pointed to its speed in research and coding workflows. Others noted that benchmarks did not always carry over directly to every production environment, particularly in extended chains of interactions or highly specialized fields. Some welcomed the reduced cost for large-context applications, while a few questioned consistency in certain edge cases.

The release of Grok 4.3 continued xAI's pattern of steady, iterative updates.

Earlier models in the Grok series had emphasized speed, lower hallucination rates in targeted tests, and practical tool integration. Grok 4.3 built on that foundation with a clearer focus on efficiency and agentic capabilities rather than headline benchmark numbers alone.

For developers building applications, the extended context, native tool calling, and accessible pricing created new options in document analysis, agent-based systems, and other knowledge-intensive areas.

As with any new model, actual performance depended on implementation details and prompt design.

The API documentation offered guidance for integration, and direct testing remained the clearest way to determine suitability for specific projects.

Published: 06/05/2026