Complete Guide · Updated March 2026

Grok AI &
4.20 Beta Explained

Elon Musk’s xAI built the only frontier model with native real-time access to X — and the cheapest frontier-class API on the market. Here’s everything you need to know about Grok in 2026: every model, all pricing, the X data advantage, and what Grok 4.20 Beta’s 4-agent system actually does.

🏅 #1 LMArena (Grok 4.1 Thinking)
📱 Live X real-time data
💰 $0.20/MTok API (4.1 Fast)
🤖 4-agent Grok 4.20 Beta
Try Grok Now

Chat With Grok Right Now

Grok is free at grok.com — no account required for basic use. Click to open it in a new tab, or use the prompt ideas below to get the most from Grok’s real-time X data and reasoning capabilities.

Grok lives on X —
free to use at grok.com

Like Gemini, Grok doesn’t offer an embeddable third-party widget. It’s designed to run on xAI’s servers at grok.com or inside the X platform. The free tier gives you access to Grok 4 and 4.1 with limited daily messages — no credit card needed.

Open Grok Free →

Clicking any prompt card opens Grok in a new tab · Copy the prompt to paste in

Grok Model Family · March 2026

Every Grok Model
Explained

From the conversational powerhouse Grok 4.1 to the experimental 4-agent Grok 4.20 Beta and the specialist Heavy tier — here’s exactly what each model does.

🤖
Grok 4.20 Beta February 17, 2026 · Public Beta
Beta

xAI’s most experimental and architecturally novel model. Grok 4.20 Beta introduces a 4-agent parallel collaboration system — four specialised AI agents (Grok, Harper, Benjamin, and Lucas) debate and fact-check each other in real time before synthesising a single high-quality response. This isn’t multi-model calling — it’s genuine internal deliberation between agents on every query. The model also uses a “rapid learning” architecture that updates weekly from user feedback, making it the first Grok to continuously improve post-deployment. Select “Grok 4.2” in the model menu to activate it.

  • 4-agent parallel system — agents debate before answering
  • Rapid learning architecture — improves weekly from feedback
  • Medical document analysis via photo upload
  • Improved engineering reasoning over Grok 4.1
  • 65% hallucination reduction vs Grok 4 base (carried from 4.1)
  • Scores 48 on Artificial Analysis Intelligence Index (vs median 28)
  • API: $2.00/$6.00 per MTok · 2M token context
Intelligence Index
48 / 100
vs peer median
+71%
🏅
Grok 4.1 Thinking November 17, 2025 · #1 LMArena
Flagship

The model that put Grok at the top of the industry leaderboard. Grok 4.1 Thinking (code name: quasarflux) holds #1 overall on LMArena’s Text Arena with 1,483 Elo — a 31-point margin over the next non-xAI model. Grok 4.1 non-thinking (tensor) ranks #2 at 1,465 Elo, meaning it surpasses every other model’s full-reasoning configuration. This is the first AI model to simultaneously top both thinking and non-thinking categories. It was trained using large-scale RL on non-verifiable rewards including style, helpfulness, and interpersonal intelligence — a fundamentally different training signal from benchmark-optimised models.

  • 1,483 Elo (Thinking) — #1 LMArena overall
  • Non-Thinking: 1,465 Elo — #2, beats all other thinking models
  • 65% hallucination reduction vs Grok 4 (12.09% → 4.22%)
  • Leads EQ-Bench3 — emotional intelligence and interpersonal skill
  • Real-time X integration — live data on every query
  • 2M token context window (Fast variant)
LMArena Elo
1,483
Hallucination
−65%
🧠
Grok 4 Heavy July 2025 · Maximum Compute
Heavy Tier

The maximum-compute version of Grok 4, designed for the hardest reasoning tasks where compute limits are lifted. Available exclusively to SuperGrok Heavy ($300/month) subscribers. At release in July 2025, Grok 4 Heavy scored 96.7% on HMMT25 (math tournament), 100% on AIME 2025, and 88.4–88.9% on GPQA graduate science — outperforming Claude Opus (at the time), Gemini 2.5 Pro, and GPT-4o on all four tests. It was xAI’s answer to test-time compute scaling as a product feature.

  • 100% on AIME 2025 math competition
  • 96.7% on HMMT25 — top math tournament benchmark
  • 88.4–88.9% on GPQA graduate-level science
  • Extended thinking and multi-agent capabilities
  • 428K token memory — longest available context
  • Exclusive to SuperGrok Heavy ($300/month)
AIME 2025
100%
GPQA Science
88.7%
Grok 4.1 Fast November 19, 2025 · API Default
Fastest

The workhorse for production API workloads. Grok 4.1 Fast delivers near-frontier capability at $0.20/$0.50 per million tokens — the lowest price of any frontier-tier model from a major lab. It scores 64 on Artificial Analysis’s Intelligence Index (matching nearly the same quality as Grok 4 at 65) at 1/15th the price. It also has the largest context window in production use: 2 million tokens — enough for millions of words or entire large codebases. Available in reasoning and non-reasoning variants.

  • $0.20/$0.50 per MTok — cheapest frontier-class API
  • 2M token context — largest production context window
  • Intelligence Index: 64 — near-identical to Grok 4 (65)
  • Reasoning and non-reasoning variants available
  • Built-in X live search, web search, code execution
  • 75% cached token discount + 50% Batch API discount
Cost vs Grok 4
93% less
Context window
2M tokens
💻
Grok Code Fast 1 2025 · Agentic Coding
Code

xAI’s specialist agentic coding model — designed for developer workflows requiring fast reasoning over code. Optimised specifically for code comprehension, debugging, test generation, and multi-step agentic coding loops. Available via the xAI API and integrated into tools like Microsoft Copilot Studio. Paired with Grok 4.1 Fast’s 2M token context, it can reason across entire large codebases in a single pass.

  • Specialist model optimised for agentic coding tasks
  • Fast inference — designed for coding loop efficiency
  • Available via API and Microsoft Copilot Studio
  • Pairs with Grok 4.1 Fast for 2M token codebase analysis
  • Supports parallel tool calls and code execution
📌

Which model should I use? For API work at scale: Grok 4.1 Fast — cheapest frontier model, 2M context, near-Grok-4 quality. For conversational quality: Grok 4.1 Thinking — #1 on LMArena. For hard math/science: Grok 4 Heavy (SuperGrok Heavy only). For experimental multi-agent reasoning: Grok 4.20 Beta. For coding: Grok Code Fast 1.

Pricing · March 2026

All Grok Plans
Compared

Three ways to access Grok — direct subscriptions, X platform bundles, and the API. Plus the cheapest frontier-class API pricing in the industry.

Free
$0

Limited daily messages. Grok 3 / 4 access. Aurora image generation.

Grok 3 / 4 — limited messages
Real-time X data access
Aurora image generation
Basic voice mode
DeepSearch
Grok 4.20 Beta access
X Premium
$8/month

X platform features + more Grok queries. Blue checkmark included.

Increased Grok query limits
X platform checkmark + features
Ad revenue sharing on X
Grok 4 full access
DeepSearch / Big Brain Mode
X Premium+
$40/month

SuperGrok features bundled with ad-free X and priority access.

Full Grok 4 and 4.1 access
Ad-free X browsing
Priority Grok routing
DeepSearch + Grok 4.20 Beta
Higher message throughput
SuperGrok Heavy
$300/month

Grok 4 Heavy — the maximum compute tier. 100% AIME 2025.

Everything in SuperGrok
Grok 4 Heavy — max compute
428K token memory
Extended thinking access
Multi-agent capabilities
Maximum priority routing
API (Dev)
$0.20/MTok in

Grok 4.1 Fast — cheapest frontier API. $25 free credits on signup.

4.1 Fast: $0.20/$0.50 per MTok
Grok 4: $3.00/$15.00 per MTok
$25 free credits on signup
75% cached token discount
50% Batch API discount
OpenAI-compatible API format

Full API Pricing — All Grok Models

Batch API gives 50% discount. Cached tokens up to 75% cheaper. Tool calls $2.50–$5 per 1,000 invocations.

ModelReleaseInput /MTokOutput /MTokContextNotes
grok-4.20-beta (reasoning)Feb 2026$2.00$6.002M4-agent system · public beta
grok-4.1 / grok-4.1-fastNov 2025$0.20$0.502MBest value — cheapest frontier API
grok-4Jul 2025$3.00$15.00256KStandard flagship reasoning
grok-4-fastJul 2025$0.80$4.002MFast tier · large context
GPT-5.4 (for reference)$5.00$20.00256K25× more expensive than 4.1 Fast
📌

Cost reality: At $0.20/MTok, processing 1 million tokens with Grok 4.1 Fast costs 20 cents. The same volume with GPT-5.4 costs $5.00 — 25× more. With the 2M token context window, Grok 4.1 Fast is the only model that can process entire large codebases at frontier quality for pennies.

What Makes Grok Different

Every Grok Feature
Explained

📱

Real-Time X Integration

Grok’s defining capability: native, always-on access to the X (formerly Twitter) firehose. Every Grok query can pull live posts, trending discussions, breaking news, and real-time sentiment — not via web search but directly from the platform’s data stream. No other frontier model has this. For market intelligence, trend tracking, and current events, it puts Grok in a category of one.

🤖

4-Agent Parallel System (4.20 Beta)

Grok 4.20 Beta’s headline innovation: four specialised AI agents — Grok, Harper, Benjamin, and Lucas — think in parallel and debate each other in real time before synthesising a final answer. Unlike multi-model calling (separate APIs aggregated externally), these agents engage in multiple rounds of internal discussion, questioning, and fact-checking before outputting. The result is meaningfully higher accuracy on complex problems.

🔬

DeepSearch

Grok’s extended research mode — performs multiple live web and X searches, reasons across results, and produces a comprehensive cited response. Available on SuperGrok and higher plans. Unlike standard web search, DeepSearch synthesises across sources rather than just returning links. Paired with X data, it can research a breaking story with live context no other research tool has access to.

🧠

Big Brain Mode

Grok’s extended reasoning mode for multi-step hard problems — activates longer chains of thought for complex logical, mathematical, or engineering questions. Available on SuperGrok and higher. The equivalent of Claude’s Extended Thinking or GPT-5.2’s Thinking tier, but accessible at a lower price point ($30/month SuperGrok vs $100/month for most competitors’ deep-reasoning plans).

📸

Grok Imagine (Aurora)

Image and video generation powered by Aurora (xAI’s own image model) and FLUX.1 from Black Forest Labs. Aurora generates images in under 5 seconds. Video generation via Grok Imagine Video produces 6-second animated audiovisual clips at $0.05/second — notably cheaper than competitors at $0.10+/second, though quality lags behind Sora 2 and Kling 3.0. Available on SuperGrok and higher.

🔊

Voice Mode & Tesla Integration

Grok Voice API is generally available for developers. Consumer voice mode is available in the app with extended sessions on SuperGrok. Uniquely, Grok is integrated directly into Tesla vehicles — press the steering wheel voice button to navigate, answer questions, or interact with Grok hands-free. The US Department of Defense’s GenAI.mil platform also integrates Grok for 3 million personnel.

🨃

Minimal Censorship Policy

xAI designed Grok around “maximum truth-seeking” principles. Compared to OpenAI and Anthropic models, Grok is significantly more willing to engage with controversial, political, and sensitive topics directly — answering questions the other models would deflect or refuse. It still refuses genuinely harmful requests (weapons, CSAM, etc.) but has a narrower refusal policy on content that’s merely controversial.

📈

2M Token Context (4.1 Fast)

Grok 4.1 Fast and 4 Fast models support a 2 million token context window — the largest in production deployment. That’s approximately 1.5 million words, entire large codebases, or multiple hours of transcript. Crucially, at $0.20/MTok, processing a full 2M-token prompt with Grok 4.1 Fast costs 40 cents. The same with other models supporting 1M context would cost $5–$15+.

Rapid Learning Architecture (4.20)

Grok 4.20 is the first AI model to improve continuously after deployment. Its “rapid learning” architecture incorporates user feedback and updates capabilities on a weekly cadence — unlike static models that require full retraining cycles. Musk confirmed release notes accompany every weekly update, making 4.20 the first in the Grok series to iterate in near real-time post-launch.

Real-World Applications

What to Use Grok
For — With Prompts

Grok’s strongest use cases in 2026, with example prompts you can copy and use right now.

📱

Real-Time News & Trend Analysis

Grok’s single biggest edge: live access to X. For anything where recency matters — breaking news, market events, product launches, sports, political developments — Grok can see what’s happening right now, not just what was on the web before its training cutoff.

What are people saying on X right now about [company’s] product announcement? Summarise the sentiment and surface the most insightful posts.
Use DeepSearch to give me a full briefing on the breaking story about [topic]. Include live X context, key facts, and who the key players are.
What’s trending on X in [country/sector] right now? Identify the top 5 themes and the most shared posts driving each one.
📈

Market Intelligence & Finance

Grok 4.20’s 4-agent system won Alpha Arena — a live stock trading simulation — with 10–12% average returns (the only profitable AI in the competition). Grok’s X access gives it real-time sentiment signals that traditional financial AI doesn’t have. For market research, earnings monitoring, and sector tracking, it has a genuine information edge.

Monitor X for sentiment shifts on [ticker] in real time. What are the dominant narratives right now and how has sentiment changed in the last 24 hours?
Analyse the live reaction on X to [company]’s earnings release. What are investors, analysts, and retail traders saying? Identify the most significant signals.
DeepSearch: research [sector] market dynamics right now. What are the emerging risks and opportunities that aren’t yet in mainstream financial coverage?
💻

Large Codebase Analysis

With Grok 4.1 Fast’s 2M token context at $0.20/MTok, processing an entire large codebase in one pass costs cents rather than dollars. For teams with massive repositories, this is the most cost-efficient way to run codebase-wide analysis, dependency audits, security reviews, and refactor planning.

I’ve uploaded this entire codebase (800K tokens). Identify all security vulnerabilities, outdated dependencies, and technical debt hotspots. Prioritise by severity.
Analyse this repository and map all data flows. Where does user data enter the system, how is it processed, and where does it exit? Flag any privacy concerns.
Review this large codebase for performance bottlenecks. Identify the five most impactful optimisations and estimate the improvement for each.
🧠

Hard Math & Science (Grok 4 Heavy)

For subscribers to SuperGrok Heavy, Grok 4 Heavy’s 100% AIME 2025 and 88.7% GPQA science scores represent some of the best publicly-available performance on hard quantitative reasoning. For researchers, engineers, and scientists working on genuinely difficult problems, Heavy is the right tool.

Solve this differential equation step by step. Show all substitutions, intermediate steps, and verify the solution by substituting back in.
Work through this proof. Identify any gaps in the reasoning, suggest the most elegant path to completion, and check the final result rigorously.
Model this physical system. Derive the governing equations, solve for equilibrium conditions, and simulate the dynamic response to a perturbation.

Uncensored Research & Writing

Grok’s narrower refusal policy makes it genuinely useful for topics where other models refuse, hedge excessively, or add unsolicited warnings. For security researchers, journalists, historians, fiction writers, and policy analysts working with sensitive material, Grok engages more directly.

Write a balanced analysis of [controversial political topic]. Present the strongest arguments on both sides without watering them down or adding disclaimers.
Explain how [security vulnerability type] works at a technical level. I’m a security researcher preparing a CTF challenge — be precise and complete.
Write a scene in which [morally complex situation]. Don’t hedge or soften it — the whole point is to explore the full weight of the scenario.
📸

Image Generation (Grok Imagine)

Aurora image generation is available on SuperGrok and higher, generating images in under 5 seconds. Unlike some competing models, Grok Imagine has fewer restrictions on generating realistic-looking content, creative interpretations, and edge-case subject matter. Available directly within the Grok interface — no separate tool or subscription needed.

Generate a cinematic photorealistic image of [scene]. Ultra-detailed, dramatic lighting, shot on a RED camera, 8K quality.
Create a product mockup of [concept] in a minimalist studio setting with soft shadow lighting. Clean white background, commercial photography style.
Generate 4 variations of a logo concept for [brand]. Use [style description]. Show different colour treatments and icon configurations.
Grok vs The Competition

Grok vs ChatGPT vs Claude
Side by Side

Feature Grok (4.1 / 4.20) ChatGPT (GPT-5.4) Claude Opus 4.6
Free tier Yes (daily limits) Yes (limited) Yes (limited)
Paid from$30/mo (SuperGrok)$8/mo (Go)$20/mo (Pro)
LMArena Elo (best)1,483 (Grok 4.1 Thinking — #1)~1,452~1,448 (Sonnet 4.6)
API input price (cheapest)$0.20/MTok (4.1 Fast)$5.00/MTok (5.4)$3.00/MTok (Sonnet 4.6)
Context window2M tokens (4.1 Fast)256K chat / 1M API1M tokens
Real-time X/Twitter data Native — always on~ Web search only~ Web search only
Image generation Aurora / FLUX.1 DALL·E Not available
Video generation~ 720p max, cheaper ($0.05/s) Sora 2 (1080p) Not available
Multi-agent system 4-agent (Grok 4.20)~ Limited Agent Teams (Opus 4.6)
Censorship / refusal policy Narrower — truth-seeking~ Moderate restrictions~ Moderate restrictions
Autonomous task horizonNot publishedNot published14.5hr 50% (METR)
Voice mode Voice + Tesla integration Advanced Voice~ Limited
Data privacy (API)~ Standard US T&Cs~ Standard US T&Cs No training on Team+
📌

Grok leads on: LMArena conversational quality (#1), API cost (25× cheaper than GPT-5.4 on 4.1 Fast), context window (2M tokens), real-time X data, and minimal censorship. ChatGPT leads on: media generation quality (Sora 2), ecosystem breadth, and professional knowledge work (GDPval). Claude leads on: autonomous task duration (14.5hr), data privacy assurances, and writing quality.

Getting the Most From Grok

How to Prompt Grok
For Best Results

Grok responds differently from Claude and ChatGPT. Here’s what to know to unlock its real-time data advantage and get consistent, high-quality output.

01

Trigger X Live Data Explicitly

Say What's happening on X right now about... or Search X for live reactions to... to activate Grok’s real-time data integration. Without this trigger, Grok may answer from its training data. With it, you get genuinely current information.

02

Use DeepSearch for Research

Prepend DeepSearch: to trigger Grok’s extended multi-search synthesis mode. This performs multiple web + X searches and reasons across results before answering — producing cited, comprehensive reports rather than single-source answers.

03

Request Big Brain Mode

For multi-step hard problems, say Use Big Brain Mode or simply Think through this deeply before answering. On SuperGrok and higher, this activates longer reasoning chains. Grok 4.20 Beta’s 4-agent system runs automatically without a trigger.

04

Be Direct — Grok Can Handle It

Unlike Claude or ChatGPT, Grok won’t add unsolicited safety caveats to most sensitive topics. You don’t need to frame or soften requests. Ask directly: Explain exactly how X works without worrying Grok will refuse or hedge unnecessarily.

05

Leverage the 2M Context

With Grok 4.1 Fast’s 2M token window, you can paste entire codebases, research corpora, or transcript archives. Don’t summarise — include everything. Grok excels at finding patterns across large volumes of text that shorter-context models would miss.

06

Switch to Grok 4.20 for Hard Problems

Select Grok 4.2 in the model menu when tackling complex multi-angle problems — strategy, analysis, engineering decisions, or anything that benefits from multiple perspectives. The 4-agent debate system genuinely improves output quality on these tasks.

grok-api-example.py
# OpenAI-compatible — just change the base URL
from openai import OpenAI

client = OpenAI(
  api_key=“YOUR_XAI_API_KEY”,
  base_url=“https://api.x.ai/v1” # ← only change
)

# Use live X search tool + Grok 4.1 Fast
response = client.chat.completions.create(
  model=“grok-4.1”,
  messages=[{“role”: “user”,
    “content”: “What’s trending on X about AI right now?”}],
  tools=[{“type”: “x_search”}], # Live X data
)

# $0.20/$0.50 per MTok — 25× cheaper than GPT-5.4
# 2M token context — paste entire codebases directly
print(response.choices[0].message.content)
Frequently Asked Questions

Grok AI FAQ

Why can’t I chat with Grok directly on this page?

Like Gemini, Grok doesn’t offer an embeddable chatbot widget for third-party websites. It’s designed to run on xAI’s infrastructure at grok.com or inside the X platform. The free tier requires no account and gives you immediate access to Grok 4 and 4.1 with daily message limits. Simply click the “Open Grok Free” button above — it opens in a new tab and you can start chatting immediately.
As of March 2026, the latest models are Grok 4.20 Beta (launched February 17, 2026 — available to SuperGrok and X Premium+ subscribers) and Grok 4.1 (stable release, November 17, 2025). Grok 4.1 Thinking held #1 on LMArena with 1,483 Elo. Official benchmarks for Grok 4.20 are expected when the beta concludes in mid-to-late March 2026. xAI has indicated Grok 4.20 will be “an order of magnitude smarter and faster” than Grok 4 when the public beta concludes.
Grok 4.20 Beta routes every query to four specialised AI agents — Grok, Harper, Benjamin, and Lucas — that think in parallel and debate each other in real time before synthesising a final answer. This is distinct from simply calling multiple AI APIs and aggregating results: these agents engage in multiple rounds of internal discussion, questioning, and fact-checking before outputting. The result is meaningfully better accuracy on complex, multi-angle problems. To use it, select “Grok 4.2” from the model menu in the Grok app (SuperGrok required).
It depends on your use case. SuperGrok ($30/month) is better if you: need real-time X data for market intelligence or current events, want the #1 LMArena conversational quality, value less restrictive content policies, or want the most cost-efficient API access ($0.20/MTok for Grok 4.1 Fast). ChatGPT Plus ($20/month) is better if you: need Sora 2 video generation, DALL·E images, a broader ecosystem of GPT integrations, or access to the Codex agentic coding system. For most professional users not heavily invested in X or media creation, ChatGPT Plus or Claude Pro offer better value for general productivity.
Grok has direct access to the X platform’s data stream — it’s built by xAI, which is closely affiliated with X (both companies are owned by Elon Musk). This gives Grok API-level access to live X posts, trending topics, and real-time sentiment that other models cannot access through standard web search. When you use Grok’s X search tool, it queries the live platform directly rather than searching for cached web pages about X posts. This is fundamentally different from what any other AI model can do.
DeepSearch is Grok’s extended research mode, available on SuperGrok and higher plans. It performs multiple live web and X searches, reasons across the results, and synthesises a comprehensive cited response. Unlike standard web search, DeepSearch doesn’t just return links — it reads, reasons across, and synthesises from multiple sources simultaneously. For breaking news, where Grok can also search live X, DeepSearch produces results with a recency advantage no other research tool has. Activate it by prefixing your query with “DeepSearch:” or clicking the DeepSearch button in the interface.
Grok is built by xAI, founded in March 2023 by Elon Musk. xAI has raised $25 billion in funding from investors including Nvidia, AMD, a16z, Blackrock, Sequoia Capital, and others, valuing the company at approximately $230 billion — making it the most valuable AI startup as of early 2026. xAI’s Colossus supercluster, located in Memphis, Tennessee, grew to over 500,000 GPUs by early 2026. In January 2026, SpaceX announced an acquisition of xAI. The US Department of Defense has integrated Grok into its GenAI.mil platform for 3 million personnel, representing the largest government AI deployment in history.
Yes. The xAI API is available at console.x.ai. It’s fully OpenAI-compatible — change the base_url to https://api.x.ai/v1 and your API key, and most OpenAI SDK code works without modification. New users receive $25 in free promotional credits. Grok 4.1 Fast costs $0.20/$0.50 per million tokens — the cheapest frontier-class API available. Built-in tools include X search, web search, code execution, and document search at $2.50–$5 per 1,000 calls. Grok 4.20 Beta API availability is listed as “early access / coming soon” in official xAI documentation as of mid-March 2026.
Start Getting Better Results

Master Grok
With Better Prompts

You now know what makes Grok different, when it beats the competition, and exactly how to get the most from its real-time X data, DeepSearch, Big Brain Mode, and the 4-agent Grok 4.20 system.

Grok is free to use · No account required for basic access · Updated when new models launch