xAI

x.ai

About

Elon Musk's AI company

Portfolio Stats

Total Models: 8

Multimodal: 7

Benchmarks Run: 54

Avg Performance: 75.3%

Latest Release

Grok-4

Released: Jul 9, 2025

Multimodal

Release Timeline

Recent model releases by year

2025

4 models

2024

4 models

Performance Overview

Top models and benchmark performance

Top Performing Models

By avg score

#1 Grok-3 Mini

87.8%

#2 Grok-3

85.7%

#3 Grok-4 Heavy

79.5%

#4 Grok-2

76.5%

#5 Grok-2 mini

74.0%

Benchmark Categories

code

80.9%

general

75.6%

vision

75.1%

math

66.5%

reasoning

15.9%

Model Statistics

Multimodal Ratio

88%

All Models

Complete portfolio of 8 models

Model	Released	License
Grok-4 Grok 4, announced by xAI in summer 2025, represents a major leap in AI capabilities and is described as 'the smartest AI in the world.' Built on version 6 of xAI's foundation model, it uses 100x more training compute than Grok 2 and 10x more reinforcement learning compute than Grok 3. The model is said to achieve PhD-level performance across all academic disciplines simultaneously, earning perfect scores on standardized tests like the SAT and near-perfect scores on graduate exams like the GRE. Unlike Grok 3, tool usage is built into the training process rather than relying on generalization. Trained using 200,000 GPUs, Grok 4 excels at complex reasoning, mathematical problem-solving, and coding tasks, though it has acknowledged weaknesses in multimodal capabilities that are being addressed in the next version.	Jul 9, 2025	Proprietary	-	-	-	79.0%	-
Grok-4 Heavy Grok 4 Heavy is the multi-agent version of Grok 4, released alongside the standard model in summer 2025. This system spawns multiple Grok 4 agents in parallel that work independently on problems and then collaborate by comparing their solutions, similar to a study group. The agents share insights and tricks they discover, with the system intelligently combining their work rather than simply using majority voting. Grok 4 Heavy uses approximately 10x more test-time compute than regular Grok 4, enabling it to solve significantly more complex problems. On Humanity's Last Exam, it achieves over 50% accuracy on text-only problems, and it scored a perfect result on the AIME 2025 mathematics competition. The system represents a major advancement in multi-agent AI collaboration and reasoning capabilities.	Jul 9, 2025	Proprietary	-	-	-	79.4%	-
Grok-3 Mini Grok 3 Mini is a streamlined version of xAI's Grok 3 AI model, designed for quicker response times while maintaining utility. It's tailored for users who require speed over the comprehensive capabilities of the full Grok 3 model, making it suitable for tasks where rapid information retrieval is key. Grok 3 Mini still leverages the advanced training and data that Grok 3 was built on but offers a lighter, more efficient version for everyday use.	Feb 17, 2025	Proprietary	-	-	-	80.4%	-
Grok-3 Grok 3, launched by xAI on February 17, 2025, is an advanced AI model with significantly enhanced capabilities compared to Grok 2, boasting an order of magnitude increase in performance. It was trained on a vast dataset that includes, among other sources, legal documents, using a massive compute infrastructure of around 200,000 GPUs in a Memphis data center; its training used ten times more compute than its predecessor. It features specialized models like Grok 3 Reasoning and Grok 3 Mini Reasoning for complex problem-solving, and it excels in benchmarks like AIME for mathematics and GPQA for PhD-level science.	Feb 17, 2025	Proprietary	-	-	-	79.4%	-
Grok-2 mini Grok-2 mini is a smaller, faster variant of Grok-2 that offers a balance between speed and answer quality. While more compact than its larger sibling, it maintains strong capabilities across various tasks including reasoning, coding, and chat interactions.	Aug 13, 2024	Proprietary	-	-	85.7%	-	-
Grok-2 Grok-2 is a frontier language model with state-of-the-art reasoning capabilities, featuring advanced abilities in chat, coding, and reasoning. It demonstrates superior performance in visual math reasoning and document-based question answering, and it excels across various academic benchmarks including reasoning, reading comprehension, math, and science.	Aug 13, 2024	Proprietary	-	-	88.4%	-	-
Grok-1.5V A multimodal model capable of processing text and visual information, including documents, diagrams, charts, screenshots, and photographs. Notable for strong real-world spatial understanding capabilities.	Apr 12, 2024	Proprietary	-	-	-	-	-
Grok-1.5 An advanced language model with improved reasoning capabilities, particularly excelling in coding and mathematical tasks. Features a 128K token context window and enhanced problem-solving abilities compared to its predecessor.	Mar 28, 2024	Proprietary	-	-	74.1%	-	-

Resources

Official Website: x.ai