Pixtral Large

Name: Pixtral Large
Price: 2 USD
Rating: 80.5 (7 reviews)
Author: Mistral AI

Multimodal

Zero-eval

#1VQAv2

#1MM-MT-Bench

#3AI2D

by Mistral AI

About

Pixtral Large is a multimodal language model developed by Mistral AI. This model demonstrates exceptional performance with an average score of 80.5% across 7 benchmarks. It excels particularly in AI2D (93.8%), DocVQA (93.3%), ChartQA (88.1%). The model shows particular specialization in general tasks with an average performance of 91.0%. It supports a 256K token context window for handling large documents. The model is available through 1 API provider. As a multimodal model, it can process and understand text, images, and other input formats seamlessly. Released in 2024, it represents Mistral AI's latest advancement in AI technology.

Pricing Range

Input (per 1M)$2.00 -$2.00

Output (per 1M)$6.00 -$6.00

Providers1

Timeline

AnnouncedNov 18, 2024

ReleasedNov 18, 2024

Specifications

Capabilities

Multimodal

License & Family

License

Mistral Research License (MRL) for research; Mistral Commercial License for commercial use

Base ModelMistral Large 2

Benchmark Performance Overview

Performance metrics and category breakdown

Overall Performance

7 benchmarks

Average Score

80.5%

Best Score

93.8%

High Performers (80%+)

Performance Metrics

Max Context Window

256.0K

Avg Throughput

0.1 tok/s

Avg Latency

1ms

Top Categories

general

91.0%

vision

79.4%

roleplay

74.0%

math

69.4%

Benchmark Performance

Top benchmark scores with normalized values (0-100%)

Ranking Across Benchmarks

Position relative to other models on each benchmark

AI2D

Rank #3 of 17

#1GPT-4o

94.2%

#2Claude 3.5 Sonnet

94.7%

#3Pixtral Large

93.8%

#4Mistral Small 3.2 24B Instruct

92.9%

#5Llama 3.2 90B Instruct

92.3%

#6Llama 3.2 11B Instruct

91.1%

DocVQA

Rank #12 of 26

#9DeepSeek VL2

93.3%

#10Nova Pro

93.5%

#11Grok-2

93.6%

#12Pixtral Large

93.3%

#13Grok-2 mini

93.2%

#14Phi-4-multimodal-instruct

93.2%

#15GPT-4o

92.8%

ChartQA

Rank #7 of 24

#4Qwen2-VL-72B-Instruct

88.3%

#5Llama 4 Scout

88.8%

#6Nova Pro

89.2%

#7Pixtral Large

88.1%

#8Mistral Small 3.2 24B Instruct

87.4%

#9Qwen2.5 VL 7B Instruct

87.3%

#10Nova Lite

86.8%

VQAv2

Rank #1 of 3

#1Pixtral Large

80.9%

#2Pixtral-12B

78.6%

#3Llama 3.2 90B Instruct

78.1%

MM-MT-Bench

Rank #1 of 3

#1Pixtral Large

74.0%

#2Pixtral-12B

60.5%

#3Qwen2.5-Omni-7B

6.0%

All Benchmark Results for Pixtral Large

Complete list of benchmark scores with detailed information


AI2D AI2D benchmark	general	text	0.94	93.8%	Self-reported
DocVQA DocVQA benchmark	vision	multimodal	0.93	93.3%	Self-reported
ChartQA ChartQA benchmark	general	multimodal	0.88	88.1%	Self-reported
VQAv2 VQAv2 benchmark	vision	multimodal	0.81	80.9%	Self-reported
MM-MT-Bench MM-MT-Bench benchmark	roleplay	text	74.00	74.0%	Self-reported
MathVista MathVista benchmark	math	text	0.69	69.4%	Self-reported
MMMU MMMU benchmark	vision	multimodal	0.64	64.0%	Self-reported

Resources

API Reference Playground Blog Post Model Weights