| Model | Released | Description |  |  |  |  |  |
|---|---|---|---|---|---|---|---|
| Qwen3-235B-A22B-Instruct-2507 | Jul 22, 2025 | The updated instruct version of Qwen3-235B-A22B, with significant improvements in general capabilities including instruction following, logical reasoning, text comprehension, mathematics, science, coding, and tool usage. Provides substantial gains in long-tail knowledge coverage across multiple languages and markedly better alignment with user preferences in subjective and open-ended tasks. | - | 57.3% | - | - | - |
| Qwen3 32B | Apr 29, 2025 | A large language model from Alibaba's Qwen3 series with 32.8 billion parameters, a 128k-token context window, support for 119 languages, and hybrid thinking modes that switch between deep reasoning and fast responses. Demonstrates strong performance in reasoning, instruction following, and agent capabilities. | - | - | - | 65.7% | - |
| Qwen3 235B A22B | Apr 29, 2025 | A large language model developed by Alibaba, featuring a Mixture-of-Experts (MoE) architecture with 235 billion total parameters and 22 billion activated parameters. Achieves results competitive with other top-tier models on benchmarks covering coding, math, general capabilities, and more. | - | - | - | 70.7% | 81.4% |
| Qwen3 30B A3B | Apr 29, 2025 | A smaller MoE model from Alibaba's Qwen3 series, with 30.5 billion total parameters and 3.3 billion activated parameters. Features hybrid thinking/non-thinking modes, support for 119 languages, and enhanced agent capabilities; aims to outperform earlier models such as QwQ-32B while activating significantly fewer parameters. | - | - | - | 62.6% | - |
| Qwen2.5-Omni-7B | Mar 27, 2025 | The flagship end-to-end multimodal model in the Qwen series. Processes diverse inputs including text, images, audio, and video, delivering real-time streaming responses through text generation and natural speech synthesis using a novel Thinker-Talker architecture. | - | - | 78.7% | - | 73.2% |
| QwQ-32B | Mar 5, 2025 | A model focused on advancing AI reasoning capabilities, excelling particularly in mathematics and programming. Features deep introspection and self-questioning abilities, with some limitations around language mixing and recursive/endless reasoning patterns. | - | - | - | 63.4% | - |
| Qwen2.5 VL 32B Instruct | Feb 28, 2025 | A vision-language model from the Qwen family. Key enhancements include visual understanding (objects, text, charts, layouts), visual agent capabilities (tool use, computer/phone control), long-video comprehension with event pinpointing, visual localization (bounding boxes/points), and structured output generation. | - | - | 91.5% | - | 84.0% |
| Qwen2.5 VL 72B Instruct | Jan 26, 2025 | The new flagship vision-language model of Qwen, significantly improved over Qwen2-VL. Excels at recognizing objects; analyzing text, charts, and layouts in images; acting as a visual agent; understanding long videos (over 1 hour) with event pinpointing; performing visual localization (bounding boxes/points); and generating structured outputs from documents. | - | - | - | - | - |
| Qwen2.5 VL 7B Instruct | Jan 26, 2025 | A vision-language model from the Qwen family. Key enhancements include visual understanding (objects, text, charts, layouts), visual agent capabilities (tool use, computer/phone control), long-video comprehension with event pinpointing, visual localization (bounding boxes/points), and structured output generation. | - | - | - | - | - |
| QvQ-72B-Preview | Dec 25, 2024 | An experimental research model focused on advanced visual reasoning and step-by-step cognitive capabilities. Achieves strong performance on multimodal science and mathematics tasks, though it exhibits limitations such as potential language mixing and recursive reasoning loops. | - | - | - | - | - |
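Several of the models above are MoE architectures whose names encode total versus activated parameter counts (e.g. "235B A22B" means 235 billion total, 22 billion activated per token). A minimal sketch of what that ratio implies, using only the parameter counts stated in the descriptions above:

```python
# Activated-parameter fraction for the MoE models listed above.
# Parameter counts are taken directly from the model descriptions;
# the fraction shows how sparsely each model routes compute per token.
models = {
    "Qwen3-235B-A22B": (235e9, 22e9),   # 235B total, 22B activated
    "Qwen3-30B-A3B": (30.5e9, 3.3e9),   # 30.5B total, 3.3B activated
}

for name, (total, activated) in models.items():
    fraction = activated / total
    print(f"{name}: {fraction:.1%} of parameters active per token")
```

Running this shows both models activate roughly a tenth of their weights per token, which is why a 235B-parameter model can have inference cost closer to a dense ~22B model.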