GLM-4.5

Name: GLM-4.5
Price: 0.4 USD
Rating: 64.0 (14 reviews)
Author: Zhipu AI

Zero-eval

#1MATH-500

#1BFCL-v3

#1AA-Index

+5 more

by Zhipu AI

About

GLM-4.5 is a language model developed by Zhipu AI. It achieves strong performance with an average score of 64.0% across 14 benchmarks. It excels particularly in MATH-500 (98.2%), AIME 2024 (91.0%), MMLU-Pro (84.6%). It supports a 262K token context window for handling large documents. The model is available through 1 API provider. It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents Zhipu AI's latest advancement in AI technology.

Pricing Range

Input (per 1M)$0.40 -$0.40

Output (per 1M)$1.60 -$1.60

Providers1

Timeline

AnnouncedJul 28, 2025

ReleasedJul 28, 2025

Specifications

Training Tokens23.0T

License & Family

License

MIT

Benchmark Performance Overview

Performance metrics and category breakdown

Overall Performance

14 benchmarks

Average Score

64.0%

Best Score

98.2%

High Performers (80%+)

Performance Metrics

Max Context Window

262.1K

Top Categories

math

98.2%

general

79.3%

agents

55.5%

code

50.7%

reasoning

41.1%

Additional Information

Content coming soon...

All Benchmark Results for GLM-4.5

Complete list of benchmark scores with detailed information


MATH-500	math	text	0.98	98.2%	Self-reported
AIME 2024	general	text	0.91	91.0%	Self-reported
MMLU-Pro	general	text	0.85	84.6%	Self-reported
TAU-bench Retail	agents	text	0.80	79.7%	Self-reported
GPQA	general	text	0.79	79.1%	Self-reported
BFCL-v3	general	text	0.78	77.8%	Self-reported
LiveCodeBench	code	text	0.73	72.9%	Self-reported
AA-Index	reasoning	text	0.68	67.7%	Self-reported
SWE-Bench Verified	general	text	0.64	64.2%	Self-reported
TAU-bench Airline	agents	text	0.60	60.4%	Self-reported

Showing 1 to 10 of 14 benchmarks

Resources

API Reference Playground Research Paper Blog Post Repository Model Weights