Sambanova

Platform Stats

Total Models6

Organizations2

Verified Benchmarks0

Multimodal Models2

Pricing Overview

Avg Input (per 1M)$1.15

Avg Output (per 1M)$2.38

Cheapest Model$0.10

Premium Model$5.00

Supported Features

Number of models supporting each feature

web Search

0

function Calling

6

structured Output

6

code Execution

0

batch Inference

6

finetuning

0

Input Modalities

Models supporting different input types

text

6 (100%)

image

2 (33%)

audio

0 (0%)

video

0 (0%)

Models Overview

Top performers and pricing distribution

Pricing Distribution

Input pricing per 1M tokens

$0-1

5 models

$5-15

1 models

Top Performing Models

By benchmark avg

#1Llama 3.3 70B Instruct

79.9%

#2Qwen3 32B

75.3%

#3Llama 3.1 70B Instruct

74.7%

#4Llama 4 Maverick

71.8%

#5Llama 3.2 11B Instruct

63.6%

Most Affordable Models

Llama 3.1 8B Instruct

$0.10/1M

Llama 3.2 11B Instruct

$0.15/1M

Qwen3 32B

$0.40/1M

Available Models

6 models available through Sambanova

			License						Links
Llama 3.1 8B Instruct Llama 3.1 8B Instruct is a multilingual large language model optimized for dialogue use cases. It features a 128K context length, state-of-the-art tool use, and strong reasoning capabilities.	Meta	Jul 23, 2024	Llama 3.1 Community License	-	-	72.6%	-	-
Llama 3.2 11B Instruct Llama 3.2 11B Vision Instruct is an instruction-tuned multimodal large language model optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. It accepts text and images as input and generates text as output.	Meta	Sep 25, 2024	Llama 3.2 Community License	-	-	-	-	-
Qwen3 32B Qwen3-32B is a large language model from Alibaba's Qwen3 series. It features 32.8 billion parameters, a 128k token context window, support for 119 languages, and hybrid thinking modes allowing switching between deep reasoning and fast responses. It demonstrates strong performance in reasoning, instruction-following, and agent capabilities.	Alibaba	Apr 29, 2025	Apache 2.0	-	-	-	65.7%	-
Llama 3.3 70B Instruct Llama 3.3 is a multilingual large language model optimized for dialogue use cases across multiple languages. It is a pretrained and instruction-tuned generative model with 70 billion parameters, outperforming many open-source and closed chat models on common industry benchmarks. Llama 3.3 supports a context length of 128,000 tokens and is designed for commercial and research use in multiple languages.	Meta	Dec 6, 2024	Llama 3.3 Community License Agreement	-	-	88.4%	-	-
Llama 4 Maverick Llama 4 Maverick is a natively multimodal model capable of processing both text and images. It features a 17 billion active parameter mixture-of-experts (MoE) architecture with 128 experts, supporting a wide range of multimodal tasks such as conversational interaction, image analysis, and code generation. The model includes a 1 million token context window.	Meta	Apr 5, 2025	Llama 4 Community License Agreement	-	-	-	43.4%	77.6%
Llama 3.1 70B Instruct Llama 3.1 70B Instruct is a large language model optimized for multilingual dialogue use cases. It outperforms many available open source and closed chat models on common industry benchmarks.	Meta	Jul 23, 2024	Llama 3.1 Community License	-	-	80.5%	-	-

Resources

Official Website