cerebras

cerebras/qwen-3-32b

Qwen 3 32B model optimized for fast inference on Cerebras hardware. Supports a context length of up to 16,382 tokens.

Provider: cerebras
Model type: chat
Location: US
Context Window: 128,000 tokens

Pricing

Input: $0.6 per million tokens
Output: $0.6 per million tokens
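
As a rough illustration of how the per-million-token prices translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost` and the token counts are hypothetical and chosen only for the example. The two price constants are the figures listed above. Actual billing may differ, for instance through rounding or minimum charges.

```python
# Prices from the listing above, in USD per one million tokens.
INPUT_PRICE_PER_MILLION = 0.6
OUTPUT_PRICE_PER_MILLION = 0.6


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = (input_tokens * INPUT_PRICE_PER_MILLION
             + output_tokens * OUTPUT_PRICE_PER_MILLION)
    return total / 1_000_000


# Example with made-up token counts: 12,000 input and 2,000 output tokens.
print(f"${estimate_cost(12_000, 2_000):.4f}")  # prints $0.0084
```

Because input and output are priced the same here, the cost depends only on the total token count: 14,000 tokens at $0.6 per million is $0.0084.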

Features

Tool Calling: Supported
JSON Mode: Supported

Create an account and start building today.