
cerebras/qwen-3-32b
Qwen 3 32B model optimized for fast inference on Cerebras hardware. Supports up to 16,382 tokens context length.
Provider:
cerebras
Model type:
chat
Location:
US
Context Window
128000
Pricing
$0.6 per million input tokens
$0.6 per million output tokens
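As a rough illustration of how the per-million-token rates above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost_usd` and the constant names are hypothetical, chosen for this example only; the rates are the listed $0.6 per million tokens for both input and output, and the calculation assumes simple linear billing with no minimums or discounts.

```python
# Listed rates from the Pricing section (USD per one million tokens).
INPUT_USD_PER_MILLION = 0.6
OUTPUT_USD_PER_MILLION = 0.6


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD, assuming purely linear per-token billing.

    Hypothetical helper for illustration; not part of any provider SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000
```

For example, under these assumptions a request with 10,000 input tokens and 2,000 output tokens would come to about $0.0072.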
Features
Tool Calling
Supported
JSON Mode
Supported
