
cerebras/llama-3.3-70b
Llama 3.3 70B model optimized for fast inference on Cerebras hardware. Supports a context length of up to 128,000 tokens.
Provider: cerebras
Model type: chat
Location: US
Context window: 128,000 tokens
Pricing
Input: $0.85 per million tokens
Output: $1.20 per million tokens
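The listed rates make per-request cost easy to estimate. A minimal sketch, assuming the rates above apply linearly per token (the function name and the example token counts are illustrative, not from this listing):

```python
# Hypothetical cost estimator for cerebras/llama-3.3-70b, using the listed
# rates: $0.85 per million input tokens, $1.20 per million output tokens.

INPUT_RATE = 0.85   # USD per 1M input tokens
OUTPUT_RATE = 1.20  # USD per 1M output tokens

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD."""
    return (input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE) / 1_000_000

# Example: a 2,000-token prompt with a 500-token completion.
print(f"${estimate_cost(2000, 500):.6f}")  # → $0.002300
```

At these rates a full 128,000-token prompt costs about $0.11 in input tokens, so long-context use stays inexpensive.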
Features
Tool Calling: supported
JSON Mode: supported
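The two features can be combined in a single chat request. A sketch of the request body, assuming an OpenAI-compatible chat-completions schema (the schema, the `get_weather` tool, and the message content are assumptions for illustration, not taken from this listing):

```python
import json

# Hypothetical chat request for cerebras/llama-3.3-70b exercising both
# listed features: tool calling and JSON mode.
request_body = {
    "model": "cerebras/llama-3.3-70b",
    "messages": [
        {"role": "user", "content": "What is the weather in Paris?"}
    ],
    # Tool calling: declare a function the model is allowed to invoke.
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",  # hypothetical tool name
                "description": "Look up the current weather for a city.",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
    # JSON mode: constrain the reply to valid JSON.
    "response_format": {"type": "json_object"},
}

print(json.dumps(request_body, indent=2))
```

When JSON mode is enabled, the prompt itself should also instruct the model to answer in JSON; the `response_format` field only enforces syntactic validity.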
