
cerebras/deepseek-r1-distill-llama-70b
A DeepSeek R1 Distill Llama 70B model optimized for fast inference on Cerebras hardware. Supports a context length of up to 65,536 tokens.
Provider: cerebras
Model type: chat
Location: US
Context Window: 65,536 tokens
Pricing:
Input: $ per million tokens
Output: $ per million tokens
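Since this is a chat model with a fixed 65,536-token context window, a caller needs to keep prompt plus completion tokens within that limit. The sketch below is a minimal illustration of that budgeting check plus a request payload shaped like an OpenAI-compatible chat-completions call; the payload field names and the bare model id are assumptions, not confirmed by this card.

```python
# Context window taken from the model card above.
CONTEXT_WINDOW = 65_536

def fits_context(prompt_tokens: int, max_completion_tokens: int) -> bool:
    """Return True if prompt + requested completion stays within the window."""
    return prompt_tokens + max_completion_tokens <= CONTEXT_WINDOW

# Hypothetical chat-completions payload (OpenAI-compatible shape is an assumption).
payload = {
    "model": "deepseek-r1-distill-llama-70b",
    "messages": [{"role": "user", "content": "Summarize this report."}],
    "max_tokens": 4_096,  # completion budget; prompt must fit in the remainder
}
```

For example, a 62,000-token prompt with a 4,096-token completion budget would exceed the window and should be truncated or chunked before sending.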
