cerebras

cerebras/llama3.1-8b

Llama 3.1 8B model optimized for fast inference on Cerebras hardware. Supports up to 8,192 tokens context length.

cerebras

cerebras/llama3.1-8b

Llama 3.1 8B model optimized for fast inference on Cerebras hardware. Supports up to 8,192 tokens context length.

cerebras

cerebras/llama3.1-8b

Llama 3.1 8B model optimized for fast inference on Cerebras hardware. Supports up to 8,192 tokens context length.

Provider:

cerebras

Model type:

chat

chat

chat

Location:

us

Context Window

8.2K

Intelligence Rating

Speed Rating

Cost Efficiency Rating

Pricing

$0.10

Input tokens per million

$0.10

Output tokens per million

Features

Tool Calling

Supported

Create an account and start building today.

Create an account and start building today.

Create an account and start building today.