
Model Garden
Browse models across providers with detailed specs, pricing, and performance signals.
Models
aws
chat
apac
anthropic.claude-3-haiku-20240307-v1:0 (APAC)
Input
$
0.25
/M tokens
Output
$
1.25
/M tokens
aws
chat
europe
anthropic.claude-3-haiku-20240307-v1:0 (EU)
Input
$
0.25
/M tokens
Output
$
1.25
/M tokens
aws
chat
US
anthropic.claude-3-haiku-20240307-v1:0 (US)
Input
$
0.25
/M tokens
Output
$
1.25
/M tokens
aws
chat
apac
anthropic.claude-3-sonnet-20240229-v1:0 (APAC)
Input
$
3
/M tokens
Output
$
15
/M tokens
contextualai
rerank
US
ctxl-rerank-v2-instruct-multilingual-mini
Input
$
/M tokens
Output
$
/M tokens
google-ai
chat
US
gemini-2.5-flash-lite-preview-09-2025
Input
$
0.1
/M tokens
Output
$
0.4
/M tokens
google-ai
realtime
US
gemini-live-2.5-flash-preview-native-audio
Input
$
0.5
/M tokens
Output
$
2
/M tokens
aws
chat
US
global.anthropic.claude-sonnet-4-5-20250929-v1:0
Input
$
3
/M tokens
Output
$
15
/M tokens
togetherai
chat
US
meta-llama/Llama-3.3-70B-Instruct-Turbo
Input
$
0.88
/M tokens
Output
$
0.88
/M tokens
groq
chat
US
meta-llama/llama-4-maverick-17b-128e-instruct
Input
$
0.2
/M tokens
Output
$
0.6
/M tokens
togetherai
chat
US
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Input
$
0.27
/M tokens
Output
$
0.85
/M tokens
groq
chat
US
meta-llama/llama-4-scout-17b-16e-instruct
Input
$
0.11
/M tokens
Output
$
0.34
/M tokens
togetherai
chat
US
meta-llama/Llama-4-Scout-17B-16E-Instruct
Input
$
0.18
/M tokens
Output
$
0.59
/M tokens
chat
rest
qwen/qwen3-235b-a22b-instruct-2507-maas
Input
$
0.22
/M tokens
Output
$
0.88
/M tokens
alibaba
chat
singapore
qwen3-livetranslate-flash-realtime
Input
$
10
/M tokens
Output
$
10
/M tokens
