Llama 3.3 70B Turbo model ID, context window & pricing
llama
Quick facts
Model ID meta-llama/Llama-3.3-70B-Instruct-Turbo
Source Deep Infra
Context Window 131072
Pricing $0.10 input / $0.32 output per 1M tokens
Capabilities tool calling, open weights
Model overview
Llama 3.3 70B Turbo is an AI model from Deep Infra with 131072 token context window and text input support.
Published pricing is $0.10 input and $0.32 output per 1M tokens.
- Workloads that use text inputs with text outputs.
- Agent and tool workflows that need function calling.
Model ID meta-llama/Llama-3.3-70B-Instruct-Turbo
Provider Deep Infra
Family llama
Status -
Knowledge Cutoff -
Release Date 2024-12-06
Input Modalities text
Output Modalities text
Context Window 131072
Input Limit -
Output Limit 16384
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control -
Open Weights Yes
Input Cost / 1M tokens $0.10
Output Cost / 1M tokens $0.32
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -