whichllm — Browse and compare AI model specs and pricing

Helicone

Meta Llama 3.1 8B Instant model ID, context window & pricing

llama

Quick facts

Model ID llama-3.1-8b-instant
Source Helicone
Context Window 131072
Pricing $0.05 input / $0.08 output per 1M tokens
Capabilities tool calling, temperature control

Model overview

Meta Llama 3.1 8B Instant is an AI model from Helicone with 131072 token context window and text input support.

Published pricing is $0.05 input and $0.08 output per 1M tokens.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
Model ID llama-3.1-8b-instant
Provider Helicone
Family llama
Status -
Knowledge Cutoff 2024-07
Release Date 2024-07-01
Input Modalities text
Output Modalities text
Context Window 131072
Input Limit -
Output Limit 32678
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.05
Output Cost / 1M tokens $0.08
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -