Llama 3.3 70B Instruct

flagship
Meta · released 2024-11-26 · text->text
currently routing · 4.2k rpm
Context: 131K tokens
Input: $0.10 / 1M tokens
Output: $0.32 / 1M tokens
Speed: n/a
License: open
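
As a rough illustration of how the listed input and output prices combine into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the token counts in the usage line are hypothetical and chosen for illustration only. Actual billing may differ (for example by provider or rounding rules).

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens, copied from the listing above.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.32")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed rates.

    Hypothetical helper: it only applies the two listed prices and
    ignores any fees, discounts, or per-provider price differences.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = INPUT_PRICE_PER_M * input_tokens + OUTPUT_PRICE_PER_M * output_tokens
    return total / TOKENS_PER_M


# Example: a 100,000-token prompt with a 2,000-token completion.
print(estimate_cost(100_000, 2_000))
```

Decimal arithmetic is used here so that small per-token amounts do not pick up binary floating-point error when summed over many requests.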
/ ABOUT

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction-tuned generative model with 70B parameters (text in, text out). The instruction-tuned, text-only model is optimized for multilingual dialogue use cases and outperforms many of the available open-source and closed chat models on common industry benchmarks.

Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

[Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_3/MODEL_CARD.md)

BENCHMARKS Artificial Analysis Index
Intelligence 14.5
Coding 10.7
Agentic 9.1

Providers for Llama 3.3 70B Instruct

8 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.
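
The routing behaviour described above can be pictured with a small sketch: filter routes by whether the prompt fits their context window, order them by uptime, and fall back to the next route when a call fails. This is not ClosedRouter's actual implementation. The `Route` class, both functions, the `send` callback, and the provider names and numbers in the usage example are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Route:
    name: str            # hypothetical provider label
    context_tokens: int  # largest prompt the route accepts
    uptime_30d: float    # 30-day uptime as a fraction in [0.0, 1.0]


def eligible_routes(routes: Sequence[Route], prompt_tokens: int) -> List[Route]:
    """Keep routes whose context fits the prompt, highest uptime first."""
    fits = [r for r in routes if r.context_tokens >= prompt_tokens]
    return sorted(fits, key=lambda r: r.uptime_30d, reverse=True)


def complete_with_fallback(
    routes: Sequence[Route],
    prompt_tokens: int,
    send: Callable[[Route], str],
) -> str:
    """Try each eligible route in order and return the first success."""
    candidates = eligible_routes(routes, prompt_tokens)
    if not candidates:
        raise RuntimeError("no route has a large enough context window")
    errors = []
    for route in candidates:
        try:
            return send(route)
        except Exception as exc:  # assumed: any error triggers a fallback
            errors.append(f"{route.name}: {exc}")
    raise RuntimeError("all routes failed: " + "; ".join(errors))


# Usage with made-up routes and a fake sender that fails on one provider.
routes = [
    Route("provider-a", 131_000, 0.991),
    Route("provider-b", 131_000, 0.998),
    Route("provider-c", 8_000, 0.999),
]


def fake_send(route: Route) -> str:
    if route.name == "provider-b":
        raise TimeoutError("simulated outage")
    return f"handled by {route.name}"


print(complete_with_fallback(routes, 50_000, fake_send))
```

In this example `provider-c` is skipped because the prompt exceeds its context window, `provider-b` is tried first for its higher uptime, and the request falls back to `provider-a` after the simulated failure.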

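| Provider | Context | Quant | Uptime · 30d |
|----------|---------|-------|--------------|
|          | 131K    | bf16  | 0.00%        |
|          | 131K    | bf16  | 0.00%        |
|          | 131K    | bf16  | 0.00%        |
|          | 131K    | bf16  | 0.00%        |
|          | 131K    | bf16  | 0.00%        |
|          | 131K    | bf16  | 0.00%        |
|          | 131K    | bf16  | 0.00%        |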