Meta Llama 3.1 Instruct 8B

Instruction-tuned Llama 3.1 model with 8 billion parameters, optimized for resource-constrained environments

Creator: MetaLicense: Open

Summary

Artificial Analysis Quality
Index
53
Output Speed (Tokens per
Second)
159
Model Pricing (USD per 1M Tokens)
$0.100
Latency (to receive the first
token)
14.21s
Context Window (tokens)
128k

Comparison with other models

Quality

Artificial Analysis Quality Index; Higher is better

+ Add model

Speed

Output Tokens per Second; Higher is better

+ Add model

Pricing

USD per 1M Tokens; Lower is better

+ Add model

Latency

Seconds to First Tokens Chunk Received; Lower is better

+ Add model