Cohere Aya Expanse 32B

Version of the Aya Expanse model with 32 billion parameters

Creator: Cohere
License: Open

Summary

Artificial Analysis Quality Index:
Output Speed (Tokens per Second): 121
Model Pricing (USD per 1M Tokens): $0.800
Latency (to receive the first token): 14.21s
Context Window (tokens): 128k
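
As a rough illustration of how the summary figures combine, the sketch below estimates the wall-clock time and cost of a single response. It assumes the output speed stays constant at 121 tokens per second after the first token arrives, and that the $0.800 figure can be applied as one flat rate to all tokens; the page does not say how input and output tokens are weighted, so treat the cost as an approximation. The function names are hypothetical and chosen only for this example.

```python
# Figures copied from the summary above.
OUTPUT_SPEED_TPS = 121.0          # output tokens per second
FIRST_TOKEN_LATENCY_S = 14.21     # seconds until the first token arrives
PRICE_PER_1M_TOKENS_USD = 0.800   # assumed to be a flat rate for all tokens


def estimate_response_time(output_tokens: int) -> float:
    """Seconds to receive a full response: first-token latency plus generation time."""
    return FIRST_TOKEN_LATENCY_S + output_tokens / OUTPUT_SPEED_TPS


def estimate_cost(total_tokens: int) -> float:
    """Approximate cost in USD for a given number of tokens at the flat rate."""
    return total_tokens / 1_000_000 * PRICE_PER_1M_TOKENS_USD


if __name__ == "__main__":
    print(f"{estimate_response_time(1210):.2f} s")
    print(f"${estimate_cost(500_000):.3f}")
```

Under these assumptions a 1,210-token reply takes about 24.2 seconds, of which 14.21 seconds is the wait for the first token, so latency dominates short responses.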

Comparison with other models

Quality

Artificial Analysis Quality Index; Higher is better

Speed

Output Tokens per Second; Higher is better

Pricing

USD per 1M Tokens; Lower is better

Latency

Seconds to First Token Chunk Received; Lower is better
