Mistral's official instruction-tuned version of Mixtral 8x22B. The model activates 39B of its 141B total parameters per token, making it markedly cost-efficient for its capability class. Its strengths include:
- strong math, coding, and reasoning
- large context length (64k)
- fluency in English, French, Italian, German, and Spanish
See benchmarks in the launch announcement. #moe
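The 39B-of-141B figure comes from sparse mixture-of-experts routing: each token is processed by only 2 of the model's 8 experts, so only a fraction of the weights are touched per forward pass. Below is a minimal, illustrative sketch of top-2 routing in Python; the dimensions and weights are made up and far smaller than the real model's.

```python
import numpy as np

# Illustrative top-2 expert routing, the mechanism behind Mixtral-style
# sparse MoE layers. All sizes here are toy values, not the model's real
# dimensions.
rng = np.random.default_rng(0)
n_experts, top_k, d_model = 8, 2, 16  # Mixtral 8x22B routes each token to 2 of 8 experts

router_w = rng.normal(size=(d_model, n_experts))               # router projection
experts = [rng.normal(size=(d_model, d_model)) for _ in range(n_experts)]

def moe_layer(x: np.ndarray) -> np.ndarray:
    """Route a single token vector through its top-2 experts."""
    logits = x @ router_w                                      # (n_experts,)
    top = np.argsort(logits)[-top_k:]                          # indices of the 2 best experts
    weights = np.exp(logits[top]) / np.exp(logits[top]).sum()  # softmax over selected experts
    # Only top_k expert matrices are used, so only top_k / n_experts of the
    # expert parameters are "active" for this token.
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

token = rng.normal(size=d_model)
print(moe_layer(token).shape)  # (16,)
```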
- Modalities: Text
- Input Price: $2 per 1M tokens
- Output Price: $6 per 1M tokens
- Context: 64K (65,536 tokens)
- Weekly Tokens: 220M
- Released: Apr 17, 2024
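At the listed rates, per-request cost is simple arithmetic. A minimal sketch; the token counts in the example are hypothetical:

```python
# Back-of-the-envelope cost estimate using the listed prices:
# $2 per 1M input tokens, $6 per 1M output tokens.
INPUT_PRICE_PER_M = 2.00   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 6.00  # USD per 1M output tokens

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for a single request at the listed rates."""
    return (input_tokens * INPUT_PRICE_PER_M +
            output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000

# e.g. a 50K-token prompt with a 2K-token completion:
print(f"${request_cost(50_000, 2_000):.4f}")  # $0.1120
```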
