First Mamba in ultra-large MoE - 10%+ STEM improvement, 24% coding gain, 39% math boost
Context Window
256,000
tokens
Max Output
8,192
tokens
Input Pricing
$0.50
per million tokens
Output Pricing
$2.00
per million tokens
No benchmark scores available