355B params flagship - #3 globally, #1 domestic/open-source, needs only 8 Nvidia H20 chips
Context Window
128,000
tokens
Max Output
8,192
tokens
Input Pricing
$0.11
per million tokens
Output Pricing
$0.28
per million tokens
No benchmark scores available