1T params, trained on 15.5T tokens - modified MIT license, 60% cheaper than Western competitors
Context Window
256,000
tokens
Max Output
8,192
tokens
Input Pricing
$0.60
per million tokens
Output Pricing
$2.50
per million tokens
No benchmark scores available