RAG-optimized flagship - 50% faster throughput, 20% lower latency
Context Window
128,000
tokens
Max Output
4,096
Input Pricing
$2.50
per million tokens
Output Pricing
$10.00
No benchmark scores available