| nvidia/nemotron-3-120b-a12b Details | |
|---|---|
| Overview | |
| Description | NVIDIA's Mixture-of-Experts model with 120B total parameters and 12B active, optimized for efficient inference with strong reasoning capabilities. |
| Author | nvidia |
| Category | Text Generation |
| Context length | 131K |
| Providers | 2 |
| Pricing (per 1M tokens) | |
| Input | $0.10 |
| Output | $0.30 |
| Capabilities | |
| Reasoning | |
| Tool calling | |
| Vision | – |
| Streaming | |
| Artificial Analysis | |
| Intelligence | 36.0 |
| Coding | 31.2 |
| Benchmarks | |
| GPQA Diamond | 0.8 |
| HLE | 0.2 |
| IFEval | 0.7 |
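
The per-token pricing in the table can be turned into a rough per-request cost estimate. The following is a minimal sketch, assuming the listed rates ($0.10 input, $0.30 output per 1M tokens) apply uniformly and that no provider-specific surcharges, caching discounts, or minimum fees exist. The function name `estimate_cost` and the constants are hypothetical and not part of any provider API.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens, copied from the pricing rows of the table.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.30")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat per-token pricing; real provider bills may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 20,000-token completion.
    cost = estimate_cost(100_000, 20_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Under these assumptions, a request with 100,000 input tokens and 20,000 output tokens comes to about $0.016. `Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many requests.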