Nemotron 3 Nano Omni Reasoning is available through 1 API providers on LMSpeed. Compare API pricing from $0.050 to $0.050 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 274 tok/s.
Также известна как
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning
274.49 tok/s |
0.66s |
| 10 |
OpenRouter nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free | 110.33 tok/s | 9.33s | 5 |
Показано 1–2 из 2 провайдеров
kimi-k2-5
Moonshot Kimi K2.5 is a large language model in the Kimi series, offering advanced reasoning, code generation, and multimodal capabilities.
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
glm-5-1
Zhipu GLM-5.1 is a next-generation GLM model aimed at frontier reasoning, coding, and bilingual agent applications.
deepseek-v4-flash
DeepSeek V4 Flash is a fast, cost-efficient language model in the DeepSeek V4 family, optimized for low-latency chat, coding assistance, and high-throughput API workloads while retaining strong reasoning quality.
deepseek-v4-pro
DeepSeek V4 Pro is a large language model in the DeepSeek series, offering advanced reasoning, code generation, and multimodal capabilities.
Рейтинги основаны на тестах, предоставленных сообществом, и периодических зондах работоспособности. Носит рекомендательный характер, не является официальными данными.