LogoLMSpeed
  • Home
  • Free
  • Categories
  • Models
  • Docs
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2025 LMSpeed All Rights Reserved.
BACK TO INDEX
integrate.api.nvidia.com logo

integrate.api.nvidia.com

Website
Updated 12/8/2025
Country美国免费试用
integrate.api.nvidia.com interface preview
Performance Stats
Avg Speed
73.57t/s
Latency
4.28s
Total Tests
188
Models
23

About integrate.api.nvidia.com

NVIDIA provides AI and accelerated computing APIs for building, customizing, and deploying multimodal generative AI models.

OpenAIgpt-ossQwenQwen3-NextQwenQwen3MetaAILlama 3.3MetaAILlama 3.1DeepSeekDeepSeek-V3.1DeepSeekDeepSeek-V3

NVIDIA offers APIs for AI and accelerated computing, primarily through services like NeMo for building and deploying multimodal generative AI models. Key capabilities include model development, customization, and deployment across various domains. Notable strengths include integration with NVIDIA's hardware platforms (e.g., DGX, HGX) for high-performance computing. Typical use cases involve AI research, life sciences (via BioNeMo), and 3D simulation workflows (via Omniverse Cloud).

Supported Models

ModelSpeedLatencyTests
openai/gpt-oss-20b
239.61 t/s
10.88s
5
openai/gpt-oss-120b
148.36 t/s
14.82s
25
qwen/qwen3-next-80b-a3b-instruct
118.96 t/s
0.58s
10
qwen/qwen3-next-80b-a3b-instruct
118.96 t/s
0.58s
10
mistralai/mixtral-8x22b-instruct-v0.1
89.66 t/s
0.22s
5
deepseek-ai/deepseek-r1
87.63 t/s
8.96s
15
nvidia/llama-3.3-nemotron-super-49b-v1.5
79.68 t/s
0.28s
5
microsoft/phi-4-mini-flash-reasoning
74.19 t/s
0.46s
5
google/gemma-3-27b-it
62.41 t/s
0.20s
5
ai21labs/jamba-1.5-large-instruct
55.60 t/s
0.29s
10
moonshotai/kimi-k2-instruct
54.35 t/s
0.53s
15
nvidia/llama-3.1-nemotron-70b-instruct
52.19 t/s
0.23s
5
meta/llama-3.1-70b-instruct
51.18 t/s
0.23s
5
moonshotai/kimi-k2-instruct-0905
47.28 t/s
0.72s
5
01-ai/yi-large
43.74 t/s
0.22s
5
google/gemma-2-27b-it
43.69 t/s
0.23s
10
deepseek-ai/deepseek-r1-distill-qwen-14b
40.96 t/s
0.49s
5
deepseek-ai/deepseek-v3.1
38.10 t/s
2.50s
25
deepseek-ai/deepseek-v3.1
38.10 t/s
2.50s
25
qwen/qwen3-235b-a22b
34.77 t/s
25.15s
5
deepseek-ai/deepseek-r1-distill-qwen-32b
33.97 t/s
0.59s
5
mistralai/mistral-small-24b-instruct
29.68 t/s
0.49s
10
microsoft/phi-3-medium-128k-instruct
18.27 t/s
0.53s
5

Recent Test Records

TimeModelSpeedLatency
Nov 8, 12:52 PMopenai/gpt-oss-120b
149.96 t/s
18.38s
Nov 8, 12:45 PMnvidia/llama-3.3-nemotron-super-49b-v1.5
79.68 t/s
0.28s
Nov 6, 01:20 PMdeepseek-ai/deepseek-v3.1
20.43 t/s
5.37s
Oct 31, 05:26 AMqwen/qwen3-next-80b-a3b-instruct
158.38 t/s
0.50s
Oct 5, 02:42 AMdeepseek-ai/deepseek-v3.1
28.14 t/s
0.52s
Oct 5, 12:04 AMdeepseek-ai/deepseek-v3.1
29.94 t/s
0.38s
Oct 5, 12:01 AMdeepseek-ai/deepseek-r1
117.81 t/s
7.08s
Sep 28, 04:25 PMqwen/qwen3-next-80b-a3b-instruct
79.54 t/s
0.67s
Sep 28, 10:27 AMmoonshotai/kimi-k2-instruct-0905
47.28 t/s
0.72s
Sep 20, 04:57 AMgoogle/gemma-2-27b-it
43.48 t/s
0.22s