LogoLMSpeed
  • Home
  • Free
  • Categories
  • Models
  • Docs
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2025 LMSpeed All Rights Reserved.
Back to models
publish

Llama 3.1

llama-3.1

Total tests10
Avg speed51.68 t/s
First token0.23 s
Providers1
Variants2

Variants

Showing 1-2 of 2 providers

VariantSpeedLatencyTests
integrate.api.nvidia.com
52.19 t/s
0.23s
5
integrate.api.nvidia.com
51.18 t/s
0.23s
5

Recent test records

10 records
TimeModelSpeedLatency
Aug 21, 12:17 AMnvidia/llama-3.1-nemotron-70b-instruct
52.14 t/s
0.40s
Aug 21, 12:17 AMnvidia/llama-3.1-nemotron-70b-instruct
52.30 t/s
0.17s
Aug 21, 12:17 AM
nvidia/llama-3.1-nemotron-70b-instruct
52.13 t/s
0.18s
Aug 21, 12:17 AMnvidia/llama-3.1-nemotron-70b-instruct
52.10 t/s
0.16s
Aug 21, 12:17 AMnvidia/llama-3.1-nemotron-70b-instruct
52.28 t/s
0.22s
Aug 1, 10:55 AMmeta/llama-3.1-70b-instruct
39.00 t/s
0.40s
Aug 1, 10:55 AMmeta/llama-3.1-70b-instruct
38.29 t/s
0.20s
Aug 1, 10:55 AMmeta/llama-3.1-70b-instruct
53.02 t/s
0.17s
Aug 1, 10:55 AMmeta/llama-3.1-70b-instruct
75.07 t/s
0.18s
Aug 1, 10:55 AMmeta/llama-3.1-70b-instruct
50.52 t/s
0.21s
nvidia/llama-3.1-nemotron-70b-instruct
meta/llama-3.1-70b-instruct