© 2026 LMSpeed. All Rights Reserved. Made by Nexmoe with ❤️

Llama 3.1 API Pricing & Performance

llama3-1

Also known as

llama3-1-405b, llama3.1-8B, llama3.1-8b, 快速/llama3.1-8B

Llama 3.1 is available through 15 API providers on LMSpeed. API pricing ranges from $0.0000 to $75.00 per million input tokens across providers, and 1 provider offers free API access. In speed benchmarks, the fastest provider reaches 1956 tok/s.

Avg speed: 1161.06 t/s
First token: 0.65s
Total tests: 211
Providers: 19
Variants: 21

Pricing Comparison

Compare Llama 3.1 API pricing across 14 providers. Input prices range from $0.0000 to $75.00 per million input tokens. 91VIP offers the lowest rate at $0.0000/M.

| Provider | Model Variant | Input ($/M) | Output ($/M) | Speed (t/s) |
|---|---|---|---|---|
| 玄黄 | MetaAI llama3.1-8b | Free | Free | — |
| 91VIP | MetaAI llama3.1-8b | $0.0000 | $0.0001 | 1910.8 |
| Futureppo | MetaAI llama3.1-8b | $0.0000 | $0.0001 | 1956.2 |
| 素墨API | MetaAI llama3.1-8b | $0.010 | $0.010 | 672.0 |
| uglycat | MetaAI llama3.1-8b | $0.010 | $0.010 | — |
| 人人 API | MetaAI llama3.1-8B | $0.010 | $0.010 | — |
| 天絮 API | MetaAI llama3.1-8b | $0.100 | $0.100 | — |
| SMLC666 API | MetaAI llama3.1-8b | $0.100 | $0.100 | — |
| Seamee API | MetaAI llama3.1-8b | $0.100 | $0.100 | — |
| Synapse | MetaAI llama3.1-8b | $1.00 | $1.00 | — |
| LLM API | MetaAI llama3-1-405b | $20.00 | $20.00 | — |
| 钠 API | MetaAI llama3.1-8B | $40.00 | $40.00 | 988.0 |
| IXIOCCAPI | MetaAI llama3.1-8b | $75.00 | $75.00 | — |
| HotaruAPI | MetaAI llama3.1-8b | $75.00 | $75.00 | — |
| Chlink API | MetaAI llama3.1-8B | $75.00 | $75.00 | — |

Pricing data from provider public APIs
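
Because prices are quoted per million tokens, the cost of a single request is easy to misjudge by a few orders of magnitude. The sketch below shows the arithmetic for two rows of the pricing table. The function name `request_cost_usd` and the request size (2,000 input tokens, 500 output tokens) are illustrative assumptions, not anything LMSpeed publishes.

```python
def request_cost_usd(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Cost of one request in USD, given prices quoted per million tokens."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Hypothetical request size, chosen only for illustration.
INPUT_TOKENS, OUTPUT_TOKENS = 2_000, 500

# 素墨API row: $0.010/M input, $0.010/M output.
cheap = request_cost_usd(INPUT_TOKENS, OUTPUT_TOKENS, 0.010, 0.010)

# IXIOCCAPI row: $75.00/M input, $75.00/M output.
expensive = request_cost_usd(INPUT_TOKENS, OUTPUT_TOKENS, 75.00, 75.00)

print(f"cheap: ${cheap:.8f}  expensive: ${expensive:.4f}")
```

For the same request, the two listed prices differ by a factor of 7,500, which is why the per-million figures are worth converting before comparing providers.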

Free Llama 3.1 API Tier & Credits

Llama 3.1 is free to use through 1 provider with no per-token charges. This provider offers free API credits or a free tier:

| Provider | Speed (t/s) |
|---|---|
| 玄黄 | — |

API Speed Benchmarks by Provider

Compare speed and latency performance across all API providers.
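
The page does not spell out how speed and latency are measured. As a rough sketch of one common approach for streamed responses, the function below takes the time a request was sent and the arrival time of each token, then derives first-token latency and tokens per second. The definitions in the docstring, the name `stream_metrics`, and the synthetic timestamps are all assumptions made for illustration; LMSpeed's actual methodology may differ.

```python
def stream_metrics(request_sent_at, token_arrival_times):
    """Return (first_token_latency_s, tokens_per_second) for one streamed response.

    Assumed definitions (not confirmed by the source):
      - latency: time from sending the request until the first token arrives
      - speed:   tokens received after the first one, divided by the time
                 between the first and the last token
    """
    if len(token_arrival_times) < 2:
        raise ValueError("need at least two tokens to measure a rate")
    first = token_arrival_times[0]
    last = token_arrival_times[-1]
    if last <= first:
        raise ValueError("token timestamps must span a positive duration")
    latency = first - request_sent_at
    speed = (len(token_arrival_times) - 1) / (last - first)
    return latency, speed


# Synthetic timestamps in seconds: first token at 0.5 s, then one token per millisecond.
arrivals = [0.5 + i * 0.001 for i in range(1001)]
latency, speed = stream_metrics(0.0, arrivals)
print(f"latency={latency:.2f}s speed={speed:.0f} tok/s")
```

Separating the two numbers matters because a provider can start responding quickly and still generate slowly, or the reverse.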

Showing 1-9 of 9 providers

| Provider | Model Variant | Speed | Latency | Tests |
|---|---|---|---|---|
| Futureppo | llama3.1-8b | 1956.17 tok/s | 0.38s | 10 |
| 91VIP | llama3.1-8b | 1910.78 tok/s | 0.43s | 5 |
| Supabase AI Proxy | llama3.1-8b | 1829.04 tok/s | 0.58s | 10 |
| GPT Load (Shiho) | llama3.1-8b | 1629.33 tok/s | 0.36s | 25 |
| 素墨API | llama3.1-8B | 1421.44 tok/s | 0.94s | 10 |
| 素墨API | 快速/llama3.1-8B | 1140.96 tok/s | 1.24s | 15 |
| 钠 API | llama3.1-8B | 987.95 tok/s | 0.45s | 106 |
| Koru API | llama3.1-8B | 714.17 tok/s | 1.60s | 15 |
| 素墨API | llama3.1-8b | 672.05 tok/s | 1.17s | 15 |

Recent API Speed Tests

20 records

Latest benchmark results measuring API response speed and first-token latency.

| Time | Model | Speed | Latency |
|---|---|---|---|
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 2556.19 tok/s | 0.55s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 2347.73 tok/s | 0.59s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 2548.74 tok/s | 0.56s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 212.86 tok/s | 0.60s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 2384.06 tok/s | 0.54s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 1557.57 tok/s | 0.66s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 2465.36 tok/s | 0.62s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 2383.83 tok/s | 0.57s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 154.51 tok/s | 0.59s |
| 04/03/2026, 14:47 | MetaAI llama3.1-8b | 1679.53 tok/s | 0.53s |
| 03/30/2026, 18:50 | MetaAI llama3.1-8B | 563.20 tok/s | 0.80s |
| 03/30/2026, 18:50 | MetaAI llama3.1-8B | 1137.98 tok/s | 0.50s |
| 03/30/2026, 18:50 | MetaAI llama3.1-8B | 1165.71 tok/s | 0.50s |
| 03/30/2026, 18:50 | MetaAI llama3.1-8B | 606.41 tok/s | 0.49s |
| 03/30/2026, 18:50 | MetaAI llama3.1-8B | 867.77 tok/s | 0.49s |
| 03/30/2026, 18:49 | MetaAI llama3.1-8B | 83.02 tok/s | 6.16s |
| 03/30/2026, 18:49 | MetaAI llama3.1-8B | 59.68 tok/s | 10.72s |
| 03/30/2026, 18:49 | MetaAI llama3.1-8B | 1201.22 tok/s | 0.50s |
| 03/30/2026, 18:49 | MetaAI llama3.1-8B | 280.86 tok/s | 0.47s |
| 03/30/2026, 18:49 | MetaAI llama3.1-8B | 784.80 tok/s | 0.48s |
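
The 20 recent records range from 59.68 to 2556.19 tok/s, so a single average says little about what a typical request looks like. As a small illustration, the sketch below summarizes the recorded speeds with the mean, the median, and the extremes using Python's standard `statistics` module. The choice of summary statistics is an assumption for illustration; the page itself reports only the raw records.

```python
import statistics

# Speeds in tok/s, copied from the 20 recent test records in listed order.
recent_speeds = [
    2556.19, 2347.73, 2548.74, 212.86, 2384.06,
    1557.57, 2465.36, 2383.83, 154.51, 1679.53,
    563.20, 1137.98, 1165.71, 606.41, 867.77,
    83.02, 59.68, 1201.22, 280.86, 784.80,
]

mean_speed = statistics.mean(recent_speeds)
median_speed = statistics.median(recent_speeds)
slowest, fastest = min(recent_speeds), max(recent_speeds)

print(f"mean={mean_speed:.1f} median={median_speed:.1f} "
      f"min={slowest} max={fastest}")
```

The two slowest records (83.02 and 59.68 tok/s) also have the highest latencies in the list, 6.16s and 10.72s, which suggests occasional stalls rather than uniformly slow generation.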

Frequently Asked Questions

Is the Llama 3.1 API free?
Yes. Llama 3.1 is available for free through 1 API provider on LMSpeed: 玄黄. This provider offers free API credits or a free tier with no per-token charges.
How much does the Llama 3.1 API cost?
Llama 3.1 API pricing ranges from $0.0000 to $75.00 per million input tokens across 15 providers. 91VIP offers the cheapest rate at $0.0000/M. Output pricing varies by provider.
Which provider has the cheapest Llama 3.1 API pricing?
91VIP offers the cheapest Llama 3.1 API pricing at $0.0000 per million input tokens. Compare all 15 providers above to find the best per-token pricing for your use case.

Alternatives & Similar Models

| Vendor | Model | Slug | Shared providers |
|---|---|---|---|
| Gemini | Gemini 2.5 Flash | gemini-2-5-flash | 15 |
| OpenAI | GPT-OSS | gpt-oss | 15 |
| Gemini | Gemini 2.5 Pro | gemini-2-5-pro | 14 |
| Gemini | Gemini 3 Flash | gemini-3-flash | 14 |
| Minimax | MiniMax-M2.5 | minimax-m2-5 | 14 |
| ChatGLM | GLM-4.7 | glm-4-7 | 14 |