llama-3-1-instruct
Developer: Meta
Also known as
Llama 3.1 Instruct by Meta is available through 85 API providers on LMSpeed. Compare API pricing from $0.0000 to $75.00 per million input tokens across providers. Free API access is offered by 4 providers. In speed benchmarks, the fastest provider reaches 200 tok/s.
Llama 3.1 Instruct is free to use through 4 providers with no per-token charges. These providers offer free API credits or a free tier:
| Provider | Speed (t/s) |
|---|---|
| 91VIP API | — |
| 91VIP API | — |
| MapleLeaf API | — |
| MapleLeaf API | — |
Compare speed and latency performance across all API providers.
Showing 1-3 of 3 providers
| Provider | Speed | Latency | Tests |
|---|---|---|---|
meta/llama-3.1-8b-instruct | 200.27 tok/s | 0.42s | 15 |
meta/llama-3.1-70b-instruct | 51.18 tok/s | 0.23s | 5 |
meta/llama-3.1-405b-instruct | 22.66 tok/s | 5.87s | 20 |
Latest benchmark results measuring API response speed and first-token latency.
| Time | Model | Speed | Latency |
|---|---|---|---|
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 30.39 tok/s | 3.58s |
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 30.92 tok/s | 0.26s |
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 7.99 tok/s | 0.26s |
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 5.28 tok/s | 0.69s |
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 5.06 tok/s | 0.41s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 45.23 tok/s | 0.38s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 41.63 tok/s | 0.24s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 40.49 tok/s | 3.89s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 37.25 tok/s | 3.50s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 44.30 tok/s | 0.23s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 41.09 tok/s | 0.38s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 34.76 tok/s | 0.39s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 15.04 tok/s | 32.05s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 6.34 tok/s | 10.53s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 39.81 tok/s | 0.23s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 5.61 tok/s | 34.07s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 2.82 tok/s | 0.49s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 3.23 tok/s | 0.48s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 1.33 tok/s | 20.46s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 14.62 tok/s | 4.81s |