llama-3-1-instruct
开发者: Meta
别名
Llama 3.1 Instruct(Meta)可通过 LMSpeed 上的 85 家 API 服务商获取。API 价格对比:输入价格从 $0.0000 到 $75.00/M,91VIP提供最低价。4 家服务商提供免费 API 额度。速度基准测试中,最快服务商达到 200 tok/s。
对比 Llama 3.1 Instruct 在 81 家服务商的 API 价格。输入价格从 $0.0000/M 到 $75.00/M,其中 91VIP 提供最低价 $0.0000/M。
| 服务商 | 模型变体 | 输入 ($/M) | 输出 ($/M) | 速度 (t/s) |
|---|---|---|---|---|
| 91VIP API | meta/llama-3.1-405b-instruct | Free | Free | — |
| 91VIP API | meta/llama-3.1-70b-instruct | Free | Free | — |
| MapleLeaf API | meta/llama-3.1-405b-instruct | Free | Free | — |
| MapleLeaf API | meta/llama-3.1-70b-instruct | Free | Free | — |
| 91VIP | llama-3.1-8b-instruct | $0.0000 | $0.0000 | — |
| Futureppo | llama-3.1-8b-instruct | $0.0000 | $0.0000 | — |
| 91VIP | llama-3.1-70b-instruct | $0.0008 | $0.0003 | — |
| Futureppo | llama-3.1-70b-instruct | $0.0008 | $0.0003 | — |
| 91VIP | llama-3.1-405b-instruct | $0.0009 | $0.0009 | — |
| Futureppo | llama-3.1-405b-instruct | $0.0009 | $0.0009 | — |
| 素墨API | llama-3.1-405b-instruct | $0.010 | $0.010 | — |
| 素墨API | llama-3.1-70b-instruct | $0.010 | $0.010 | — |
| 素墨API | llama-3.1-8b-instruct | $0.010 | $0.010 | — |
| 素墨API | meta/llama-3.1-405b-instruct | $0.010 | $0.010 | — |
| 素墨API | meta/llama-3.1-70b-instruct | $0.010 | $0.010 | — |
| 素墨API | meta/llama-3.1-8b-instruct | $0.010 | $0.010 | — |
| meta-llama/llama-3.1-8b-instruct | $0.020 | $0.050 | — | |
| meta/llama-3.1-8b-instruct | $0.025 | $0.025 | — | |
| Seamee API | meta/llama-3.1-8b-instruct | $0.025 | $0.025 | — |
| MapleLeaf API | meta/llama-3.1-8b-instruct | $0.025 | $0.025 | — |
价格数据来自各服务商公开 API
Llama 3.1 Instruct 可通过 4 家服务商免费使用,无需按 token 付费。以下服务商提供免费 API 额度或免费套餐:
| 提供商 | 速度 | 延迟 | 测试次数 |
|---|---|---|---|
meta/llama-3.1-8b-instruct | 200.27 tok/s | 0.42s | 15 |
meta/llama-3.1-70b-instruct | 51.18 tok/s | 0.23s | 5 |
meta/llama-3.1-405b-instruct | 22.66 tok/s | 5.87s | 20 |
最新基准测试结果,测量 API 响应速度与首字延迟。
| 时间 | 模型 | 速度 | 延迟 |
|---|---|---|---|
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 30.39 tok/s | 3.58s |
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 30.92 tok/s | 0.26s |
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 7.99 tok/s | 0.26s |
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 5.28 tok/s | 0.69s |
| 03/31/2026, 06:33 | meta/llama-3.1-405b-instruct | 5.06 tok/s | 0.41s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 45.23 tok/s | 0.38s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 41.63 tok/s | 0.24s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 40.49 tok/s | 3.89s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 37.25 tok/s | 3.50s |
| 03/31/2026, 04:03 | meta/llama-3.1-405b-instruct | 44.30 tok/s | 0.23s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 41.09 tok/s | 0.38s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 34.76 tok/s | 0.39s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 15.04 tok/s | 32.05s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 6.34 tok/s | 10.53s |
| 03/30/2026, 16:38 | meta/llama-3.1-405b-instruct | 39.81 tok/s | 0.23s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 5.61 tok/s | 34.07s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 2.82 tok/s | 0.49s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 3.23 tok/s | 0.48s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 1.33 tok/s | 20.46s |
| 03/26/2026, 18:08 | meta/llama-3.1-405b-instruct | 14.62 tok/s | 4.81s |
