Discover free LLM API models across providers with real speed and latency benchmarks.
LMSpeed tracks 234 LLM models available for free across 39 API providers. Free tiers vary by provider: some offer limited daily requests, others provide free credits for new users. All speed data comes from real API tests; a model with 0 recorded tests shows N/A for speed and latency.
| Model | Providers | Tags | Context | Speed | Latency | Tests |
|---|---|---|---|---|---|---|
| mistralai/mistral-small-3.1-24b-instruct-2503 | nvidia | FREE, Tools, Open Weights | 128K | N/A | N/A | 0 |
| nvidia/llama-3.1-nemotron-ultra-253b-v1 | nvidia | FREE, Reasoning, Tools | 131.1K | N/A | N/A | 0 |
| qwen/qwen2.5-coder-32b-instruct | nvidia | FREE, Tools, Open Weights | 128K | N/A | N/A | 0 |
| qwen/qwen3.5-397b-a17b | openrouter | FREE, Reasoning, Tools, Files, Open Weights | not listed | N/A | N/A | 0 |
| qwen3-max-preview | qiniu-ai | FREE, Tools | 256K | N/A | N/A | 0 |
| z-ai/glm5 | nvidia | FREE, Reasoning, Tools, Open Weights | 202.8K | N/A | N/A | 0 |
| Kimi K2 0905 | nvidia | Reasoning, Tools, Open Weights | 128K | N/A | N/A | 0 |
| Kimi K2 Thinking | zenmux | Reasoning, Tools | 262K | N/A | N/A | 0 |
| Kimi K2.5 | zenmux | Reasoning, Tools, Files, Vision | not listed | N/A | N/A | 0 |
| MiniMax-M2 | nvidia | FREE, Reasoning, Tools, Open Weights | 128K | N/A | N/A | 0 |
| MiniMax-M2.1 | nvidia | FREE, Reasoning, Tools, Open Weights | 204.8K | N/A | N/A | 0 |
| MiniMax-M2.5 | nvidia | Reasoning, Tools, Open Weights | 204.8K | N/A | N/A | 0 |