llama-3-1-swallow-instruct-v0-1
Developer: Meta
Llama 3.1 Swallow Instruct V0.1 by Meta is available through 7 API providers on LMSpeed. Compare API pricing from $0.010 to $10.71 per million input tokens across providers. Free API access is offered by 1 provider. In speed benchmarks, the fastest provider reaches 19 tok/s.
Compare speed and latency performance across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| NVIDIA NIM institute-of-science-tokyo/llama-3.1-swallow-70b-instruct-v0.1 | 19.04 tok/s | 0.49s | 10 |
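
As a rough illustration of how to read the speed and latency columns together, the sketch below estimates how long a full response might take. It assumes that "Latency" means time to first token and that "Speed" is a steady generation rate after that point; neither assumption is confirmed by the page, and the function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(
    output_tokens: int,
    tokens_per_second: float,
    latency_seconds: float,
) -> float:
    """Estimate total response time for one request.

    Assumption (not stated by the source): latency is the delay before
    the first token, and tokens then arrive at a constant rate.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return latency_seconds + output_tokens / tokens_per_second


# Figures taken from the NVIDIA NIM row of the table above.
seconds = estimated_response_seconds(
    output_tokens=500,
    tokens_per_second=19.04,
    latency_seconds=0.49,
)
print(f"Estimated time for 500 output tokens: {seconds:.1f} s")
```

Under these assumptions, a 500-token reply at the listed figures would take a little under half a minute. Measured results will vary with load and prompt length.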
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
glm-5-1
Zhipu GLM-5.1 is a next-generation GLM model aimed at frontier reasoning, coding, and bilingual agent applications.
gpt-oss
GPT-OSS is an open-source language model offering advanced reasoning, code generation, and multimodal capabilities.
deepseek-v3-1
DeepSeek V3.1 is an open-weights style frontier model from DeepSeek with strong math, coding, and Chinese-English bilingual reasoning.
deepseek-v3-1-terminus
DeepSeek V3.1 Terminus is a Terminus-branded DeepSeek V3.1 variant tuned for stable long-context reasoning and code generation.
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.