LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Input Price
  • Output Price
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️
Back to models

Phi 3.5 MoE Instruct API Pricing & Performance

phi-3-5-moe-instruct

Microsoft Phi 3.5 MoE Instruct is a mixture-of-experts instruction-tuned variant in the Phi series, optimized for following instructions and conversational tasks.

Also known as

microsoft/phi-3.5-moe-instructphi-3.5-moe-instruct英伟达/microsoft/phi-3.5-moe-instruct

Phi 3.5 MoE Instruct is available through 16 API providers on LMSpeed. Compare API pricing from $0.010 to $75.00 per million input tokens across providers. Free API access is offered by 2 providers.

Avg speed
-t/s
First token
-s
Total tests
0
Providers
19
Variants
23

Data as of Jun 6, 2026, 11:58 AM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.

Pricing Comparison

Compare Phi 3.5 MoE Instruct API pricing across 14 providers. Input prices range from $0.010 to $75.00 per million input. 素墨API offers the lowest rate at $0.010/M. 2 providers offer free API credits or a free tier.

ProviderModel VariantAuditInput ($/M)Output ($/M)Speed (t/s)First token
星见雅 API100%
英伟达/microsoft/phi-3.5-moe-instruct—FreeFree——
Cuz AI100%
phi-3.5-moe-instruct—$0.160$0.640——
6345ywz API99.7%
microsoft/phi-3.5-moe-instruct—$0.137$0.137——
初叶🍂Furry API98.9%
microsoft/phi-3.5-moe-instruct—$0.050$0.050——
猫羽霖API98.7%
microsoft/phi-3.5-moe-instruct—$0.050$0.050——
AI Claw API90.8%
microsoft/phi-3.5-moe-instruct—$25.00$62.50——
9527 API100%
microsoft/phi-3.5-moe-instruct—FreeFree——
素墨API100%
microsoft/phi-3.5-moe-instruct—$0.010$0.010——
phi-3.5-moe-instruct—$0.010$0.010——
SMLC666 API100%
microsoft/phi-3.5-moe-instruct—$75.00$75.00——
KFCV5099.8%
phi-3.5-moe-instruct—$10.27$10.27——
Koyeb AI Gateway99.3%
phi-3.5-moe-instruct—$0.037$0.037——
SWT-API99.1%
phi-3.5-moe-instruct—$0.037$0.037——
MIX API84.8%
microsoft/phi-3.5-moe-instruct—$75.00$75.00——
C85 API100%
microsoft/phi-3.5-moe-instruct—$75.00$75.00——
温云0%
microsoft/phi-3.5-moe-instruct—$75.00$75.00——

Pricing data from provider public APIs

Frequently Asked Questions

Is Phi 3.5 MoE Instruct API free?
Yes, Phi 3.5 MoE Instruct is available for free through 2 API providers on LMSpeed, including 星见雅 API, 9527 API. These providers offer free API credits or a free tier with no per-token charges.
How much does Phi 3.5 MoE Instruct API cost?
Phi 3.5 MoE Instruct API pricing ranges from $0.010 to $75.00 per million input tokens across 16 providers. 素墨API offers the cheapest rate at $0.010/M. Output pricing varies by provider.
Which provider has the cheapest Phi 3.5 MoE Instruct API pricing?
The cheapest Phi 3.5 MoE Instruct API pricing is offered by 素墨API at $0.010 per million input tokens. Compare all 16 providers above to find the best pricing per token for your use case.

Alternatives & Similar Models

MoonshotAIKimi K2.5

kimi-k2-5

Moonshot Kimi K2.5 is a large language model in the Kimi series, offering advanced reasoning, code generation, and multimodal capabilities.

19 shared providers

MinimaxMiniMax M2.5

minimax-m2-5

MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.

19 shared providers

ChatGLMGLM-5.1

glm-5-1

Zhipu GLM-5.1 is a next-generation GLM model aimed at frontier reasoning, coding, and bilingual agent applications.

19 shared providers

MinimaxMiniMax M2.7

minimax-m2-7

MiniMax M2.7 is a large language model in the MiniMax series, offering advanced reasoning, code generation, and multimodal capabilities.

19 shared providers

OpenAIGPT-OSS

gpt-oss

GPT-OSS is an open-source language model offering advanced reasoning, code generation, and multimodal capabilities.

19 shared providers

MoonshotAIKimi K2 Thinking

kimi-k2-thinking

Moonshot AI Kimi K2 Thinking is a reasoning model in the Kimi series, designed for complex reasoning, problem-solving, and analytical tasks.

19 shared providers