Ollama

ollama.com

Models: 46 models
From: --
Speed: 54 tok/s
Updated: 6/8/2026

Ollama provides a platform to run and integrate open-source AI models locally or in the cloud.

Latency20.77 s

Created At3/23/2026

Website

API Endpoints

ollama.com

Model

Input ($/M)

Output ($/M)

Audit

Speed

Latency

Tests

glm-5.1:cloud

—

61.6 t/s

25.98 s

minimax-m2.7

—

32.9 t/s

12.04 s

glm-5

—

63.6 t/s

21.68 s

Time

Model

Speed

Latency

Apr 13, 02:24 PM

glm-5.1:cloud

40.89 tok/s

42.51s

Apr 13, 02:20 PM

glm-5.1:cloud

49.79 tok/s

25.82s

Apr 13, 02:12 PM

glm-5.1:cloud

94.03 tok/s

9.61s

Apr 7, 11:24 AM

minimax-m2.7

34.06 tok/s

5.97s

Apr 7, 08:55 AM

minimax-m2.7

31.79 tok/s

18.11s

Apr 7, 08:11 AM

glm-5

51.14 tok/s

34.23s

Mar 23, 09:10 AM

glm-5

76.14 tok/s

9.13s

Provider

Why compare

Models

Free

Avg price

Speed

30d uptime

Ollama

ollama-com

Ollama provides a platform to run and integrate open-source AI models locally or in the cloud.

Current provider baseline

N/A

54 tok/s

99.4%

Future Hub

api-futureppo-top

Future Hub is a large-scale OpenAI-compatible API gateway at api.futureppo.top with extensive model coverage and pricing data.

Faster measured speed
Higher 30-day availability
More free-model options
Broader model coverage

$2.45/M

67 tok/s

100%

ai-071129-xyz

MapleLeaf API runs a New API-powered gateway on ai.071129.xyz for aggregated access to multiple AI models.

Higher 30-day availability
More free-model options
Broader model coverage

260

$55.27/M

38 tok/s

100%

api-kr777-top

CaMeL AI provides an OpenAI-compatible API gateway with extensive model coverage and pricing options.

Faster measured speed
More free-model options
Broader model coverage

178

$27.28/M

119 tok/s

98.9%

naapi-cc

Na API (naapi.cc) is an OpenAI-compatible LLM API gateway with competitive pricing and stable access to 100+ models from OpenAI, Anthropic, Google, and more.

Faster measured speed
Higher 30-day availability
Broader model coverage

$540.57/M

649 tok/s

100%

catclaw-moetu-vip

CatClaw API is an OpenAI-compatible LLM gateway at catclaw.moetu.vip, offering multi-model API access with transparent pricing.

Faster measured speed
Higher 30-day availability
More free-model options

$0.014/M

62 tok/s

100%

aio-intelligence

Provides API access to 230+ AI models including GPT, Claude, DeepSeek, Gemini, and Qwen with OpenAI-compatible interfaces.

Faster measured speed
Broader model coverage

171

$38.65/M

122 tok/s

31.5%

Ollama

API Endpoints

Ollama

API Endpoints

Health Check

API Benchmarks & Pricing

Recent Test Records

Similar API Provider Alternatives to Compare

Similar API Provider Alternatives to Compare

Provider	Why compare	Models	Free	Avg price	Speed	30d uptime
Ollama ollama-com Ollama provides a platform to run and integrate open-source AI models locally or in the cloud.	Current provider baseline	29	0	N/A	54 tok/s	99.4%
Future Hub api-futureppo-top Future Hub is a large-scale OpenAI-compatible API gateway at api.futureppo.top with extensive model coverage and pricing data.	Faster measured speed Higher 30-day availability More free-model options Broader model coverage	98	1	$2.45/M	67 tok/s	100%
ai-071129-xyz MapleLeaf API runs a New API-powered gateway on ai.071129.xyz for aggregated access to multiple AI models.	Higher 30-day availability More free-model options Broader model coverage	260	21	$55.27/M	38 tok/s	100%
api-kr777-top CaMeL AI provides an OpenAI-compatible API gateway with extensive model coverage and pricing options.	Faster measured speed More free-model options Broader model coverage	178	4	$27.28/M	119 tok/s	98.9%
naapi-cc Na API (naapi.cc) is an OpenAI-compatible LLM API gateway with competitive pricing and stable access to 100+ models from OpenAI, Anthropic, Google, and more.	Faster measured speed Higher 30-day availability Broader model coverage	65	0	$540.57/M	649 tok/s	100%
catclaw-moetu-vip CatClaw API is an OpenAI-compatible LLM gateway at catclaw.moetu.vip, offering multi-model API access with transparent pricing.	Faster measured speed Higher 30-day availability More free-model options	29	2	$0.014/M	62 tok/s	100%
aio-intelligence Provides API access to 230+ AI models including GPT, Claude, DeepSeek, Gemini, and Qwen with OpenAI-compatible interfaces.	Faster measured speed Broader model coverage	171	0	$38.65/M	122 tok/s	31.5%

Ollama

API Endpoints

Ollama

API Endpoints

About Ollama

Health Check

API Benchmarks & Pricing

Recent Test Records

Similar API Provider Alternatives to Compare

Similar API Provider Alternatives to Compare