Koru API

new.api.koru.ink

Models: 45 models
From: --
Speed: 516 tok/s
Updated: 6/8/2026

Koru API delivers an OpenAI-compatible API relay with seamless access to a wide range of AI models.

Latency2.41 s

Created At3/30/2026

Recharge Rate¥7.30 per $1 quota

Features

DrawingTaskData Export

Website

API Endpoints

Endpoint 1Historical / Unverified
https://new.api.koru.ink

Claim this provider

Verify ownership to unlock provider management features:

Edit provider name, content, and links
Get featured with priority traffic and visibility boost
Display a verified badge to build user trust

Health Check

8%Recent availability

History (72 pts)

PastNow

API Benchmarks & Pricing

Compare 16 model rows across audit recency, latest speed tests, throughput, latency, and per-token pricing.

View all 45 models

Model	Input ($/M)	Output ($/M)	Audit	Speed	Latency
talkie-lm	-	-	—	93.7 t/s	0.87 s
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:freedefault	$75.00/M	$75.00/M	—	—	—
poolside/laguna-xs.2:freedefault	$75.00/M	$75.00/M	—	—	—
poolside/laguna-m.1:freedefault	$75.00/M	$75.00/M	—	—	—
deepseek/deepseek-v4-flash:freedefault	$75.00/M	$75.00/M	—	—	—
gemma-4-e2b-it	-	-	—	224.1 t/s	1.55 s
gemini-2.5-flash-禁止用于openclaw等	-	-	—	203.6 t/s	3.90 s
arcee-ai/trinity-large-thinking:freedefault	$75.00/M	$75.00/M	—	—	—
llama3.1-8B	-	-	—	891.4 t/s	0.91 s
minimax-m2.7default	$75.00/M	$75.00/M	—	—	—
nvidia/nemotron-3-super-120b-a12b:freedefault	$75.00/M	$75.00/M	—	—	—
qwen3.5-27b	-	-	—	63.6 t/s	13.78 s
qwen3.5-9b	-	-	—	164.7 t/s	7.09 s
mercury-2default	$75.00/M	$75.00/M	—	458.6 t/s	0.90 s
minimax/minimax-m2.5:freedefault	$75.00/M	$75.00/M	—	—	—
minimaxai/minimax-m2.5default	$75.00/M	$75.00/M	—	—	—

Showing 16 of 16 model rows

Recent Test Records

Time	Model	Speed	Latency
May 2, 05:48 PM	talkie-lm	93.73 tok/s	0.87s
May 2, 05:47 PM	llama3.1-8B	660.66 tok/s	0.64s
Apr 29, 03:22 AM	mercury-2	537.27 tok/s	0.75s
Apr 29, 03:21 AM	mercury-2	364.18 tok/s	0.83s
Apr 29, 03:21 AM	mercury-2	447.03 tok/s	0.99s
Apr 29, 03:17 AM	llama3.1-8B	935.36 tok/s	0.39s
Apr 29, 03:17 AM	llama3.1-8B	890.44 tok/s	0.53s
Apr 29, 03:16 AM	llama3.1-8B	1687.87 tok/s	0.38s
Apr 29, 03:16 AM	llama3.1-8B	814.09 tok/s	0.53s
Apr 21, 11:43 AM	gemma-4-e2b-it	266.55 tok/s	1.61s

Notes

Health checks: Scope: the 72-hour chart and recent availability measure API connectivity only. Each bar summarizes one hour of checks. Targets: LMSpeed tries the configured health check URL and provider status URL first, then API endpoints derived from known API hosts and recent speed-test base URLs. A website host is considered only when it looks like an API endpoint. Probe steps: each candidate goes through DNS lookup, TCP connection, TLS handshake for HTTPS, and an HTTP HEAD request with redirects followed. Probing stops after the first reachable candidate. Reachable criteria: every required network step must succeed. An HTTP response below 500 is treated as reachable, including 401 because it confirms that an authenticated API endpoint responded, except for statuses classified as blocked. Blocked results: HTTP 403, 429, 521, 525, and 530, plus detected WAF or Cloudflare challenges, are shown as blocked and excluded from availability calculations because LMSpeed cannot determine whether the API itself is down. Model availability: when a dedicated test key is configured, LMSpeed sends an authenticated GET request to a derived /models endpoint and compares returned model IDs with this provider's listed models. These per-model results appear in Models & Pricing and are not included in the provider connectivity percentage. Timeouts: TCP connection, TLS handshake, HTTP connectivity, and model requests each use a 5-second timeout. A full run can take longer when several candidates are tried. Frequency: a background worker checks all providers hourly by default. The schedule may be changed by the service operator, so timestamps show when checks actually ran. Limit: automated samples are not an SLA and do not guarantee account quota, every model, every region, or successful completion requests. Check the provider's own status page before making operational decisions.

Model

Input ($/M)

Output ($/M)

Audit

Speed

Latency

talkie-lm

—

93.7 t/s

0.87 s

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:freedefault

$75.00/M

—

poolside/laguna-xs.2:freedefault

$75.00/M

—

poolside/laguna-m.1:freedefault

$75.00/M

—

deepseek/deepseek-v4-flash:freedefault

$75.00/M

—

gemma-4-e2b-it

—

224.1 t/s

1.55 s

gemini-2.5-flash-禁止用于openclaw等

—

203.6 t/s

3.90 s

arcee-ai/trinity-large-thinking:freedefault

$75.00/M

—

llama3.1-8B

—

891.4 t/s

0.91 s

minimax-m2.7default

$75.00/M

—

nvidia/nemotron-3-super-120b-a12b:freedefault

$75.00/M

—

qwen3.5-27b

—

63.6 t/s

13.78 s

qwen3.5-9b

—

164.7 t/s

7.09 s

mercury-2default

$75.00/M

—

458.6 t/s

0.90 s

minimax/minimax-m2.5:freedefault

$75.00/M

—

minimaxai/minimax-m2.5default

$75.00/M

—

Time

Model

Speed

Latency

May 2, 05:48 PM

talkie-lm

93.73 tok/s

0.87s

May 2, 05:47 PM

llama3.1-8B

660.66 tok/s

0.64s

Apr 29, 03:22 AM

mercury-2

537.27 tok/s

0.75s

Apr 29, 03:21 AM

mercury-2

364.18 tok/s

0.83s

Apr 29, 03:21 AM

mercury-2

447.03 tok/s

0.99s

Apr 29, 03:17 AM

llama3.1-8B

935.36 tok/s

0.39s

Apr 29, 03:17 AM

llama3.1-8B

890.44 tok/s

0.53s

Apr 29, 03:16 AM

llama3.1-8B

1687.87 tok/s

0.38s

Apr 29, 03:16 AM

llama3.1-8B

814.09 tok/s

0.53s

Apr 21, 11:43 AM

gemma-4-e2b-it

266.55 tok/s

1.61s

Provider

Why compare

Models

Free

Avg price

Speed

30d uptime

Koru API

new-api-koru-ink

Koru API delivers an OpenAI-compatible API relay with seamless access to a wide range of AI models.

Current provider baseline

N/A

516 tok/s

18%

Seamee API

napi-seaya-link

Seamee API provides an AI model relay for accessing multiple LLMs through OpenAI-compatible endpoints.

Higher 30-day availability
More free-model options
Broader model coverage

575

275

N/A

101 tok/s

95.2%

llm-api

Provides API access to large language models for developers.

Higher 30-day availability
More free-model options
Broader model coverage

N/A

67.7%

api-kr777-top

CaMeL AI provides an OpenAI-compatible API gateway with extensive model coverage and pricing options.

Higher 30-day availability
Broader model coverage

194

$92.56/M

93 tok/s

95.1%

laozhang-api

A proxy service providing access to OpenAI, Claude, and Gemini models with simplified billing and setup.

Higher 30-day availability
Broader model coverage

134

$12.86/M

N/A

92.4%

api-futureppo-top

Future Hub is a large-scale OpenAI-compatible API gateway at api.futureppo.top with extensive model coverage and pricing data.

More free-model options
Broader model coverage

132

223

N/A

263 tok/s

14.8%

mynav-website

MyNav AI provides an OpenAI-compatible API relay at mynav.website.

More free-model options
Broader model coverage

221

N/A

14.8%

Notes

Health checks: Scope: the 72-hour chart and recent availability measure API connectivity only. Each bar summarizes one hour of checks. Targets: LMSpeed tries the configured health check URL and provider status URL first, then API endpoints derived from known API hosts and recent speed-test base URLs. A website host is considered only when it looks like an API endpoint. Probe steps: each candidate goes through DNS lookup, TCP connection, TLS handshake for HTTPS, and an HTTP HEAD request with redirects followed. Probing stops after the first reachable candidate. Reachable criteria: every required network step must succeed. An HTTP response below 500 is treated as reachable, including 401 because it confirms that an authenticated API endpoint responded, except for statuses classified as blocked. Blocked results: HTTP 403, 429, 521, 525, and 530, plus detected WAF or Cloudflare challenges, are shown as blocked and excluded from availability calculations because LMSpeed cannot determine whether the API itself is down. Model availability: when a dedicated test key is configured, LMSpeed sends an authenticated GET request to a derived /models endpoint and compares returned model IDs with this provider's listed models. These per-model results appear in Models & Pricing and are not included in the provider connectivity percentage. Timeouts: TCP connection, TLS handshake, HTTP connectivity, and model requests each use a 5-second timeout. A full run can take longer when several candidates are tried. Frequency: a background worker checks all providers hourly by default. The schedule may be changed by the service operator, so timestamps show when checks actually ran. Limit: automated samples are not an SLA and do not guarantee account quota, every model, every region, or successful completion requests. Check the provider's own status page before making operational decisions.

Koru API

Features

API Endpoints

Health Check

API Benchmarks & Pricing

Recent Test Records

Notes

Koru API

Features

API Endpoints

Health Check

API Benchmarks & Pricing

Recent Test Records

Similar API Provider Alternatives to Compare

Notes

Similar API Provider Alternatives to Compare

Provider	Why compare	Models	Free	Avg price	Speed	30d uptime
Koru API new-api-koru-ink Koru API delivers an OpenAI-compatible API relay with seamless access to a wide range of AI models.	Current provider baseline	40	45	N/A	516 tok/s	18%
Seamee API napi-seaya-link Seamee API provides an AI model relay for accessing multiple LLMs through OpenAI-compatible endpoints.	Higher 30-day availability More free-model options Broader model coverage	575	275	N/A	101 tok/s	95.2%
llm-api Provides API access to large language models for developers.	Higher 30-day availability More free-model options Broader model coverage	82	71	N/A	N/A	67.7%
api-kr777-top CaMeL AI provides an OpenAI-compatible API gateway with extensive model coverage and pricing options.	Higher 30-day availability Broader model coverage	194	4	$92.56/M	93 tok/s	95.1%
laozhang-api A proxy service providing access to OpenAI, Claude, and Gemini models with simplified billing and setup.	Higher 30-day availability Broader model coverage	134	1	$12.86/M	N/A	92.4%
api-futureppo-top Future Hub is a large-scale OpenAI-compatible API gateway at api.futureppo.top with extensive model coverage and pricing data.	More free-model options Broader model coverage	132	223	N/A	263 tok/s	14.8%
mynav-website MyNav AI provides an OpenAI-compatible API relay at mynav.website.	More free-model options Broader model coverage	82	221	N/A	N/A	14.8%

Koru API

Features

API Endpoints

About Koru API

Health Check

API Benchmarks & Pricing

Recent Test Records

Notes

Koru API

Features

API Endpoints

About Koru API

Health Check

API Benchmarks & Pricing

Recent Test Records

Similar API Provider Alternatives to Compare

Notes

Similar API Provider Alternatives to Compare