LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed

Check whether your API relay swaps, injects, or leaks data

Paste an OpenAI-compatible endpoint, API key, and model ID. LMSpeed runs supply-chain security probes and returns a redacted shareable report.

Steps
11
Risk surfaces
9
Output
Redacted report

More than a speed test: inspect whether the relay path was tampered with

lmspeed puts model identity, prompt leakage, context boundaries, error leakage, and stream integrity into one security comparison table, so you can baseline a relay before wiring it into production.

Dimensionlmspeedhvoy.aicctest.ai
Token injectionCompare actual token usage with the expected countCoveredNot coveredCovered
Prompt extractionProbe hidden system prompt leakageCoveredNot coveredNot covered
Identity substitutionDetect whether Claude is actually answered by another modelCoveredCoveredNot covered
Jailbreak defenseCheck common jailbreak vectorsCoveredNot coveredNot covered
Context truncationFind the real context-window boundaryCoveredNot coveredNot covered
Tool-call rewrite (AC-1.a)Detect rewritten package commands and tool argumentsCoveredNot coveredNot covered
Error response leakage (AC-2)Probe credentials, paths, and internal field leakageCoveredNot coveredNot covered
Stream integrity (SSE)Validate event types, usage, and thinking signaturesCoveredCoveredNot covered
Web3 injectionCheck whether signing context is polluted by the relay layerCoveredNot coveredNot covered
Channel fingerprintProtobuf signatures and multimodal interpretation checksIn designSoonNot coveredCovered
CoveredCoveredNot coveredNot coveredIn designSoonIn design

How the 11-step audit breaks down relay risk

Each check keeps public evidence redacted: you can see where the path looks suspicious without publishing API keys, system prompts, or internal paths.

Threat categories are based on Liu et al., "Your Agent Is Mine" (arXiv:2604.08407)

Step 1-2

Infrastructure reconnaissance

DNS, CDN, SSL certificates, admin-panel fingerprints, and model-list enumeration establish the relay's technical stack.

Step 3

Token injection (AC-1)

Compare actual token usage with expected usage. Hidden system prompt injection adds extra tokens, and the delta can reveal the injection size.

Step 4 & 6

Prompt extraction

Try three vectors to extract hidden system prompts: direct repetition, translation, and JSON continuation, plus jailbreak-defense checks.

Step 5

Identity substitution

Use 24 keywords to detect whether Claude is actually GPT, DeepSeek, GLM, Qwen, or another model, then confirm with anchor phrases.

Step 7

Context truncation

Five canary markers plus binary search locate the real context-window boundary. Does your 200K context really hold 200K?

Step 8 (AC-1.a)

Tool-call rewrite

Detect whether the relay rewrites package-install commands on the return path, a proxy-layer typosquatting supply-chain attack.

Step 9 (AC-2)

Error response leakage

Send seven intentionally malformed requests to see whether API keys, environment variables, file paths, or LiteLLM internals leak through errors.

Step 10-11

Stream integrity & Web3

Validate the SSE event allowlist, usage monotonicity, thinking-signature validity, and model identity, then add profile-gated Web3 signing-isolation probes.

LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Input Price
  • Output Price
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️