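A unified LLM API gateway that provides access to multiple AI models, with enterprise-grade security, low latency, and high concurrency.

| Model | Speed (tokens/s) | Latency | Tests |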
|---|---|---|---|
| gpt-3.5-turbo-1106 | 156.38 t/s | 0.82s | 5 |
| gpt-3.5-turbo-16k | 145.74 t/s | 0.64s | 5 |
| gpt-3.5-turbo-0613 | 140.29 t/s | 0.50s | 5 |
| gpt-4o-mini-2024-07-18 | 130.72 t/s | 0.60s | 5 |
| gpt-4o-2024-05-13 | 122.74 t/s | 1.60s | 5 |
| gpt-3.5-turbo | 121.66 t/s | 0.79s | 5 |
| gpt-3.5-turbo-16k-0613 | 108.01 t/s | 0.73s | 5 |
| gpt-4o-2024-08-06 | 107.84 t/s | 3.23s | 5 |
| gpt-4o | 101.73 t/s | 1.24s | 10 |
| gpt-4o-mini | 101.71 t/s | 0.68s | 5 |
| gpt-3.5-turbo-0125 | 98.60 t/s | 0.70s | 5 |
| gpt-4o-2024-11-20 | 90.36 t/s | 0.93s | 10 |
| gpt-4-1106-preview | 42.07 t/s | 0.82s | 5 |
| gpt-4-0125-preview | 28.16 t/s | 0.67s | 5 |
| gpt-4-turbo | 27.93 t/s | 0.73s | 5 |
| deepseek-r1 | 9.84 t/s | 8.43s | 8 |

| Time | Model | Speed (tokens/s) | Latency |
|---|---|---|---|
| Feb 20, 03:09 AM | gpt-4o | 86.05 t/s | 0.58s |
| Feb 19, 07:59 AM | deepseek-r1 | 6.01 t/s | 16.05s |
| Feb 19, 06:21 AM | gpt-4-0125-preview | 28.16 t/s | 0.67s |
| Feb 19, 06:01 AM | gpt-4o-mini-2024-07-18 | 130.72 t/s | 0.60s |
| Feb 19, 05:55 AM | gpt-4-1106-preview | 42.07 t/s | 0.82s |
| Feb 19, 05:53 AM | gpt-3.5-turbo-16k-0613 | 108.01 t/s | 0.73s |
| Feb 19, 05:53 AM | gpt-3.5-turbo-16k | 145.74 t/s | 0.64s |
| Feb 19, 05:53 AM | gpt-3.5-turbo-0613 | 140.29 t/s | 0.50s |
| Feb 19, 05:52 AM | gpt-3.5-turbo-1106 | 156.38 t/s | 0.82s |
| Feb 19, 05:52 AM | gpt-3.5-turbo | 121.66 t/s | 0.79s |
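
The page does not say how the speed and latency figures are measured. As a rough sketch only, the Python below assumes that latency means time to first token and that speed means generated tokens divided by the time between the first and last token. The `StreamTiming` type and the function names are hypothetical and are not part of the gateway's API.

```python
from dataclasses import dataclass


@dataclass
class StreamTiming:
    """Timestamps (in seconds) recorded while streaming one response.

    Hypothetical container for illustration; not part of any gateway API.
    """
    request_sent: float  # when the request was sent
    first_token: float   # when the first token arrived
    last_token: float    # when the final token arrived
    tokens: int          # number of tokens generated


def latency_seconds(t: StreamTiming) -> float:
    # Assumed definition: time from sending the request to the first token.
    return t.first_token - t.request_sent


def tokens_per_second(t: StreamTiming) -> float:
    # Assumed definition: generated tokens divided by the generation window.
    window = t.last_token - t.first_token
    if window <= 0:
        raise ValueError("generation window must be positive")
    return t.tokens / window


def summarize(runs: list[StreamTiming]) -> tuple[float, float]:
    """Return (mean tokens/s, mean latency in seconds) over several runs."""
    if not runs:
        raise ValueError("at least one run is required")
    speeds = [tokens_per_second(r) for r in runs]
    latencies = [latency_seconds(r) for r in runs]
    return sum(speeds) / len(speeds), sum(latencies) / len(latencies)
```

Under these assumptions, a run that receives its first token 0.5 s after the request and then streams 200 tokens over the next 2 s would be reported as 100 t/s with 0.5 s latency. Whether the tables above average over runs in this way is not stated in the source.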