llama-3.1
0.23s |
| 5 |
35.89 t/s |
3.54s |
| Feb 16, 03:23 AM | llama-3.1-nemotron-nano-4b-v1.1 | 61.56 t/s | 2.93s |
| Feb 16, 03:23 AM | llama-3.1-nemotron-nano-4b-v1.1 | 68.81 t/s | 2.74s |
| Aug 21, 12:17 AM | nvidia/llama-3.1-nemotron-70b-instruct | 52.14 t/s | 0.40s |
| Aug 21, 12:17 AM | nvidia/llama-3.1-nemotron-70b-instruct | 52.30 t/s | 0.17s |
| Aug 21, 12:17 AM | nvidia/llama-3.1-nemotron-70b-instruct | 52.13 t/s | 0.18s |
| Aug 21, 12:17 AM | nvidia/llama-3.1-nemotron-70b-instruct | 52.10 t/s | 0.16s |
| Aug 21, 12:17 AM | nvidia/llama-3.1-nemotron-70b-instruct | 52.28 t/s | 0.22s |
| Aug 1, 10:55 AM | meta/llama-3.1-70b-instruct | 39.00 t/s | 0.40s |
| Aug 1, 10:55 AM | meta/llama-3.1-70b-instruct | 38.29 t/s | 0.20s |
| Aug 1, 10:55 AM | meta/llama-3.1-70b-instruct | 53.02 t/s | 0.17s |
| Aug 1, 10:55 AM | meta/llama-3.1-70b-instruct | 75.07 t/s | 0.18s |
| Aug 1, 10:55 AM | meta/llama-3.1-70b-instruct | 50.52 t/s | 0.21s |