Analytics

Requests per minute

Inference requests per minute

TTFT

Time-to-first-token latency

TPS

Completion tokens per second

Duration

Inference duration

Success Rate

Percentage of successful requests

Requests by Model

Requests by Model Provider
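
As a rough illustration of how the metrics listed above could be derived from raw request logs, here is a minimal Python sketch. The record fields (start_s, first_token_s, end_s, and so on), the summarize function, and the choice to compute TPS as total completion tokens divided by total inference duration are all assumptions made for this example; the actual dashboard may define and aggregate these values differently.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class RequestRecord:
    # Hypothetical log schema; every field name here is an assumption.
    start_s: float          # when the request started, in seconds
    first_token_s: float    # when the first token arrived, in seconds
    end_s: float            # when the request finished, in seconds
    completion_tokens: int  # number of completion tokens generated
    success: bool           # whether the request succeeded
    model: str
    provider: str


def summarize(records, window_s):
    """Aggregate the dashboard metrics over a time window of window_s seconds."""
    n = len(records)
    if n == 0 or window_s <= 0:
        raise ValueError("need at least one record and a positive window")

    total_duration = sum(r.end_s - r.start_s for r in records)
    total_tokens = sum(r.completion_tokens for r in records)

    return {
        "requests_per_minute": n / (window_s / 60),
        "avg_ttft_s": sum(r.first_token_s - r.start_s for r in records) / n,
        # Assumed definition: tokens divided by total inference time.
        "tps": total_tokens / total_duration if total_duration > 0 else 0.0,
        "avg_duration_s": total_duration / n,
        "success_rate_pct": 100 * sum(r.success for r in records) / n,
        "requests_by_model": Counter(r.model for r in records),
        "requests_by_provider": Counter(r.provider for r in records),
    }
```

Calling summarize with a list of RequestRecord values and the window length in seconds returns one dictionary containing all seven figures, with the per-model and per-provider counts as Counter objects.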