triton

Triton Inference Server Get model inference statistics

Retrieve inference statistics for a specific model including request count, execution count, and cumulative timing information. This is a Triton extension to the KServe protocol.

GitHub