Metrics
Keeping track of system metrics is crucial for maintaining a stable Rime self-hosted deployment. These insights help guide decisions about scaling and performance. To support this, Rime services expose multiple endpoints that let you monitor system health.
Rime API
A /health
route is available on port 8000 to give you a quick snapshot of overall status. You can check it with the following command:
A typical response looks like this:
This provides a simple health check mechanism to verify that both api and model services are up and responding.
Rime Model
To check if the model is running properly, you can perform a liveness probe using the /ping
endpoint:
A typical response looks like this:
Prometheus Metrics
⚠️ More metrics to be added in future releases.
For more detailed operational insights, the model service exposes Prometheus-compatible metrics at the /metrics endpoint:
This endpoint provides telemetry data including:
- HTTP Request Counters: Detailed breakdown of requests by endpoint, status code, and HTTP method
- Error Tracking: Counts of HTTP errors by type and status code
Example metrics include:
These metrics can be integrated with Prometheus monitoring systems to create dashboards and alerts for your Rime deployment. Read integration setup in the next page.