Useful endpoints for deploying and observing the TensorZero gateway.
GET /metrics
/metrics
endpoint for monitoring.
At the moment, the only available metric is request_count
, which counts the number of successful requests to the gateway.
The metric reports counts for both inference and feedback requests.
GET /status
/status
endpoint is a simple liveness probe.
It returns HTTP status code 200 if the gateway is running.
GET /health
/health
endpoint is a simple readiness probe that checks if the gateway can communicate with the database.
It returns HTTP status code 200 if the gateway is ready to serve requests.