Learn how to use streaming inference with TensorZero Gateway.
stream parameter to true in your inference request.
The response will be returned as a Server-Sent Events (SSE) stream, followed by a final [DONE] message.
When using a client library, the client will handle the SSE stream under the hood and return a stream of chunk objects.
See API Reference for more details.
[DONE] message)