This guide shows how to set up a minimal deployment to use the TensorZero Gateway with GCP Vertex AI Gemini.
Setup
For this minimal setup, you’ll need just two files in your project directory:
Directoryconfig/
- docker-compose.yml
For production deployments, see our Deployment Guide.
Configuration
Create a minimal configuration file that defines a model and a simple chat function:
See the list of models available on GCP Vertex AI Gemini.
Credentials
You must generate a GCP service account key in JWT form (described here) and point to it in the GCP_VERTEX_CREDENTIALS_PATH
environment variable.
Deployment (Docker Compose)
Create a minimal Docker Compose configuration:
You can start the gateway with docker compose up
.
Inference
Make an inference request to the gateway: