diff --git a/integrations/llms/vertex-ai.mdx b/integrations/llms/vertex-ai.mdx
index e0ba4e70..a05c3ce0 100644
--- a/integrations/llms/vertex-ai.mdx
+++ b/integrations/llms/vertex-ai.mdx
@@ -111,6 +111,12 @@ Save your configuration. Your provider slug will be `@vertex-ai` (or a custom na
+
+**Self-Hosted GKE Deployments:** When running the Portkey Gateway on GKE with [Workload Identity Federation](/self-hosting/hybrid-deployments/gcp#setting-up-iam-permission) enabled (`GCP_AUTH_MODE=workload`), the Gateway automatically acquires Vertex AI access tokens from the GCP metadata server. No manual `Authorization` header is needed.
+
+The GSA bound to the Gateway's KSA must have the `roles/aiplatform.user` role. See the [GCP deployment guide](/self-hosting/hybrid-deployments/gcp#setting-up-iam-permission) for setup.
+
+
 **To use Anthropic models on Vertex AI**, prepend `anthropic.` to the model name. Example: `@vertex-ai/anthropic.claude-3-5-sonnet@20240620`
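
Review note: readers may want a way to confirm that Workload Identity is actually working before relying on the behavior this change documents. The sketch below shows how a token is generally obtained from the GCP metadata server from inside a pod. It is not the Gateway's implementation, which this diff does not show. The endpoint URL and the `Metadata-Flavor: Google` header are the standard metadata-server conventions; the function names are hypothetical and chosen only for illustration.

```python
import json
import urllib.request

# Standard GCE/GKE metadata-server endpoint for the default service account's token.
METADATA_TOKEN_URL = (
    "http://metadata.google.internal/computeMetadata/v1/"
    "instance/service-accounts/default/token"
)


def build_token_request(url: str = METADATA_TOKEN_URL) -> urllib.request.Request:
    """Build the metadata-server request (hypothetical helper).

    The metadata server rejects requests that lack the Metadata-Flavor header.
    """
    return urllib.request.Request(url, headers={"Metadata-Flavor": "Google"})


def parse_access_token(body: bytes) -> str:
    """Extract the bearer token from the metadata server's JSON response."""
    payload = json.loads(body)
    return payload["access_token"]


def fetch_access_token(timeout: float = 5.0) -> str:
    """Fetch a token. This only resolves from inside GCP, e.g. a GKE pod."""
    with urllib.request.urlopen(build_token_request(), timeout=timeout) as resp:
        return parse_access_token(resp.read())
```

Calling `fetch_access_token()` from a shell inside the Gateway pod is one way to check the setup: if it raises an HTTP or connection error, the KSA-to-GSA binding described in the GCP deployment guide is the first thing to verify. A token being returned does not by itself prove that the GSA holds `roles/aiplatform.user`.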