AI Gateway

The AI Gateway is a standalone service that gives access to AI-native GitLab Duo features.

GitLab operates an instance of the AI Gateway, based in the cloud. This instance is used by GitLab.com, GitLab Self-Managed, and GitLab Dedicated.

You can also use a self-hosted AI Gateway instance on GitLab Self-Managed through GitLab Duo Self-Hosted.

Region support

GitLab.com

For GitLab.com, the routing mechanism is based on the GitLab instance location instead of the user’s instance location.

Because GitLab.com is single-homed in us-east1, requests to the AI Gateway are routed to us-east4 in almost all cases. The routing might not always result in the absolute nearest deployment for every user.

GitLab Self-Managed and GitLab Dedicated

For GitLab Self-Managed and GitLab Dedicated, GitLab manages region selection. You cannot choose the deployment region of the AI Gateway. For more information, see available regions in the Runway service manifest.

Runway is the GitLab internal developer platform and is not available to external customers.

Automatic data routing

GitLab uses Cloudflare and Google Cloud Platform (GCP) load balancers to route AI gateway requests to the nearest available deployment automatically. This routing mechanism prioritizes low latency and efficient processing of user requests.

You cannot manually control this routing process. The following factors influence where data is routed:

Network latency: The primary routing mechanism focuses on minimizing latency. Data might be processed in a region other than the nearest one if network conditions dictate.
Service availability: In case of regional outages or service disruptions, requests might be automatically rerouted to ensure uninterrupted service.
Third-party dependencies: The GitLab AI infrastructure relies on third-party model providers, like Google Vertex AI, which have their own data-handling practices.

Direct and indirect connections

The IDE communicates directly with the AI Gateway by default, bypassing the GitLab monolith. This direct connection improves routing efficiency.

To change this behavior, configure direct and indirect connections for Code Suggestions.

Tracing requests to specific regions

You cannot directly trace your AI requests to specific regions.

If you need assistance with tracing a particular request, GitLab Support can access and analyze logs that contain Cloudflare headers and instance UUIDs. These logs provide insights into the routing path and can help identify the region where a request was processed.

Data sovereignty

The multi-region AI Gateway deployment does not enforce strict data sovereignty. Requests are not guaranteed to go to or remain in a particular region.

This service is not a data residency solution.

Deployment regions

GitLab deploys the AI Gateway in the following regions:

North America (us-east4)
Europe (europe-west2, europe-west3, and europe-west9)
Asia Pacific (asia-northeast1 and asia-northeast3)

For the most current information, see the Runway configuration file.

The exact location of the LLM models used by the AI Gateway is determined by third-party model providers. The models are not guaranteed to reside in the same geographical regions as the AI Gateway deployments. Data might flow to other regions where the model provider operates, even if the AI Gateway processes the initial request in a different region. Data is routed to the most optimal region based on performance and availability.