Troubleshooting your self-managed GitLab Duo setup

This content tells administrators how to debug their self-managed GitLab Duo setup.

Before you begin

Before you begin debugging, you should:

  • Be able to access open the gitlab-rails console.
  • Open a shell in the AI Gateway Docker image.
  • Know the endpoint where your:
    • AI Gateway is hosted.
    • Model is hosted.

You should also enable the feature flag expanded_ai_logging on the gitlab-rails console:

Feature.enable(:expanded_ai_logging)

Now, requests and responses from GitLab to the AI Gateway are logged to llm.log

Use debugging scripts

We provide two debugging scripts to help administrators verify their self-hosted setup.

  1. Debug the GitLab to AI Gateway connection. From your GitLab instance, run the Rake task:

    gitlab-rake gitlab:duo:verify_self_hosted_setup
    
  2. Debug the AI Gateway setup. For your AI Gateway container, run:

    docker exec -it <ai-gateway-container> sh
    poetry run python scripts/troubleshoot_selfhosted_installation.py --model-name "codegemma_7b" --model-endpoint
    "http://localhost:4000"
    

Verify the output of the commands, and fix accordingly.

If both commands are successful, but GitLab Duo Code Suggestions is still not working, raise an issue on the issue tracker.

Check if GitLab make a request to the model

From the GitLab Rails console, verify that the model is reachable by running:

model_name = "<your_model_name>"
model_endpoint = "<your_model_endpoint>"
model_api_key = "<your_model_api_key>"
body = {:prompt_components=>[{:type=>"prompt", :metadata=>{:source=>"GitLab EE", :version=>"17.3.0"}, :payload=>{:content=>[{:role=>:user, :content=>"Hello"}], :provider=>:litellm, :model=>model_name, :model_endpoint=>model_endpoint, :model_api_key=>model_api_key}}]}
client = Gitlab::Llm::AiGateway::Client.new(User.find_by_id(1), service_name: :self_hosted_models)
client.complete(endpoint: "/v1/chat/agent", body: body)

This should return a response from the model in the format:

{"response"=> "<Model response>",
 "metadata"=>
  {"provider"=>"litellm",
   "model"=>"<>",
   "timestamp"=>1723448920}}

If that is not the case, this means that the:

Check if a user can request Code Suggestions

In the GitLab Rails console, verify that a user can request Code Suggestions.

User.find_by_id("<user_id>").can?(:access_code_suggestions)

If this returns false, it means some configuration is missing that does not allow the user to access Code Suggestions.

This missing configuration might be either of the following:

Check if GitLab instance is configured to use self-hosted-models

To check if GitLab Duo was configured correctly:

  1. On the left sidebar, at the bottom, select Admin.
  2. Select Settings > General.
  3. Expand AI-powered features.
  4. Under Features, check that Code Suggestions and Code generation are set to Self-hosted model.

Check that GitLab environmental variables are set up correctly

To check if the GitLab environmental variables are set up correctly, run the following on the GitLab Rails console:

ENV["AI_GATEWAY_URL"] == "<your-ai-gateway-endpoint>"

If the environmental variables are not set up correctly, set them by following the Linux package custom environment variables setting documentation.

Check if GitLab can make an HTTP request to AI Gateway

In the GitLab Rails console, verify that the AI Gateway is reachable by running:

HTTParty.get('<your-aigateway-endpoint>/monitoring/healthz', headers: { 'accept' => 'application/json' }).code

If the response is not 200, this means either that:

Check if AI Gateway can make a request to the model

From the AI Gateway container, make an HTTP request to the AI Gateway API for a Code Suggestion. Replace:

  • <your_model_name> with the name of the model you are using, for example mistral or codegemma.
  • <your_model_endpoint> with the endpoint where the model is hosted.
docker exec -it <ai-gateway-container> sh
curl --request POST "http://localhost:5052/v1/chat/agent" \
     --header 'accept: application/json' \
     --header 'Content-Type: application/json' \
     --data '{ "prompt_components": [ { "type": "string", "metadata": { "source": "string", "version": "string" }, "payload": { "content": "Hello", "provider": "litellm", "model": "<your_model_name>", "model_endpoint": "<your_model_endpoint>" } } ], "stream": false }'

If the request fails:

Check if AI Gateway can process requests

docker exec -it <ai-gateway-container> sh
curl '<your-aigateway-endpoint>/monitoring/healthz'

If the response is not 200, this means that AI Gateway is not installed correctly. To resolve, follow the documentation on how to install AI Gateway.

Check that AI Gateway environmental variables are set up correctly

To check if the AI Gateway environmental variables are set up correctly, run the following in a console on the AI Gateway container:

docker exec -it <ai-gateway-container> sh
echo $AIGW_AUTH__BYPASS_EXTERNAL # must be true
echo $AIGW_CUSTOM_MODELS__ENABLED # must be true

If the environmental variables are not set up correctly, set them by creating a container.

Check if the model is reachable from AI Gateway

Create a shell on the AI Gateway container and make a curl request to the model. If you find that the AI Gateway cannot make that request, this might be caused by the:

  1. Model server not functioning correctly.
  2. Network settings around the container not being properly configured to allow requests to where the model is hosted.

To resolve this, contact your network administrator.