Metrics Dictionary Guide
Service Ping metrics are defined in individual YAML files definitions from which the Metrics Dictionary is built. Currently, the metrics dictionary is built automatically once an hour.
- When a change to a metric is made in a YAML file, you can see the change in the dictionary within 1 hour of the change getting deployed to production.
- When a change to an event is made in a YAML file, you can see the change in the dictionary within 1 hour of the change getting merged to the master branch.
This guide describes the dictionary and how it’s implemented.
Metrics Definition and validation
We are using JSON Schema to validate the metrics definition.
This process is meant to ensure consistent and valid metrics defined for Service Ping. All metrics must:
- Comply with the defined JSON schema.
- Have a unique
key_path
. - Have an owner.
We currently have tier
as one of the required fields for a metric definition file, however, we are now moving towards replacing tier
with tiers
, for this purpose it is valid to add tiers
as a field in the metric definition files. Until the replacement process is complete, both tier
and tiers
would be valid fields that can be added to the metric definition files.
All metrics are stored in YAML files:
removed
are added to the Service Ping JSON payload.Each metric is defined in a YAML file consisting of a number of fields:
Field | Required | Additional information |
---|---|---|
key_path
| yes | JSON key path for the metric, location in Service Ping payload. |
description
| yes | |
product_group
| yes | The group that owns the metric. |
value_type
| yes |
string ; one of string , number , boolean , object .
|
status
| yes |
string ; status of the metric, may be set to active , removed , broken .
|
time_frame
| yes |
string or array ; may be set to 7d , 28d , all , none or an array including any of these values except for none .
|
data_source
| yes |
string ; may be set to a value like database , redis , redis_hll , prometheus , system , license , internal_events .
|
data_category
| yes |
string ; categories of the metric, may be set to operational , optional , subscription , standard . The default value is optional .
|
instrumentation_class
| no |
string ; used for metrics with data_source other than internal_events . See the class that implements the metric.
|
performance_indicator_type
| no |
array ; may be set to one of gmau , smau , paid_gmau , umau , customer_health_score , devops_report , lighthouse , or leading_indicator .
|
tier
| yes |
array ; may contain one or a combination of free , premium or ultimate . The tier where the tracked feature is available. This should be verbose and contain all tiers where a metric is available.
|
tiers
| no |
array ; may contain one or a combination of free , premium or ultimate . The tiers where the tracked feature is available. This should be verbose and contain all tiers where a metric is available.
|
milestone
| yes | The milestone when the metric is introduced and when it’s available to self-managed instances with the official GitLab release. |
milestone_removed
| no | The milestone when the metric is removed. Required for removed metrics. |
introduced_by_url
| yes | The URL to the merge request that introduced the metric to be available for self-managed instances. |
removed_by_url
| no | The URL to the merge request that removed the metric. Required for removed metrics. |
repair_issue_url
| no | The URL of the issue that was created to repair a metric with a broken status.
|
options
| no |
object : options information needed to calculate the metric value.
|
Metric key_path
The key_path
of the metric is the location in the JSON Service Ping payload.
The key_path
could be composed from multiple parts separated by .
and it must be unique.
If a metric definition has an array time_frame
, the key_path
defined in the YAML file will have a suffix automatically added for each of the included time frames:
time_frame |
key_path suffix
|
---|---|
all
| no suffix |
7d
| _weekly
|
28d
| _monthly
|
The key_path
s shown in the Metrics Dictionary include those suffixes.
Metric statuses
Metric definitions can have one of the following statuses:
-
active
: Metric is used and reports data. -
broken
: Metric reports broken data (for example, -1 fallback), or does not report data at all. A metric marked asbroken
must also have therepair_issue_url
attribute. -
removed
: Metric was removed, but it may appear in Service Ping payloads sent from instances running on older versions of GitLab.
Metric value_type
Metric definitions can have one of the following values for value_type
:
boolean
number
string
-
object
: A metric withvalue_type: object
must havevalue_json_schema
with a link to the JSON schema for the object. In general, we avoid complex objects and prefer one of theboolean
,number
, orstring
value types. An example of a metric that usesvalue_type: object
istopology
(/config/metrics/settings/20210323120839_topology.yml
), which has a related schema in/config/metrics/objects_schemas/topology_schema.json
.
Metric time_frame
A metric’s time frame is calculated based on the time_frame
field and the data_source
of the metric. When time_frame
is an array, the metric’s values are calculated for each of the included time frames.
data_source | time_frame | Description |
---|---|---|
any | none
| A type of data that’s not tracked over time, such as settings and configuration information |
database
| all
| The whole time the metric has been active (all-time interval) |
database
| 7d
| 9 days ago to 2 days ago |
database
| 28d
| 30 days ago to 2 days ago |
internal_events
| all
| The whole time the metric has been active (all-time interval) |
internal_events
| 7d
| Most recent complete week |
internal_events
| 28d
| Most recent 4 complete weeks |
Data category
We use the following categories to classify a metric:
-
operational
: Required data for operational purposes. -
optional
: Default value for a metric. Data that is optional to collect. This can be enabled or disabled in the Admin area. -
subscription
: Data related to licensing. -
standard
: Standard set of identifiers that are included when collecting data.
Example YAML metric definition
The linked uuid
YAML file includes an example metric definition, where the uuid
metric is the GitLab
instance unique identifier.
key_path: uuid
description: GitLab instance unique identifier
product_group: analytics_instrumentation
value_type: string
status: active
milestone: 9.1
instrumentation_class: UuidMetric
introduced_by_url: https://gitlab.com/gitlab-org/gitlab/-/merge_requests/1521
time_frame: none
data_source: database
tier:
- free
- premium
- ultimate
tiers:
- free
- premium
- ultimate
Create a new metric definition
The GitLab codebase provides dedicated generators to create new metrics, which also create valid metric definition files:
For uniqueness, the generated files include a timestamp prefix in ISO 8601 format.
Performance Indicator Metrics
To use a metric definition to manage performance indicator:
- Create a merge request that includes related changes.
- Use labels
~"analytics instrumentation"
,"~Data Warehouse::Impact Check"
. - Update the metric definition
performance_indicator_type
field. - Create an issue in GitLab Product Data Insights project with the PI Chart Help template to have the new metric visualized.
Metrics Dictionary
Metrics Dictionary is a separate application.
All metrics available in Service Ping are in the Metrics Dictionary.