This document describes the metrics exported by Kepler for monitoring energy consumption at various levels (node, container, process, VM).
Kepler exports metrics in Prometheus format that can be scraped by Prometheus or other compatible monitoring systems.
- COUNTER: A cumulative metric that only increases over time
- GAUGE: A metric that can increase and decrease
These metrics provide energy and power information at the node level.
- Type: COUNTER
- Description: Energy consumption of cpu in active state at node level in joules
- Labels:
zonepath
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of cpu in active state at node level in watts
- Labels:
zonepath
- Constant Labels:
node_name
- Type: COUNTER
- Description: Energy consumption of cpu in idle state at node level in joules
- Labels:
zonepath
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of cpu in idle state at node level in watts
- Labels:
zonepath
- Constant Labels:
node_name
- Type: GAUGE
- Description: CPU information from procfs
- Labels:
processorvendor_idmodel_namephysical_idcore_id
- Type: COUNTER
- Description: Energy consumption of cpu at node level in joules
- Labels:
zonepath
- Constant Labels:
node_name
- Type: GAUGE
- Description: CPU usage ratio of a node (value between 0.0 and 1.0)
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of cpu at node level in watts
- Labels:
zonepath
- Constant Labels:
node_name
- Type: COUNTER
- Description: Energy consumption of gpu in active state at node level in joules
- Labels:
gpugpu_uuidgpu_namevendor
- Constant Labels:
node_name
- Type: GAUGE
- Description: GPU active power (total - idle) in watts
- Labels:
gpugpu_uuidgpu_namevendor
- Constant Labels:
node_name
- Type: COUNTER
- Description: Energy consumption of gpu in idle state at node level in joules
- Labels:
gpugpu_uuidgpu_namevendor
- Constant Labels:
node_name
- Type: GAUGE
- Description: GPU idle power (auto-detected minimum) in watts
- Labels:
gpugpu_uuidgpu_namevendor
- Constant Labels:
node_name
- Type: COUNTER
- Description: Energy consumption of gpu at node level in joules
- Labels:
gpugpu_uuidgpu_namevendor
- Constant Labels:
node_name
- Type: GAUGE
- Description: Total GPU power consumption in watts
- Labels:
gpugpu_uuidgpu_namevendor
- Constant Labels:
node_name
These metrics provide energy and power information for containers.
- Type: COUNTER
- Description: Energy consumption of cpu at container level in joules
- Labels:
container_idcontainer_nameruntimestatezonepod_id
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of cpu at container level in watts
- Labels:
container_idcontainer_nameruntimestatezonepod_id
- Constant Labels:
node_name
- Type: COUNTER
- Description: Energy consumption of gpu at container level in joules
- Labels:
container_idcontainer_nameruntimestatepod_id
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of gpu at container level in watts
- Labels:
container_idcontainer_nameruntimestatepod_id
- Constant Labels:
node_name
These metrics provide energy and power information for individual processes.
- Type: COUNTER
- Description: Energy consumption of cpu at process level in joules
- Labels:
pidcommexetypestatecontainer_idvm_idzone
- Constant Labels:
node_name
- Type: COUNTER
- Description: Total user and system time of cpu at process level in seconds
- Labels:
pidcommexetypecontainer_idvm_id
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of cpu at process level in watts
- Labels:
pidcommexetypestatecontainer_idvm_idzone
- Constant Labels:
node_name
- Type: COUNTER
- Description: Energy consumption of gpu at process level in joules
- Labels:
pidcommexetypestatecontainer_idvm_id
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of gpu at process level in watts
- Labels:
pidcommexetypestatecontainer_idvm_id
- Constant Labels:
node_name
These metrics provide energy and power information for virtual machines.
- Type: COUNTER
- Description: Energy consumption of cpu at vm level in joules
- Labels:
vm_idvm_namehypervisorstatezone
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of cpu at vm level in watts
- Labels:
vm_idvm_namehypervisorstatezone
- Constant Labels:
node_name
These metrics provide energy and power information for pods.
- Type: COUNTER
- Description: Energy consumption of cpu at pod level in joules
- Labels:
pod_idpod_namepod_namespacestatezone
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of cpu at pod level in watts
- Labels:
pod_idpod_namepod_namespacestatezone
- Constant Labels:
node_name
- Type: COUNTER
- Description: Energy consumption of gpu at pod level in joules
- Labels:
pod_idpod_namepod_namespacestate
- Constant Labels:
node_name
- Type: GAUGE
- Description: Power consumption of gpu at pod level in watts
- Labels:
pod_idpod_namepod_namespacestate
- Constant Labels:
node_name
Additional metrics provided by Kepler.
- Type: GAUGE
- Description: A metric with a constant '1' value labeled with version information
- Labels:
archbranchrevisionversiongoversion
These experimental metrics provide platform-level power information from BMC sources (e.g., Redfish). Enable the experimental Redfish feature to collect these metrics.
- Type: GAUGE
- Description: Current platform power in watts from BMC (PowerSubsystem or deprecated Power API)
- Labels:
sourcenode_namebmc_idchassis_idsource_idsource_namesource_type
This documentation was automatically generated by the gen-metric-docs tool.