Configuring the monitoring stack

The {product-title} 4 installation program provides only a low number of configuration options before installation. Configuring most {product-title} framework components, including the cluster monitoring stack, happens post-installation.

This section explains what configuration is supported, shows how to configure the monitoring stack, and demonstrates several common configuration scenarios.

Prerequisites

The monitoring stack imposes additional resource requirements. Consult the computing resources recommendations in Scaling the Cluster Monitoring Operator and verify that you have sufficient resources.

modules/monitoring-maintenance-and-support.adoc modules/monitoring-support-considerations.adoc modules/monitoring-unmanaged-monitoring-operators.adoc

Preparing to configure the monitoring stack

You can configure the monitoring stack by creating and updating monitoring config maps.

modules/monitoring-creating-cluster-monitoring-configmap.adoc modules/monitoring-creating-user-defined-workload-monitoring-configmap.adoc

Additional resources

Enabling monitoring for user-defined projects

modules/monitoring-configuring-the-monitoring-stack.adoc

Additional resources

See Preparing to configure the monitoring stack for steps to create monitoring config maps
Enabling monitoring for user-defined projects

modules/monitoring-configurable-monitoring-components.adoc

modules/monitoring-moving-monitoring-components-to-different-nodes.adoc

Additional resources

See Preparing to configure the monitoring stack for steps to create monitoring config maps
Enabling monitoring for user-defined projects
Understanding how to update labels on nodes
Placing pods on specific nodes using node selectors
See the Kubernetes documentation for details on the nodeSelector constraint

modules/monitoring-assigning-tolerations-to-monitoring-components.adoc

Additional resources

See Preparing to configure the monitoring stack for steps to create monitoring config maps
Enabling monitoring for user-defined projects
See the {product-title} documentation on taints and tolerations
See the Kubernetes documentation on taints and tolerations

Configuring persistent storage

Running cluster monitoring with persistent storage means that your metrics are stored to a persistent volume (PV) and can survive a pod being restarted or recreated. This is ideal if you require your metrics or alerting data to be guarded from data loss. For production environments, it is highly recommended to configure persistent storage. Because of the high IO demands, it is advantageous to use local storage.

Important

If you are running cluster monitoring with an attached PVC for Prometheus, you might experience OOM kills during cluster upgrade. When persistent storage is in use for Prometheus, Prometheus memory usage doubles during cluster upgrade and for several hours after upgrade is complete. To avoid the OOM kill issue, allow worker nodes with double the size of memory that was available prior to the upgrade. For example, if you are running monitoring on the minimum recommended nodes, which is 2 cores with 8 GB of RAM, increase memory to 16 GB. For more information, see BZ#1925061.

Note	See Recommended configurable storage technology.

Persistent storage prerequisites

Dedicate sufficient local persistent storage to ensure that the disk does not become full. How much storage you need depends on the number of pods. For information on system requirements for persistent storage, see Prometheus database storage requirements.
Make sure you have a persistent volume (PV) ready to be claimed by the persistent volume claim (PVC), one PV for each replica. Because Prometheus has two replicas and Alertmanager has three replicas, you need five PVs to support the entire monitoring stack. The PVs should be available from the Local Storage Operator. This does not apply if you enable dynamically provisioned storage.
Use the block type of storage.

Configure local persistent storage.

Note	If you use a local volume for persistent storage, do not use a raw block volume, which is described with `volumeMode: block` in the `LocalVolume` object. Prometheus cannot use raw block volumes.

modules/monitoring-configuring-a-local-persistent-volume-claim.adoc modules/monitoring-modifying-retention-time-for-prometheus-metrics-data.adoc

Additional resources

See Preparing to configure the monitoring stack for steps to create monitoring config maps
Enabling monitoring for user-defined projects
Understanding persistent storage
Optimizing storage

modules/monitoring-configuring-remote-write.adoc

Additional resources

See Setting up remote write compatible endpoints for steps to create a remote write compatible endpoint (such as Thanos).
See Tuning remote write settings for information about how to optimize remote write settings for different use cases.
For information about additional optional fields, please refer to the API documentation.

modules/monitoring-limiting-scrape-samples-in-user-defined-projects.adoc modules/monitoring-setting-a-scrape-sample-limit-for-user-defined-projects.adoc modules/monitoring-creating-scrape-sample-alerts.adoc

Additional resources

Creating a user-defined workload monitoring config map
Enabling monitoring for user-defined projects
See Determining why Prometheus is consuming a lot of disk space for steps to query which metrics have the highest number of scrape samples

modules/monitoring-configuring-external-alertmanagers.adoc

modules/monitoring-attaching-additional-labels-to-your-time-series-and-alerts.adoc

Additional resources

See Preparing to configure the monitoring stack for steps to create monitoring config maps
Enabling monitoring for user-defined projects
See Preparing to configure the monitoring stack for steps to create monitoring config maps

modules/monitoring-setting-log-levels-for-monitoring-components.adoc

modules/monitoring-setting-query-log-file-for-prometheus.adoc

Additional resources

See Preparing to configure the monitoring stack for steps to create monitoring config maps
Enabling monitoring for user-defined projects

modules/monitoring-disabling-grafana.adoc

Additional resources

See Preparing to configure the monitoring stack for steps to create monitoring config maps

modules/monitoring-disabling-the-local-alertmanager.adoc

Additional resources

Prometheus Alertmanager documentation
Managing alerts

Next steps

Enabling monitoring for user-defined projects
Learn about remote health reporting and, if necessary, opt out of it

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

configuring-the-monitoring-stack.adoc

configuring-the-monitoring-stack.adoc

Configuring the monitoring stack

Prerequisites

Preparing to configure the monitoring stack

Configuring persistent storage

Persistent storage prerequisites

Next steps

Files

configuring-the-monitoring-stack.adoc

Latest commit

History

configuring-the-monitoring-stack.adoc

File metadata and controls

Configuring the monitoring stack

Prerequisites

Preparing to configure the monitoring stack

Configuring persistent storage

Persistent storage prerequisites

Next steps