Commit 393a726

Merge pull request #469 from beraldoleal/sli-metrics
alerts: adding prometheus rules for basic alerts
2 parents dacb45e + 6017dfd commit 393a726

1 file changed: 25 additions, 0 deletions

@@ -0,0 +1,25 @@
+apiVersion: monitoring.coreos.com/v1
+kind: PrometheusRule
+metadata:
+  name: osc-alerts
+  namespace: openshift-sandboxed-containers-operator
+spec:
+  groups:
+    - name: osc_alerts
+      rules:
+        - alert: KataRemoteWorkloadFailureHigh
+          expr: kata_remote_workload_failure_ratio > 25
+          for: 30m
+          labels:
+            severity: warning
+          annotations:
+            summary: "High Kata Remote Workload Failure Ratio"
+            description: "The failure ratio of kata-remote workloads is above 25% for more than 30 minutes. This may indicate issues with the runtime or configuration."
+
+        - alert: kata_active_instance
+          expr: vector(1)
+          labels:
+            severity: info
+            purpose: "alive_signal"
+          annotations:
+            summary: "Kata instance alive signal"
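
The two rules above could be exercised with a promtool unit test along the following lines. This is a minimal sketch, not part of the commit. It assumes the contents of `spec` (the top-level `groups:` list) have been copied into a plain Prometheus rule file, here given the hypothetical name `osc-alerts.rules.yaml`, since promtool reads plain rule files rather than `PrometheusRule` resources. It also assumes `kata_remote_workload_failure_ratio` is exported as a single unlabeled series reporting a percentage; the real metric may carry labels, in which case `exp_labels` would need to include them.

```yaml
# osc-alerts.test.yaml (hypothetical file name)
rule_files:
  - osc-alerts.rules.yaml   # assumed: the groups from spec, saved as a plain rule file

evaluation_interval: 1m

tests:
  - interval: 1m
    input_series:
      # Assumed unlabeled series, held at 30 (above the threshold of 25) for 40 minutes.
      - series: 'kata_remote_workload_failure_ratio'
        values: '30+0x40'
    alert_rule_test:
      # Before the 30m "for" window has elapsed, the alert is only pending, not firing.
      - eval_time: 10m
        alertname: KataRemoteWorkloadFailureHigh
        exp_alerts: []
      # After the window has elapsed, the alert fires with the labels and annotations from the rule.
      - eval_time: 35m
        alertname: KataRemoteWorkloadFailureHigh
        exp_alerts:
          - exp_labels:
              severity: warning
            exp_annotations:
              summary: "High Kata Remote Workload Failure Ratio"
              description: "The failure ratio of kata-remote workloads is above 25% for more than 30 minutes. This may indicate issues with the runtime or configuration."
      # The alive signal has no "for" clause and a constant expression, so it fires immediately.
      - eval_time: 0m
        alertname: kata_active_instance
        exp_alerts:
          - exp_labels:
              severity: info
              purpose: alive_signal
            exp_annotations:
              summary: "Kata instance alive signal"
```

Under those assumptions, the test would be run with `promtool test rules osc-alerts.test.yaml`.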
