Evaluating rule failed Alert: CephNodeDiskspaceWarning #34

Open
Nils98Ar opened this issue Sep 4, 2023 · 1 comment

Comments

@Nils98Ar

Nils98Ar commented Sep 4, 2023

I extracted the following from the output of docker logs prometheus_server. Any idea what the problem is and how it can be fixed?

found duplicate series for the match group {instance=\"<internal_address of monitoring node>:9100\"} on the right hand-side of the operation: [
	{__name__=\"node_uname_info\", domainname=\"(none)\", instance=\"<internal_address of monitoring node>:9100\", job=\"node\",              machine=\"x86_64\", nodename=\"<hostname of monitoring node>\", release=\"<kernel release monitoring node>\", sysname=\"Linux\", version=\"<kernel version monitoring node>\"},
	{__name__=\"node_uname_info\", domainname=\"(none)\", instance=\"<internal_address of monitoring node>:9100\", job=\"ceph_nodeexporter\", machine=\"x86_64\", nodename=\"<hostname of monitoring node>\", release=\"<kernel release monitoring node>\", sysname=\"Linux\", version=\"<kernel version monitoring node>\"}
];
many-to-many matching not allowed: matching labels must be unique on one side
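
The match group in the error, {instance=...}, suggests the rule joins on the instance label only, and the two listed series differ only in their job label (node vs. ceph_nodeexporter). The following is a minimal Python sketch of that uniqueness check, as a simplified model rather than Prometheus code. The function name find_duplicate_match_groups and the address "monitoring-node:9100" are hypothetical placeholders, and most labels from the log are omitted for brevity.

```python
from collections import defaultdict


def find_duplicate_match_groups(series, on_labels):
    """Group series by the matching labels; return groups with more than one member."""
    groups = defaultdict(list)
    for labels in series:
        # The match group key is built only from the labels used for matching.
        key = tuple((name, labels.get(name, "")) for name in on_labels)
        groups[key].append(labels)
    return {key: members for key, members in groups.items() if len(members) > 1}


# Placeholder address; the real one is redacted in the log above.
right_hand_side = [
    {"__name__": "node_uname_info", "instance": "monitoring-node:9100", "job": "node"},
    {"__name__": "node_uname_info", "instance": "monitoring-node:9100", "job": "ceph_nodeexporter"},
]

# Matching on instance alone: both series land in the same match group.
duplicates = find_duplicate_match_groups(right_hand_side, ["instance"])
for key, members in duplicates.items():
    print(dict(key), "->", [m["job"] for m in members])

# Matching on instance and job: each series gets its own match group.
no_duplicates = find_duplicate_match_groups(right_hand_side, ["instance", "job"])
print("duplicates when also matching on job:", len(no_duplicates))
```

In this model the same node exporter target appears under two scrape jobs, so a match on instance alone cannot tell the two series apart. Whether the better remedy is to scrape that target under a single job or to adjust the matching labels in the rule depends on the setup and is not confirmed by this thread.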

I have tried the ceph.rules from kolla-operations as well as this one from the upstream project: https://github.com/ceph/ceph/blob/7ae97667c9b7e4d86bb8976c2a96700aa3d4b1ce/monitoring/ceph-mixin/prometheus_alerts.yml

I'm not very familiar with Prometheus...

@Nils98Ar
Author

@berendt @michaelbayr

Any hint on where I could continue? Am I right that you don't see this "duplicate series for the match group" problem?

This seems to be the last "false positive" alert in our environment.
