1163 Commits

Author SHA1 Message Date
Kevin Fenzi
a754144f19 Update infra pagure.io links to forge.fp.o (WIP)
This should update all the references we have to
https://pagure.io/fedora-infrastructure to the
new https://forge.fedoraproject.org/infra/tickets/ area.

Do not merge this before the migration on Tuesday.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2026-01-20 14:39:40 -08:00
Greg Sutcliffe
4b8780246b Nagios: another batch of services moved to Zabbix
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2026-01-15 15:43:45 +00:00
Greg Sutcliffe
93ed0457e0 Nagios: remove first batch of services
This removes the known-good things we've had in Zabbix for a while -
RAID, disk space, processes, and mail queue. It also removes swap which
we've decided we don't need.

Also includes some FS overrides on the Zabbix side so the relevant
NFS mounts get monitored on the OCI and pkgs hosts, as they were in Nagios.

Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2026-01-14 10:25:15 +00:00
Kevin Fenzi
5dee660cac proxy3: use fqdn in nagios
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-25 08:49:17 -08:00
Kevin Fenzi
29a4165b81 nagios: pagure/pagure-stg: adjust smtp ssl check to use external ips
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-08 13:33:03 -08:00
Kevin Fenzi
e2eeee78f2 nagios / pagure.io/stg.pagure.io: setup external hosts for these
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-08 12:46:13 -08:00
Kevin Fenzi
a7a060af87 nagios: use logging_rdu3 host group and drop non rdu3 duplicate group
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-08 11:48:31 -08:00
Kevin Fenzi
57a4b9da41 nagios: make log01 not monitor / and have a higher limit for /var/log
/ and /var/log are the same filesystem on log01, so it makes little
sense to monitor both. Just monitor /var/log and increase its limits.
We are going to archive things, but likely in January.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-04 14:23:11 -08:00
Kevin Fenzi
70c964ed9b pagure02: fare thee well.
We have moved to pagure01, retire pagure02

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-03 16:20:22 -08:00
Kevin Fenzi
91ca2a6bf3 pagure-stg01: say fare thee well
We have moved over to pagure-stg02 now in rdu3, so retire this vm.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-02 14:28:06 -08:00
Kevin Fenzi
293f68d9d7 nagios: disable notify-by-fedora-messaging contact since we removed the script call
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-11-11 15:26:13 -08:00
Anton Medvedev
ae53458feb removing fedora messaging notifications from nagios since it is not used and did not work
Signed-off-by: Anton Medvedev <amedvede@redhat.com>
2025-11-05 10:48:11 +00:00
Greg Sutcliffe
5c3f92cee6 Nagios: exclude RDU3 copr hosts from noc02's config
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-11-03 12:05:43 +00:00
Anton Medvedev
5c7984f0f8 Revert "ref: this is part of releng repository refactoring, which affects logger scripts used in this config fedora-messaging-logger"
This reverts commit 099cfd03ca.
2025-10-06 16:15:11 +00:00
Anton Medvedev
68d298720c ref: this is part of releng repository refactoring, which affects logger scripts used in this config fedora-messaging-logger
Signed-off-by: Anton Medvedev <amedvede@redhat.com>
2025-10-06 16:15:11 +00:00
Kevin Fenzi
3f5b2c4401 nagios / bvmhost-p10-mgmt: try and fix http exclusion
This isn't a group, it's just a group variable, so try and change the
conditional to match.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-09-28 09:51:02 -07:00
Greg Sutcliffe
b1d0f7c744 Nagios: remove datacenter key filtering
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-09-23 12:49:45 +01:00
Greg Sutcliffe
5339774faf Nagios: Revert change to staging template
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-09-23 11:30:31 +01:00
Greg Sutcliffe
049eca9a7f Fix Nagios checking of staging hosts
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-09-23 10:56:59 +01:00
Greg Sutcliffe
67f182ecf7 Nagios: revert multi-DC handling from 1531c45283
Signed-off-by: Greg Sutcliffe <github@emeraldreverie.org>
2025-09-19 19:14:07 +00:00
Kevin Fenzi
3ac263f576 nagios: drop old mdapi messages check
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-09-17 13:11:47 -07:00
Michal Konecny
d5f83a7272 [nagios] Use server checks on noc01
Just move datanommer check to server plugins, so it's the same as before.
2025-07-31 10:26:01 +02:00
Michal Konecny
e4afc6cf7a [nagios_server] Remove datanommer check
This check is already installed as part of nagios_client playbook. The
nagios_server role contained an old version which doesn't work anymore. Let's get rid
of it.
2025-07-31 10:02:02 +02:00
Kevin Fenzi
e36386c30d nagios: no nomail check at all externally
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-24 10:44:24 -07:00
Kevin Fenzi
9c1bb508b5 nagios: try a different way to not run mail_queue check externally
Revert the previous thing that tried to move a template to a file, and
instead move it back to a template and just conditionalize it only to
apply on rdu3_internal nagios.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-18 10:25:26 -07:00
Kevin Fenzi
fafc5b1baa Revert "nagios / external: make mail_queue internal only"
This reverts commit 84f03db63c.
2025-07-18 10:22:29 -07:00
Kevin Fenzi
168d030d9c nagios: turns out this newline is important syntax wise
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-18 09:34:01 -07:00
Kevin Fenzi
84f03db63c nagios / external: make mail_queue internal only
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 18:23:40 -07:00
Kevin Fenzi
f73944f190 nagios: try and adjust things so noc02 / nagios-external works again
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 17:23:22 -07:00
Greg Sutcliffe
0d71c0bce0 Nagios: remove http check on p10 mgmt interface
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-11 20:06:33 +00:00
Kevin Fenzi
70c633121e Add bodhi-backend01.stg and adjust value01
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 16:00:08 -07:00
Aurélien Bompard
cf00289c06 Add a Nagios check to monitor IPA ID ranges
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-09 17:27:19 +02:00
Kevin Fenzi
6d796a6fff basset: remove monitoring, we haven't deployed this in years
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-08 10:45:24 -07:00
Nils Philippsen
6c85fda0c9 Mass remove/replace iad2 -> rdu3, 10.3. -> 10.16.
Signed-off-by: Nils Philippsen <nils@redhat.com>
2025-07-03 20:05:02 +02:00
James Antill
b5338d9050 nagios: gateway-hosts: Add conditionals for nagios_location as rdu3.
Signed-off-by: James Antill <james@and.org>
2025-06-30 22:00:55 -04:00
James Antill
a960bf8e7c nrpe: Add 10.16.163.10 to allowed_hosts.
Signed-off-by: James Antill <james@and.org>
2025-06-30 20:59:28 -04:00
Kevin Fenzi
d5d7e4f606 nagios_server: try and simplify logic
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-26 19:00:00 -07:00
Kevin Fenzi
2d741d3a63 nagios_server: another cred fix
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-26 18:36:13 -07:00
Aurélien Bompard
f185573c41 Do stuff on iad2_internal also on rdu3_internal
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 19:02:44 +02:00
Aurélien Bompard
d22bde741d Nagios: template the mail_queue.cfg file
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 18:11:28 +02:00
Aurélien Bompard
0b7bab72e6 Nagios: filter the hostgroups again
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 17:37:32 +02:00
Aurélien Bompard
aefb2eb4bc Filter staging-hosts
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 14:14:29 +02:00
Aurélien Bompard
2c2c06bde0 Filter the mirrorlist-proxies services
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 14:05:33 +02:00
Aurélien Bompard
1531c45283 Try to filter the group contents instead of the group names
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 12:10:52 +02:00
Aurélien Bompard
d3246f3c64 Filter the other nagios templates
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 11:20:06 +02:00
Aurélien Bompard
72881d29d2 Filter the mincheckgrp hostgroup
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:43:38 +02:00
Aurélien Bompard
3ab4e21dbc Filter the no_ping group
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:40:34 +02:00
Aurélien Bompard
933060bc15 Don't change the template name, or it will be the name of the remote file
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:27:35 +02:00
Aurélien Bompard
9007df7619 Don't change the template name, or it will be the name of the remote file
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:27:03 +02:00
Aurélien Bompard
b8fea68959 Try to exclude rdu3 hosts from the iad2 nagios template
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 09:12:25 +02:00