Kevin Fenzi
a754144f19
Update infra pagure.io links to forge.fp.o (WIP)
...
This should update all the references we have to
https://pagure.io/fedora-infrastructure to the
new https://forge.fedoraproject.org/infra/tickets/ area.
Do not merge this before the migration on Tuesday.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2026-01-20 14:39:40 -08:00
Greg Sutcliffe
4b8780246b
Nagios: another batch of services moved to Zabbix
...
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2026-01-15 15:43:45 +00:00
Greg Sutcliffe
93ed0457e0
Nagios: remove first batch of services
...
This removes the known-good things we've had in Zabbix for a while -
RAID, disk space, processes, and mail queue. It also removes swap which
we've decided we don't need.
Also includes some FS overrides on the Zabbix side so the relevant
NFS mounts get monitored on the OCI and pkgs hosts, as they were in Nagios.
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2026-01-14 10:25:15 +00:00
Kevin Fenzi
5dee660cac
proxy3: use fqdn in nagios
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-25 08:49:17 -08:00
Kevin Fenzi
29a4165b81
nagios: pagure/pagure-stg: adjust smtp ssl check to use external ips
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-08 13:33:03 -08:00
Kevin Fenzi
e2eeee78f2
nagios / pagure.io/stg.pagure.io: setup external hosts for these
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-08 12:46:13 -08:00
Kevin Fenzi
a7a060af87
nagios: use logging_rdu3 host group and drop non rdu3 duplicate group
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-08 11:48:31 -08:00
Kevin Fenzi
57a4b9da41
nagios: make log01 not monitor / and have a higher limit for /var/log
...
/ and /var/log are the same filesystem on log01, so it makes little
sense to monitor both. Just monitor /var/log and increase its limits.
We are going to archive things, but likely in January.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-04 14:23:11 -08:00
Kevin Fenzi
70c964ed9b
pagure02: fare thee well.
...
We have moved to pagure01, retire pagure02
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-03 16:20:22 -08:00
Kevin Fenzi
91ca2a6bf3
pagure-stg01: say fare thee well
...
We have moved over to pagure-stg02 now in rdu3, so retire this vm.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-02 14:28:06 -08:00
Kevin Fenzi
293f68d9d7
nagios: disable notify-by-fedora-messaging contact since we removed the script call
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-11-11 15:26:13 -08:00
Anton Medvedev
ae53458feb
Remove fedora-messaging notifications from nagios since they are not used and did not work
...
Signed-off-by: Anton Medvedev <amedvede@redhat.com>
2025-11-05 10:48:11 +00:00
Greg Sutcliffe
5c3f92cee6
Nagios: exclude RDU3 copr hosts from noc02's config
...
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-11-03 12:05:43 +00:00
Anton Medvedev
5c7984f0f8
Revert "ref: this is part of releng repository refactoring, which affects logger scripts used in this config fedora-messaging-logger"
...
This reverts commit 099cfd03ca.
2025-10-06 16:15:11 +00:00
Anton Medvedev
68d298720c
ref: this is part of releng repository refactoring, which affects logger scripts used in this config fedora-messaging-logger
...
Signed-off-by: Anton Medvedev <amedvede@redhat.com>
2025-10-06 16:15:11 +00:00
Kevin Fenzi
3f5b2c4401
nagios / bvmhost-p10-mgmt: try and fix http exclusion
...
This isn't a group, it's just a group variable, so try and change the
conditional to match.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-09-28 09:51:02 -07:00
Greg Sutcliffe
b1d0f7c744
Nagios: remove datacenter key filtering
...
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-09-23 12:49:45 +01:00
Greg Sutcliffe
5339774faf
Nagios: Revert change to staging template
...
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-09-23 11:30:31 +01:00
Greg Sutcliffe
049eca9a7f
Fix Nagios checking of staging hosts
...
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-09-23 10:56:59 +01:00
Greg Sutcliffe
67f182ecf7
Nagios: revert multi-DC handling from 1531c45283
...
Signed-off-by: Greg Sutcliffe <github@emeraldreverie.org>
2025-09-19 19:14:07 +00:00
Kevin Fenzi
3ac263f576
nagios: drop old mdapi messages check
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-09-17 13:11:47 -07:00
Michal Konecny
d5f83a7272
[nagios] Use server checks on noc01
...
Just move datanommer check to server plugins, so it's the same as before.
2025-07-31 10:26:01 +02:00
Michal Konecny
e4afc6cf7a
[nagios_server] Remove datanommer check
...
This check is already installed as part of the nagios_client playbook. The
nagios_server role contained an old version which doesn't work anymore. Let's get rid
of it.
2025-07-31 10:02:02 +02:00
Kevin Fenzi
e36386c30d
nagios: no nomail check at all externally
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-24 10:44:24 -07:00
Kevin Fenzi
9c1bb508b5
nagios: try a different way to not run mail_queue check externally
...
Revert the previous thing that tried to move a template to a file, and
instead move it back to a template and just conditionalize it only to
apply on rdu3_internal nagios.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-18 10:25:26 -07:00
Kevin Fenzi
fafc5b1baa
Revert "nagios / external: make mail_queue internal only"
...
This reverts commit 84f03db63c.
2025-07-18 10:22:29 -07:00
Kevin Fenzi
168d030d9c
nagios: turns out this newline is important syntax wise
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-18 09:34:01 -07:00
Kevin Fenzi
84f03db63c
nagios / external: make mail_queue internal only
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 18:23:40 -07:00
Kevin Fenzi
f73944f190
nagios: try and adjust things so noc02 / nagios-external works again
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 17:23:22 -07:00
Greg Sutcliffe
0d71c0bce0
Nagios: remove http check on p10 mgmt interface
...
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-11 20:06:33 +00:00
Kevin Fenzi
70c633121e
Add bodhi-backend01.stg and adjust value01
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 16:00:08 -07:00
Aurélien Bompard
cf00289c06
Add a Nagios check to monitor IPA ID ranges
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-09 17:27:19 +02:00
Kevin Fenzi
6d796a6fff
basset: remove monitoring, we haven't deployed this in years
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-08 10:45:24 -07:00
Nils Philippsen
6c85fda0c9
Mass remove/replace iad2 -> rdu3, 10.3. -> 10.16.
...
Signed-off-by: Nils Philippsen <nils@redhat.com>
2025-07-03 20:05:02 +02:00
James Antill
b5338d9050
nagios: gateway-hosts: Add conditionals for nagios_location as rdu3.
...
Signed-off-by: James Antill <james@and.org>
2025-06-30 22:00:55 -04:00
James Antill
a960bf8e7c
nrpe: Add 10.16.163.10 to allowed_hosts.
...
Signed-off-by: James Antill <james@and.org>
2025-06-30 20:59:28 -04:00
Kevin Fenzi
d5d7e4f606
nagios_server: try and simplify logic
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-26 19:00:00 -07:00
Kevin Fenzi
2d741d3a63
nagios_server: another cred fix
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-26 18:36:13 -07:00
Aurélien Bompard
f185573c41
Do stuff on iad2_internal also on rdu3_internal
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 19:02:44 +02:00
Aurélien Bompard
d22bde741d
Nagios: template the mail_queue.cfg file
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 18:11:28 +02:00
Aurélien Bompard
0b7bab72e6
Nagios: filter the hostgroups again
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 17:37:32 +02:00
Aurélien Bompard
aefb2eb4bc
Filter staging-hosts
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 14:14:29 +02:00
Aurélien Bompard
2c2c06bde0
Filter the mirrorlist-proxies services
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 14:05:33 +02:00
Aurélien Bompard
1531c45283
Try to filter the group contents instead of the group names
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 12:10:52 +02:00
Aurélien Bompard
d3246f3c64
Filter the other nagios templates
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 11:20:06 +02:00
Aurélien Bompard
72881d29d2
Filter the mincheckgrp hostgroup
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:43:38 +02:00
Aurélien Bompard
3ab4e21dbc
Filter the no_ping group
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:40:34 +02:00
Aurélien Bompard
933060bc15
Don't change the template name, or it will be the name of the remote file
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:27:35 +02:00
Aurélien Bompard
9007df7619
Don't change the template name, or it will be the name of the remote file
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:27:03 +02:00
Aurélien Bompard
b8fea68959
Try to exclude rdu3 hosts from the iad2 nagios template
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 09:12:25 +02:00