Commit Graph

641 Commits

Author SHA1 Message Date
Michal Konecny
d5f83a7272 [nagios] Use server checks on noc01
Just move datanommer check to server plugins, so it's the same as before.
2025-07-31 10:26:01 +02:00
Michal Konecny
e4afc6cf7a [nagios_server] Remove datanommer check
This check is already installed as part of the nagios_client playbook. The
nagios_server role contained an old version which doesn't work anymore. Let's get rid
of it.
2025-07-31 10:02:02 +02:00
Kevin Fenzi
70c633121e Add bodhi-backend01.stg and adjust value01
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 16:00:08 -07:00
Kevin Fenzi
6d796a6fff basset: remove monitoring, we haven't deployed this in years
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-08 10:45:24 -07:00
Nils Philippsen
6c85fda0c9 Mass remove/replace iad2 -> rdu3, 10.3. -> 10.16.
Signed-off-by: Nils Philippsen <nils@redhat.com>
2025-07-03 20:05:02 +02:00
Aurélien Bompard
d22bde741d Nagios: template the mail_queue.cfg file
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 18:11:28 +02:00
Michal Konecny
f63e839698 [nagios-server] Move the datanommer checks to noc01
There were a few fedora-messaging datanommer checks that were running on
busgateway01. As this machine is part of fedmsg, it will be
decommissioned. Let's move the checks to noc01.

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2025-02-14 09:45:39 +00:00
Michal Konecny
6428f8f772 Sunset github2fedmsg and fedmsg
This commit removes all the fedmsg-related content from the ansible
repository.

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2025-02-13 10:08:51 +00:00
Nick Bebout
cdb7471dfe Remove codeblock (relrod) from nagios
2025-02-11 18:39:05 -06:00
Jiri Podivin
f513e7cbcd Linting python scripts
Signed-off-by: Jiri Podivin <jpodivin@redhat.com>
2024-09-18 19:57:29 +00:00
Kevin Fenzi
0dfa11a6eb fedimg: signing off...
Thanks for all the uploads fedimg.
You go to a far far better place I'm sure.

There's no point in keeping it around now, as it's actually not working
and the replacement (cloud-image-uploader) should work soon.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-13 16:40:01 -07:00
Kevin Fenzi
d6ecf4c07d virthost-cc-rdu02/rhel7 becomes vmhost-x86-cc02/rhel9
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-02 11:53:18 -07:00
Kevin Fenzi
4bcbc54efa people: retire people02
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-06-27 15:38:03 -07:00
James Antill
d7258e320e Add DNF countme nagios checks.
Signed-off-by: James Antill <james@and.org>
2024-06-27 17:35:23 +00:00
Kevin Fenzi
71d5c496d4 nagios: fix badges monitoring check in nagios
This changed from 'fedbadges' to 'badges'.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-28 13:07:21 -07:00
Kevin Fenzi
d366194a22 module-build-service (mbs): retire service
With the EOL of Fedora 38 yesterday, we are no longer building any
modules and can retire our module build service.

Note that toddlers needs to be adjusted still, that will happen after
this.

Thanks for all the modules!

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-22 13:38:53 -07:00
Kevin Fenzi
ce72533001 nagios / badges: remove old fedmsg checks
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-06 13:11:59 -07:00
Kevin Fenzi
c84b99223c osbs: raise a glass for its service
This removes osbs and almost all its associated playbooks and files.

It served long and well, but we no longer need it.
flatpaks are building with a koji-flatpak plugin.
base/minimal/toolbox containers are building with kiwi.
We aren't building any other containers right now, and if we did they could
be added to kiwi.

This is the end of an era... I look with nostalgia on
ansible-ansible-openshift-ansible (a role to setup ansible on a control
host and run it from our ansible).

Good bye osbs!

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-03-28 12:52:07 -07:00
Kevin Fenzi
f95712d8a0 nagios / koji: drop ssl cert check
This check was from long ago when koji used a self-signed cert/CA.
It still amusingly has that configured, so this check is telling us that
the self-signed cert that we don't use anymore is expiring. :)
So, just drop this; koji is being proxied now and uses our main wildcard
cert.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-02-13 10:13:48 -08:00
Leo Puvilland
172a57c0cf nagios: remove serviceackauthor from host notifications
Signed-off-by: Leo Puvilland <leo@craftcat.dev>
2024-01-24 03:34:52 +00:00
Leo Puvilland
c2b5cf45ac Switch to SERVICESTATE instead of HOSTSTATE in notify.cfg
Signed-off-by: Leo Puvilland <leo@craftcat.dev>
2024-01-08 21:59:13 +00:00
Leo Puvilland
00d82f8610 Add matrix-bot to ircbot contactgroup
Signed-off-by: Leo Puvilland <leo@craftcat.dev>
2023-12-20 15:35:19 -08:00
Leo Puvilland
5aafc6a1d2 Move nagios notifications to Matrix
Signed-off-by: Leo Puvilland <leo@craftcat.dev>
2023-12-18 15:55:30 -08:00
Kevin Fenzi
2524e7c258 nagios: stop trying to monitor start.fedoraproject.org, as it's now under fedoraproject.org/start
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-11-30 14:52:03 -08:00
Kevin Fenzi
20dc948173 notifs (old fmn): retire
We are retiring this in favor of the new service.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-11-15 12:28:28 -08:00
Kevin Fenzi
3808d867de value01/value01.stg: retire
These are old rhel7 instances. The only thing left on them is fedmsg-irc
(sending to one irc channel, fedora-releng). Move everything to use the
newer rhel8 value02 instead.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-11-15 12:13:38 -08:00
Pavel Raiskup
8e6de8396e nagios: send notifications to copr-team@redhat.com
Instead of separate members.  This is just to align with:
https://accounts.fedoraproject.org/group/copr-sig/
2023-11-13 15:32:26 +01:00
Kevin Fenzi
f0e6442a27 noc: drop bodhi nagios alert group
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-09-21 14:01:18 -07:00
Pavel Raiskup
fdb5bc033e nagios_server: add Jiří Kyjovský as a point of contact
2023-09-08 08:08:03 +02:00
Adam Williamson
8286b8f6c8 Port check_nagios_notifications.py to Python 3
Saw from one of the emails this morning that this isn't running
because there's no python2 on whatever system it was trying to
run on. This ports it to Python 3 (thanks, 2to3) and cleans up
the formatting (thanks, black). I tested it with a random sample
file I found lying around the internet -
https://github.com/bahamas10/node-nagios-status-parser/blob/master/status.dat
and it seems to do what it's supposed to do.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-08-14 08:58:54 -07:00
Kevin Fenzi
22dde8163b unbound: remove and retire unbound servers
These instances served long and well as fallback resolvers for
dnssec-trigger. This is no longer needed or used, so let's remove them.
See https://pagure.io/fedora-infrastructure/issue/11415

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-07-24 14:40:43 -07:00
Stephen Smoogen
7d7d0bf0a8 Remove smooge from various aliases
Currently, I (Stephen Smoogen) do not have the time to work on Fedora
system administration items. However, I get a lot of email and people
see my email address in various places to ping me for working on
things. I feel it would be better to remove myself from those places
and let Fedora Infrastructure add someone else to replace me when it
is possible to do so.

Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2023-07-17 23:34:18 +00:00
Andrew Heath
9121258f52 reenable ansible nagios busgateway01 checks
2023-05-23 12:13:31 -04:00
Andrew Heath
9d3c107ef0 Disabling ansible check till we can troubleshoot
2023-05-19 20:07:41 +00:00
Andrew Heath
3600553301 removing nommer and fixing RPM sign
2023-05-19 20:07:41 +00:00
Kevin Fenzi
624f7545f0 Fare thee well 32bit arm. You served long and well.
Now that f36 is eol we don't need 32bit arm builders, test machines or
exceptions anywhere.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-05-16 17:05:14 -07:00
Pavel Raiskup
cb87003edc nagios_external: align icmp6 check with 5adeb88890
2023-04-26 09:24:45 +02:00
Andrew Heath
1bbd805e17 Remove remaining greenwave checks for busgateway
2023-04-11 17:32:49 +00:00
Kevin Fenzi
c69fbc43f7 nagios: add monitoring for stg pagure ssl cert
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-03-28 09:13:54 -07:00
Pavel Raiskup
92cab5ca01 copr-distgit: disable cgit check from FE
Relates: https://github.com/fedora-copr/copr/issues/2410
2023-03-21 09:55:48 +01:00
Kevin Fenzi
b981ac9c8f nagios / server: check fedorapeople.org ssl cert expiry
This is for https://pagure.io/fedora-infrastructure/issue/10928

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-03-15 13:15:18 -07:00
Stephen Smoogen
6a6f5c0c75 Fix hostname in ping6-ipv6
A host_name in a nagios directive must match one which is defined
elsewhere in the hosts tree. For this case we needed to use the
host_name noc02-ipv6.fedoraproject.org to match what was in the ipv6
namespace.

Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2022-11-18 15:11:38 -05:00
Kevin Fenzi
71cdddf55b nagios: move the ipv6 specific ping config to a ping-ipv6.cfg file
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-11-17 16:39:11 -08:00
Kevin Fenzi
b9b35a09ed nagios: move ping.cfg to a template so it works for both nagios servers
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-11-17 16:19:50 -08:00
Stephen Smoogen
4fe28d9291 do not put jinja2 template items into a static file
2022-11-17 16:00:22 -05:00
Stephen Smoogen
e6b3fb1904 Make it so that ipv6 is checked on hosts
2022-11-17 15:55:53 -05:00
Stephen Smoogen
e36f982263 This should allow ansible to correctly build the templates for noc01/noc02.
2022-11-17 12:06:00 -05:00
Seddik Alaoui Ismaili
9af427e1bf add ipv6 check for fedorapeople
2022-11-17 01:40:25 +00:00
Kevin Fenzi
b97e20c3d8 nagios: add check for ocp api ssl cert
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-07-28 17:19:04 -07:00
Miroslav Suchý
b5c09240f1 remove schlupova
Silvie left the team and RH
2022-06-01 11:08:14 +02:00