Commit Graph

336 Commits

Author SHA1 Message Date
Kevin Fenzi
929f957cb2 nagios: fix check_disk on the client side of log01
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-08 12:29:33 -08:00
Kevin Fenzi
3ac263f576 nagios: drop old mdapi messages check
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-09-17 13:11:47 -07:00
Michal Konecny
d5f83a7272 [nagios] Use server checks on noc01
Just move datanommer check to server plugins, so it's the same as before.
2025-07-31 10:26:01 +02:00
Michal Konecny
31b05ab861 [nagios_client] Install the nagios script on noc
To get the datanommer checks working on noc01, we need to install the
nagios scripts on noc01 as well.
2025-07-28 13:00:14 +02:00
Michal Konecny
eb732ee1ac [nagios_client] Fix yamllint errors 2025-07-28 12:09:29 +02:00
Michal Konecny
88132737b5 [nagios_client] Fix ansible-lint errors 2025-07-28 11:58:02 +02:00
Michal Konecny
4670886484 [nagios_client] Install datanommer checks on noc01
https://pagure.io/fedora-infrastructure/issue/12402
2025-07-28 11:58:02 +02:00
Greg Sutcliffe
9f06fe9e9c Selinux: allow NRPE to follow symlinks when checking files
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-11 20:03:17 +00:00
Michal Konecny
7f7d887102 [nagios_client] Add missing nagios plugin
When looking at the mailman alert I found out that it's using
nagios-plugins-http, which is not installed by default. So let's add it
to list of installed packages.
2025-07-10 15:45:42 +02:00
Aurélien Bompard
fd48d02915 Improve the check_ipa_free_ids.py script to be able to set thresholds
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 11:53:25 +02:00
Aurélien Bompard
0c6153cebe Create an IPA service for monitoring and use it for check-ipa-free-ids
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 11:46:05 +02:00
Aurélien Bompard
5d1c0427c9 Nagios: use a keytab for check_ipa_free_ids
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 10:17:07 +02:00
Aurélien Bompard
cf00289c06 Add a Nagios check to monitor IPA ID ranges
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-09 17:27:19 +02:00
Kevin Fenzi
6d796a6fff basset: remove monitoring, we havent deployed this in years
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-08 10:45:24 -07:00
Nils Philippsen
6c85fda0c9 Mass remove/replace iad2 -> rdu3, 10.3. -> 10.16.
Signed-off-by: Nils Philippsen <nils@redhat.com>
2025-07-03 20:05:02 +02:00
Kevin Fenzi
6d0efee3cb nagios / client: allow noc01.rdu3 access to nrpe
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-22 09:01:18 -07:00
James Antill
e11f6f2262 Emergency fix for wiki install.
Signed-off-by: James Antill <james@and.org>
2025-05-22 18:11:09 -04:00
Kevin Fenzi
17fb0e324c rdu3: no openvpn server checks in rdu3 yet
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-05-21 16:52:33 -07:00
Kevin Fenzi
36417bc22a nagios_client: make sure the nrpe user is local
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-05-21 16:40:55 -07:00
Michal Konecny
6428f8f772 Sunset github2fedmsg and fedmsg
This commit is removing all the fedmsg related stuff from ansible
repository.

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2025-02-13 10:08:51 +00:00
Kevin Fenzi
22f3d8832f handlers: more renaming fixes
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-01-24 14:06:11 -08:00
Kevin Fenzi
d30b2ecfaa nagios_client: fix shell call to useradd
This is a actual shell var, not a call to the ansible shell plugin.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-01-15 17:08:56 -08:00
Ryan Lerch
47c68f478d ansiblelint fixes - fqcn[action-core] - template to ansible.builtin.template
Replaces references to template: with ansible.builtin.template

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:30:29 +10:00
Ryan Lerch
3c41882bb0 ansiblelint fixes - fqcn[action-core] - shell to ansible.builtin.shell
Replaces references to shell: with ansible.builtin.shell

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:29:10 +10:00
Ryan Lerch
25391e95b7 ansiblelint fixes - fqcn[action-core] - package to ansible.builtin.package
Replaces many references to  package: with ansible.builtin.package

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:28:00 +10:00
Ryan Lerch
462176464b ansiblelint fixes-- fqcn[action-core] - command to ansible.builtin.command
Replaces many references to  command: with ansible.builtin.command

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:26:47 +10:00
Ryan Lerch
6a3816dfdc ansiblelint fixes-- fqcn[action-core] - copy to ansible.builtin.copy
Replaces many references to 'copy' with ansible.builtin.copy

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 10:43:31 +10:00
Ryan Lerch
62952df107 ansiblelint fixes-- fqcn[action-core] - file to ansible.builtin.file
Replaces many references to  file: with ansible.builtin.file

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 10:41:52 +10:00
Ryan Lerch
691adee6ee Fix name[casing] ansible-lint issues
fix 1900 failures of the following case issue:

`name[casing]: All names should start with an uppercase letter.`

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-14 20:20:07 +10:00
Ryan Lerch
89f6f1fc32 Fix majority of remaining yamllint warnings and errors
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2024-11-28 17:31:45 +10:00
Kevin Fenzi
7f2b98cb6e basset: retire no longer deployed service
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-09-24 11:43:47 -07:00
Jiri Podivin
f513e7cbcd Linting python scripts
Signed-off-by: Jiri Podivin <jpodivin@redhat.com>
2024-09-18 19:57:29 +00:00
Kevin Fenzi
0dfa11a6eb fedimg: signing off...
Thanks for all the uploads fedimg.
You go to a far far better place I'm sure.

There's no point in keeping it around now, as it's actually not working
and the replacement ( cloud-image-uploader) should work soon.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-13 16:40:01 -07:00
Kevin Fenzi
9a444067cd noc-cc01: also install client checks
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 17:30:03 -07:00
Kevin Fenzi
289bda5698 nagios_client: install client on noc-cc01
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 16:34:40 -07:00
Michal Konecny
c8b62faaa4 [nagios_client] Fix for mailman api check
The mailman is now returning HTTP/1.1 instead of HTTP/1.0.

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2024-06-28 10:19:10 +02:00
James Antill
d7258e320e Add DNF countme nagios checks.
Signed-off-by: James Antill <james@and.org>
2024-06-27 17:35:23 +00:00
Kevin Fenzi
84a7a7afc8 nagios: adjust nrpe for badges vs old fedbadges
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-28 13:54:53 -07:00
Kevin Fenzi
d366194a22 module-build-service (mbs): retire service
With the EOL of Fedora 38 yesterday, we are no longer building any
modules and can retire our module build service.

Note that toddlers needs to be adjusted still, that will happen after
this.

Thanks for all the modules!

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-22 13:38:53 -07:00
Kevin Fenzi
c84b99223c osbs: raise a glass for it's service
This removes osbs and allmost all it's associated playbooks and files.

It served long and well, but we no longer need it.
flatpaks are building with a koji-flatpak plugin.
base/minimal/toolbox containers are building with kiwi.
We aren't building any other containers right now, and we did they could
be added to kiwi.

This is the end of an era... I look with nostolga on
ansible-ansible-openshift-ansible (a role to setup ansible on a control
host and run it from our ansible).

Good bye osbs!

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-03-28 12:52:07 -07:00
Seddik Alaoui Ismaili
c05bcd289f remove pynag from check ipa replica 2024-02-27 13:16:46 +00:00
Kevin Fenzi
a60ca7159f nuancier: retire and remove from ansible
See https://pagure.io/fedora-infrastructure/issue/11371
This service is retired.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-11-15 10:44:00 -08:00
Andrew Heath
96d9ed3d6b Adding more checks for the fedmsg socket 2023-08-16 14:08:16 -04:00
Kevin Fenzi
0066f3cc68 proxies / fedmsg_monitoring: revert part of last config change
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-08-15 14:37:27 -07:00
Kevin Fenzi
1c0516c831 nagios_client: adjust fedmsg monitoring
Copy the fixes from exceptions monitoring to backlog.
Fix the calls that were passing a trailing - which isn't needed anymore.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-08-15 14:25:56 -07:00
Andrew Heath
c5daa84f53 Have script check for fedmsg socket 2023-08-15 21:18:18 +00:00
Kevin Fenzi
22dde8163b unbound: remove and retire unbound servers
These instances served long and well as fallback resolvers for
dnssec-trigger. This is no longer needed or used, so lets remove them.
See https://pagure.io/fedora-infrastructure/issue/11415

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-07-24 14:40:43 -07:00
Pavel Raiskup
0944ac4ef3 copr-dist-git: decrease storage warning quota
With 5T storage, it is enough to warn on remaining 12%, and error on 6%.
2023-07-24 07:14:16 +02:00
Kevin Fenzi
314fa870a9 notifs-backend: fix check script and increase limits
The check_rabbitmq_size script seems to have critical and warning
backwards and is doing str comparisons when int should be used.
Also increase the limits a bunch as we don't want to be notified if it's
just backloged a bit.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-07-10 15:00:04 -07:00
Andrew Heath
a4ca219ed9 updating to point at monitoring-fedmsg-hub-3.socke 2023-05-30 15:49:45 -04:00