Kevin Fenzi
929f957cb2
nagios: fix check_disk on the client side of log01
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-12-08 12:29:33 -08:00
Kevin Fenzi
3ac263f576
nagios: drop old mdapi messages check
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-09-17 13:11:47 -07:00
Michal Konecny
d5f83a7272
[nagios] Use server checks on noc01
...
Just move datanommer check to server plugins, so it's the same as before.
2025-07-31 10:26:01 +02:00
Michal Konecny
31b05ab861
[nagios_client] Install the nagios script on noc
...
To get the datanommer checks working on noc01, we need to install the
nagios scripts on noc01 as well.
2025-07-28 13:00:14 +02:00
Michal Konecny
eb732ee1ac
[nagios_client] Fix yamllint errors
2025-07-28 12:09:29 +02:00
Michal Konecny
88132737b5
[nagios_client] Fix ansible-lint errors
2025-07-28 11:58:02 +02:00
Michal Konecny
4670886484
[nagios_client] Install datanommer checks on noc01
...
https://pagure.io/fedora-infrastructure/issue/12402
2025-07-28 11:58:02 +02:00
Greg Sutcliffe
9f06fe9e9c
Selinux: allow NRPE to follow symlinks when checking files
...
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-11 20:03:17 +00:00
Michal Konecny
7f7d887102
[nagios_client] Add missing nagios plugin
...
When looking at the mailman alert I found out that it's using
nagios-plugins-http, which is not installed by default. So let's add it
to the list of installed packages.
2025-07-10 15:45:42 +02:00
Aurélien Bompard
fd48d02915
Improve the check_ipa_free_ids.py script to be able to set thresholds
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 11:53:25 +02:00
Aurélien Bompard
0c6153cebe
Create an IPA service for monitoring and use it for check-ipa-free-ids
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 11:46:05 +02:00
Aurélien Bompard
5d1c0427c9
Nagios: use a keytab for check_ipa_free_ids
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 10:17:07 +02:00
Aurélien Bompard
cf00289c06
Add a Nagios check to monitor IPA ID ranges
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-09 17:27:19 +02:00
Kevin Fenzi
6d796a6fff
basset: remove monitoring, we haven't deployed this in years
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-08 10:45:24 -07:00
Nils Philippsen
6c85fda0c9
Mass remove/replace iad2 -> rdu3, 10.3. -> 10.16.
...
Signed-off-by: Nils Philippsen <nils@redhat.com>
2025-07-03 20:05:02 +02:00
Kevin Fenzi
6d0efee3cb
nagios / client: allow noc01.rdu3 access to nrpe
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-22 09:01:18 -07:00
James Antill
e11f6f2262
Emergency fix for wiki install.
...
Signed-off-by: James Antill <james@and.org>
2025-05-22 18:11:09 -04:00
Kevin Fenzi
17fb0e324c
rdu3: no openvpn server checks in rdu3 yet
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-05-21 16:52:33 -07:00
Kevin Fenzi
36417bc22a
nagios_client: make sure the nrpe user is local
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-05-21 16:40:55 -07:00
Michal Konecny
6428f8f772
Sunset github2fedmsg and fedmsg
...
This commit removes all the fedmsg-related stuff from the ansible
repository.
Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2025-02-13 10:08:51 +00:00
Kevin Fenzi
22f3d8832f
handlers: more renaming fixes
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-01-24 14:06:11 -08:00
Kevin Fenzi
d30b2ecfaa
nagios_client: fix shell call to useradd
...
This is an actual shell var, not a call to the ansible shell plugin.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-01-15 17:08:56 -08:00
Ryan Lerch
47c68f478d
ansiblelint fixes - fqcn[action-core] - template to ansible.builtin.template
...
Replaces references to template: with ansible.builtin.template
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:30:29 +10:00
Ryan Lerch
3c41882bb0
ansiblelint fixes - fqcn[action-core] - shell to ansible.builtin.shell
...
Replaces references to shell: with ansible.builtin.shell
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:29:10 +10:00
Ryan Lerch
25391e95b7
ansiblelint fixes - fqcn[action-core] - package to ansible.builtin.package
...
Replaces many references to package: with ansible.builtin.package
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:28:00 +10:00
Ryan Lerch
462176464b
ansiblelint fixes-- fqcn[action-core] - command to ansible.builtin.command
...
Replaces many references to command: with ansible.builtin.command
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:26:47 +10:00
Ryan Lerch
6a3816dfdc
ansiblelint fixes-- fqcn[action-core] - copy to ansible.builtin.copy
...
Replaces many references to 'copy' with ansible.builtin.copy
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 10:43:31 +10:00
Ryan Lerch
62952df107
ansiblelint fixes-- fqcn[action-core] - file to ansible.builtin.file
...
Replaces many references to file: with ansible.builtin.file
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 10:41:52 +10:00
Ryan Lerch
691adee6ee
Fix name[casing] ansible-lint issues
...
fix 1900 failures of the following case issue:
`name[casing]: All names should start with an uppercase letter.`
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-14 20:20:07 +10:00
Ryan Lerch
89f6f1fc32
Fix majority of remaining yamllint warnings and errors
...
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2024-11-28 17:31:45 +10:00
Kevin Fenzi
7f2b98cb6e
basset: retire no longer deployed service
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-09-24 11:43:47 -07:00
Jiri Podivin
f513e7cbcd
Linting python scripts
...
Signed-off-by: Jiri Podivin <jpodivin@redhat.com>
2024-09-18 19:57:29 +00:00
Kevin Fenzi
0dfa11a6eb
fedimg: signing off...
...
Thanks for all the uploads fedimg.
You go to a far far better place I'm sure.
There's no point in keeping it around now, as it's actually not working
and the replacement (cloud-image-uploader) should work soon.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-13 16:40:01 -07:00
Kevin Fenzi
9a444067cd
noc-cc01: also install client checks
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 17:30:03 -07:00
Kevin Fenzi
289bda5698
nagios_client: install client on noc-cc01
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 16:34:40 -07:00
Michal Konecny
c8b62faaa4
[nagios_client] Fix for mailman api check
...
The mailman is now returning HTTP/1.1 instead of HTTP/1.0.
Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2024-06-28 10:19:10 +02:00
James Antill
d7258e320e
Add DNF countme nagios checks.
...
Signed-off-by: James Antill <james@and.org>
2024-06-27 17:35:23 +00:00
Kevin Fenzi
84a7a7afc8
nagios: adjust nrpe for badges vs old fedbadges
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-28 13:54:53 -07:00
Kevin Fenzi
d366194a22
module-build-service (mbs): retire service
...
With the EOL of Fedora 38 yesterday, we are no longer building any
modules and can retire our module build service.
Note that toddlers needs to be adjusted still, that will happen after
this.
Thanks for all the modules!
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-22 13:38:53 -07:00
Kevin Fenzi
c84b99223c
osbs: raise a glass for its service
...
This removes osbs and almost all its associated playbooks and files.
It served long and well, but we no longer need it.
flatpaks are building with a koji-flatpak plugin.
base/minimal/toolbox containers are building with kiwi.
We aren't building any other containers right now, and if we did they could
be added to kiwi.
This is the end of an era... I look with nostalgia on
ansible-ansible-openshift-ansible (a role to setup ansible on a control
host and run it from our ansible).
Good bye osbs!
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-03-28 12:52:07 -07:00
Seddik Alaoui Ismaili
c05bcd289f
remove pynag from check ipa replica
2024-02-27 13:16:46 +00:00
Kevin Fenzi
a60ca7159f
nuancier: retire and remove from ansible
...
See https://pagure.io/fedora-infrastructure/issue/11371
This service is retired.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-11-15 10:44:00 -08:00
Andrew Heath
96d9ed3d6b
Adding more checks for the fedmsg socket
2023-08-16 14:08:16 -04:00
Kevin Fenzi
0066f3cc68
proxies / fedmsg_monitoring: revert part of last config change
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-08-15 14:37:27 -07:00
Kevin Fenzi
1c0516c831
nagios_client: adjust fedmsg monitoring
...
Copy the fixes from exceptions monitoring to backlog.
Fix the calls that were passing a trailing - which isn't needed anymore.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-08-15 14:25:56 -07:00
Andrew Heath
c5daa84f53
Have script check for fedmsg socket
2023-08-15 21:18:18 +00:00
Kevin Fenzi
22dde8163b
unbound: remove and retire unbound servers
...
These instances served long and well as fallback resolvers for
dnssec-trigger. This is no longer needed or used, so let's remove them.
See https://pagure.io/fedora-infrastructure/issue/11415
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-07-24 14:40:43 -07:00
Pavel Raiskup
0944ac4ef3
copr-dist-git: decrease storage warning quota
...
With 5T storage, it is enough to warn on remaining 12%, and error on 6%.
2023-07-24 07:14:16 +02:00
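As a rough illustration of the thresholds in the commit above (warn when 12% of storage remains, error at 6%), here is a minimal sketch. It is not the actual copr-dist-git check configuration; the function name `classify_free_space` and the constants are hypothetical. On 5T of storage, 12% is roughly 600G free and 6% is roughly 300G free.

```python
# Hypothetical sketch, not the real copr-dist-git check.
# Nagios plugin exit codes: 0 = OK, 1 = WARNING, 2 = CRITICAL.
OK, WARNING, CRITICAL = 0, 1, 2


def classify_free_space(free_bytes, total_bytes, warn_pct=12.0, crit_pct=6.0):
    """Map the remaining free space (as a percentage) to a Nagios status."""
    free_pct = 100.0 * free_bytes / total_bytes
    # Test the more severe threshold first.
    if free_pct < crit_pct:
        return CRITICAL
    if free_pct < warn_pct:
        return WARNING
    return OK


if __name__ == "__main__":
    total = 5 * 10**12  # 5T of storage, as in the commit message
    print(classify_free_space(500 * 10**9, total))  # 10% free
```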
Kevin Fenzi
314fa870a9
notifs-backend: fix check script and increase limits
...
The check_rabbitmq_size script seems to have critical and warning
backwards and is doing str comparisons when int should be used.
Also increase the limits a bunch as we don't want to be notified if it's
just backlogged a bit.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-07-10 15:00:04 -07:00
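The two bugs named in the commit above (warning and critical swapped, and string comparison where integers were intended) can be illustrated with a minimal sketch. This is not the actual check_rabbitmq_size script; the function name `classify_queue_size` and the threshold values are hypothetical.

```python
# Hypothetical sketch, not the real check_rabbitmq_size script.
# Nagios plugin exit codes: 0 = OK, 1 = WARNING, 2 = CRITICAL.
OK, WARNING, CRITICAL = 0, 1, 2


def classify_queue_size(size, warning, critical):
    """Return a Nagios status for a queue size.

    Values may arrive as strings (for example from sys.argv), so convert
    them to int first: compared as strings, "9" > "10000" is True because
    the comparison is lexicographic.
    """
    size, warning, critical = int(size), int(warning), int(critical)
    # Test the more severe threshold first so a critical value is not
    # reported as a mere warning.
    if size >= critical:
        return CRITICAL
    if size >= warning:
        return WARNING
    return OK


if __name__ == "__main__":
    # Illustrative thresholds only.
    print(classify_queue_size("9", "10000", "100000"))
    # The string-comparison pitfall the int() conversion avoids:
    print("9" > "10000")
```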
Andrew Heath
a4ca219ed9
updating to point at monitoring-fedmsg-hub-3.socke
2023-05-30 15:49:45 -04:00