Commit Graph

1125 Commits

Author SHA1 Message Date
Aurélien Bompard
f185573c41 Do stuff on iad2_internal also on rdu3_internal
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 19:02:44 +02:00
Aurélien Bompard
d22bde741d Nagios: template the mail_queue.cfg file
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 18:11:28 +02:00
Aurélien Bompard
0b7bab72e6 Nagios: filter the hostgroups again
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 17:37:32 +02:00
Aurélien Bompard
aefb2eb4bc Filter staging-hosts
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 14:14:29 +02:00
Aurélien Bompard
2c2c06bde0 Filter the mirrorlist-proxies services
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 14:05:33 +02:00
Aurélien Bompard
1531c45283 Try to filter the group contents instead of the group names
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 12:10:52 +02:00
Aurélien Bompard
d3246f3c64 Filter the other nagios templates
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 11:20:06 +02:00
Aurélien Bompard
72881d29d2 Filter the mincheckgrp hostgroup
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:43:38 +02:00
Aurélien Bompard
3ab4e21dbc Filter the no_ping group
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:40:34 +02:00
Aurélien Bompard
933060bc15 Don't change the template name, or it will be the name of the remote file
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:27:35 +02:00
Aurélien Bompard
9007df7619 Don't change the template name, or it will be the name of the remote file
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 10:27:03 +02:00
Aurélien Bompard
b8fea68959 Try to exclude rdu3 hosts from the iad2 nagios template
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-06-23 09:12:25 +02:00
Kevin Fenzi
3883da6d3d nagios / server: template oneproxy for iad2/rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-21 12:15:04 -07:00
Kevin Fenzi
aeaa0811c4 nagios: fix task to match the real template name
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-21 11:45:49 -07:00
Kevin Fenzi
ad3533e506 nagios: try and split out all hostgroups into _iad2 and _rdu3
We want to monitor iad2 from noc01.iad2 and rdu3 from noc01.rdu3, so
try and split this out into seperate all groups for each datacenter.
This will likely miss some things that aren't split out into seperate
_iad2 and _rdu3 groups, but we can hopefully fix those.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-21 11:26:38 -07:00
Kevin Fenzi
7113cce4ec nagios_server: fix missing = in when
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-21 11:06:35 -07:00
Kevin Fenzi
3be2d89e66 nagios: also add these templates in rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-20 22:34:45 -07:00
Kevin Fenzi
4ca8fa862c nagios: adjust when clause for rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-20 22:11:57 -07:00
Kevin Fenzi
a42481a782 nagios/rdu3: need templates and other config in rdu3 also
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-20 21:49:17 -07:00
Kevin Fenzi
437479c896 nagios-rdu3: drop limited mgmt group in rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-20 21:28:57 -07:00
Kevin Fenzi
449385c8b0 nagios: move rdu3 hosts over to noc01.rdu3
Also open firewalls to allow noc03.rdu3 to access them.
Also enable nagios_server on noc01.rdu3.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-20 20:29:24 -07:00
Kevin Fenzi
3edae8484b nagios: iad2 noc01 can treat the rdu3 internal gw as rdu3-gw
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-20 19:51:07 -07:00
Kevin Fenzi
2b3441492a nagios: add rdu3-hosts template to be deployed
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-06-20 18:05:31 -07:00
Kevin Fenzi
8ffd2aef29 nagios: update gateways for iad2/rdu3, they need to be one hop up from the actual external ip
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-05-30 16:45:19 -07:00
Mark Rosenbaum
7a10ef14d3 Added Nagios rdu3 configs 2025-05-30 17:24:36 +00:00
Kevin Fenzi
91e9a5627d httpd / botblocking: fix syntax on bot rewrite
These have to be in "s in order to do a string comparison, since
they were not, they were never matching anything. ;(

Fix them all up, and also block a few more repos on pagure that are
getting heavily crawled.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-05-13 11:39:25 -07:00
Greg Sutcliffe
9f431805ec nagios: Update authorized user lists 2025-03-26 21:16:13 +00:00
Kevin Fenzi
0a986e4f7e nagios / registry: check registry via the actual registry instead of the web page
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-02-17 12:19:30 -08:00
Michal Konecny
f63e839698 [nagios-server] Move the datanommer checks to noc01
There were few fedora-messaging datanommer checks that were running on
busgateway01. As this machine is part of fedmsg it will be
decommissioned. Let's move the checks to noc01.

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2025-02-14 09:45:39 +00:00
Michal Konecny
6428f8f772 Sunset github2fedmsg and fedmsg
This commit is removing all the fedmsg related stuff from ansible
repository.

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2025-02-13 10:08:51 +00:00
Nick Bebout
cdb7471dfe Remove codeblock (relrod) from nagios 2025-02-11 18:39:05 -06:00
Michal Konecny
2ec055db6f Use first uppercase letter for all handlers
This will unify all the handlers to use first uppercase letter for
ansible-lint to stop complaining.

I went through all `notify:` occurrences and fixed them by running
```
set TEXT "text_to_replace"; set REPLACEMENT "replacement_text"; git grep
-rlz "$TEXT" . | xargs -0 sed -i "s/$TEXT/$REPLACEMENT/g"
```

Then I went through all the changes and removed the ones that wasn't
expected to be changed.

Fixes https://pagure.io/fedora-infrastructure/issue/12391

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2025-02-10 20:31:49 +00:00
Kevin Fenzi
22f3d8832f handlers: more renaming fixes
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-01-24 14:06:11 -08:00
Ryan Lerch
47c68f478d ansiblelint fixes - fqcn[action-core] - template to ansible.builtin.template
Replaces references to template: with ansible.builtin.template

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:30:29 +10:00
Ryan Lerch
25391e95b7 ansiblelint fixes - fqcn[action-core] - package to ansible.builtin.package
Replaces many references to  package: with ansible.builtin.package

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:28:00 +10:00
Ryan Lerch
462176464b ansiblelint fixes-- fqcn[action-core] - command to ansible.builtin.command
Replaces many references to  command: with ansible.builtin.command

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:26:47 +10:00
Ryan Lerch
6a3816dfdc ansiblelint fixes-- fqcn[action-core] - copy to ansible.builtin.copy
Replaces many references to 'copy' with ansible.builtin.copy

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 10:43:31 +10:00
Ryan Lerch
62952df107 ansiblelint fixes-- fqcn[action-core] - file to ansible.builtin.file
Replaces many references to  file: with ansible.builtin.file

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 10:41:52 +10:00
Ryan Lerch
691adee6ee Fix name[casing] ansible-lint issues
fix 1900 failures of the following case issue:

`name[casing]: All names should start with an uppercase letter.`

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-14 20:20:07 +10:00
Ryan Lerch
89f6f1fc32 Fix majority of remaining yamllint warnings and errors
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2024-11-28 17:31:45 +10:00
Kevin Fenzi
ef8a734d69 nagios: also make sure the service is running and enabled
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-11-21 12:53:00 -08:00
Kevin Fenzi
160a909053 noc: install ipmitool as well
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-11-19 13:22:48 -08:00
Michal Konecny
cee2700942 [nagios_server] Add zlopez to list of users who can use commands
I was not able to acknowledge alerts on nagios and this looks like the correct
place to get them.

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2024-11-04 12:52:38 +01:00
Kevin Fenzi
e3e2cb1d93 odcs: retire service ( infra 12192 )
Time to retire ODCS. ELN is moved off and that was the last thing using
it. Thanks for all the service ODCS!

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-09-24 18:21:51 +00:00
Jiri Podivin
f513e7cbcd Linting python scripts
Signed-off-by: Jiri Podivin <jpodivin@redhat.com>
2024-09-18 19:57:29 +00:00
James Antill
31de6ced58 nagios: change the monitoring of registry.fedoraproject.org to start at
fedora (skiping fNN/* etc), so we don't hit limits and not see
        fedora* images.

Signed-off-by: James Antill <james@and.org>
2024-09-12 19:24:03 +00:00
Kevin Fenzi
0dfa11a6eb fedimg: signing off...
Thanks for all the uploads fedimg.
You go to a far far better place I'm sure.

There's no point in keeping it around now, as it's actually not working
and the replacement ( cloud-image-uploader) should work soon.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-13 16:40:01 -07:00
Kevin Fenzi
d6ecf4c07d virthost-cc-rdu02/rhel7 becomes vmhost-x86-cc02/rhel9
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-02 11:53:18 -07:00
Stephen Smoogen
a0397d7abb Add blocks to nagios.conf httpd
I forgot I am the expert on nagios configs so added it to the template
file.

Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2024-07-09 09:18:56 +00:00
Kevin Fenzi
2397e3fbc4 mirrormanager: remove no longer needed nagios check for frontend
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-01 14:37:55 -07:00