Commit Graph

43806 Commits

Author SHA1 Message Date
Adam Williamson
75cebd40eb Really drop ns03 from openQA worker DNS config
Signed-off-by: Adam Williamson <awilliam@redhat.com>
2025-07-15 15:18:11 -07:00
Adam Williamson
e9435623a1 Drop broken ns03 from openQA worker network config temporarily
This server isn't working and we can't figure out why not. It's
a problem for openQA because we copy the host's DNS config into
'advanced networking' openQA guests, and then when we do a
FreeIPA deployment test, it picks up both DNS servers, tries to
confirm both work, and fails. So we need to take ns03 out until
it's fixed.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2025-07-15 15:11:58 -07:00
Kevin Fenzi
1a41934f52 ns03: fix another copy pasta
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 12:32:05 -07:00
Kevin Fenzi
989b73537d proxy01: do not give proxy01 ns01s ip
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 12:19:49 -07:00
Kevin Fenzi
dce80c9d1a sign-vault02: provision in rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 09:34:56 -07:00
Greg Sutcliffe
286cc834a1 Zabbix: Comment zabbix-agent in server playbook, it's causing some issues I need to look at
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-15 16:08:48 +01:00
Jakub Kadlcik
225351a2c8 copr-be: make sure dist-git-client is up-to-date 2025-07-15 13:40:38 +02:00
Greg Sutcliffe
a26d68ebec Zabbix: Refactor sync-noc-membership and add zabbix-agent to server playbook
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-15 12:29:51 +01:00
Kevin Fenzi
5376cbe713 bodhi-backend01: recycle db connections after 600 seconds
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 20:43:09 -07:00
Kevin Fenzi
11c4c4f211 inventory: add ipv6 addresses to various hosts that need them
We don't have ipv6 routing setup yet, but are scheduled to work on that
soon. To get ready for that, lets add ipv6 addresses to the (few)
machines that will actually need them.

We do not want to add ipv6 to all hosts. The vast majority of them never
need to talk to the outside world directly and shouldn't have a ipv6
address that can do this.

These few hosts are ones with external nat mappings where it is
desireable that they be able to handle ipv6 connections.

Note that we also do NOT want to add any of these to dns until
they are known working. We also will likely have to adjust nftables
to allow the services on ipv6 that we do on ipv4 (if they make sense).

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 15:02:06 -07:00
Aurélien Bompard
d8cab8b424 FMN: fix configuration variable
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-14 23:09:08 +02:00
Kevin Fenzi
be410884f9 kernel02: this is using a bond/bridge now
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 13:34:58 -07:00
Kevin Fenzi
23f98071f8 kernel02 for rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 13:31:50 -07:00
Kevin Fenzi
4f01c21e72 bvmhost-p09-05: fix mac3 address
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 12:07:41 -07:00
Kevin Fenzi
b04d0d372f readd bvmhost-p09-05 in rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 11:19:30 -07:00
David Kirwan
57a7517abb forgejo: remove OADP operator content, create netapp PVC
Signed-off-by: David Kirwan <davidkirwanirl@gmail.com>
2025-07-14 15:50:23 +01:00
David Kirwan
a54397cea8 forgejo: disable creation of the PVC by helm chart
Signed-off-by: David Kirwan <davidkirwanirl@gmail.com>
2025-07-14 15:19:18 +01:00
Michal Konecny
7ff5ac563e [mailman3] Remove the authentication options 2
Remove the authentication options also from group vars.
2025-07-14 15:41:49 +02:00
Michal Konecny
19f13fa83f [mailman3] Remove non working authentication options
https://pagure.io/fedora-infrastructure/issue/12653
2025-07-14 15:04:11 +02:00
Michal Konecny
e0ed8cd7dd [mailman3] Add automatic restart on web
The gunicorn is sometimes getting killed by OOM (I assume it's the AI
scrapers). So let's add automatic restart to get it back on track.
2025-07-14 10:39:13 +02:00
Greg Sutcliffe
0d71c0bce0 Nagios: remove http check on p10 mgmt interface
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-11 20:06:33 +00:00
Greg Sutcliffe
9f06fe9e9c Selinux: allow NRPE to follow symlinks when checking files
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-11 20:03:17 +00:00
Dusty Mabe
195d6b3b15 set empty string for storageclass for fedora-ostree-pruner
This was done in d97f99c for coreos-ostree-importer. Here I add a
comment with more context to the the coreos-ostree-importer one and
add the same config to the fedora-ostree-pruner.
2025-07-11 15:42:54 -04:00
Kevin Fenzi
f32e35dc41 koji: restore some web endpoints now, uncomment if load becomes too bad
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-11 11:16:09 -07:00
Aurélien Bompard
20ca80e25f fixup the fixup
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-11 18:15:13 +02:00
Aurélien Bompard
c1d5cb0c9f Fixup last commit
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-11 18:14:24 +02:00
Aurélien Bompard
2b4fa6bd91 Refactor the sysadmin-openshift role to be a more generic openshift post-install role
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-11 18:06:07 +02:00
Michal Konecny
50eb5e0165 [freemedia] Apply SELinux changes
sefcontext doesn't apply changes, just set's the context. Let's apply
them by using restorecon.
2025-07-11 15:02:17 +02:00
Michal Konecny
ac63dc1ed1 [freemedia] Use the correct module name 2025-07-11 14:14:39 +02:00
Michal Konecny
f18b2fbf84 [freemedia] Fix lint errors
Let's make the ansible-lint happy.
2025-07-11 14:08:17 +02:00
Michal Konecny
2cea99c83e [freemedia] Use sefcontext module
Instead of command that is setting SELinux context (not applying it)
let's use module that will do that for us.
2025-07-11 13:50:50 +02:00
Adam Williamson
666196bbed openqa/worker: don't start worker unless NFS mount is up
There's this annoying pattern where the NFS mount fails on boot
and then the worker services all start up and take jobs, but they
instafail because the share isn't there.

Ideally we could handle this very easily with Restart= directives
but systemd has...*opinions* about this:

https://github.com/systemd/systemd/issues/4468
https://github.com/systemd/systemd/issues/1312

so we have to do some fairly awkward hacks to just express:

* Retry the NFS mount if it fails
* Don't start the workers unless the NFS mount is up
* Retry the workers after a while if they were blocked

It's ugly, but in testing this same config on one worker it seems
to work...

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2025-07-10 19:07:54 -07:00
Kevin Fenzi
a64ef334cc ns02.rdu3 becomes ns03.rdu3.
This is to disambiguate 'ns02'. Right now we have ns02.fedoraproject.org
and also ns02.rdu3.fedoraproject.org. After this we will just have a
ns02 and a ns03.rdu3 server.

This will also allow us to more easily change whois/glue records.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-10 10:53:19 -07:00
Michal Konecny
7f7d887102 [nagios_client] Add missing nagios plugin
When looking at the mailman alert I found out that it's using
nagios-plugins-http, which is not installed by default. So let's add it
to list of installed packages.
2025-07-10 15:45:42 +02:00
Aurélien Bompard
fd48d02915 Improve the check_ipa_free_ids.py script to be able to set thresholds
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 11:53:25 +02:00
Aurélien Bompard
0c6153cebe Create an IPA service for monitoring and use it for check-ipa-free-ids
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 11:46:05 +02:00
Aurélien Bompard
5d1c0427c9 Nagios: use a keytab for check_ipa_free_ids
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-10 10:17:07 +02:00
Michal Konecny
4658698927 [openshift] Update sysadmin openshift group
* Add myself to sysadmin openshift
* Remove Vipul as he is no longer in the project
2025-07-10 08:05:07 +02:00
Akashdeep Dhar
c319941db9 Add client config for W2FM UI test environment
Signed-off-by: Akashdeep Dhar <akashdeep.dhar@gmail.com>
2025-07-10 05:16:44 +00:00
Kevin Fenzi
434f2f9405 inventory: add bodhi-backend01.stg to staging
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 16:06:44 -07:00
Kevin Fenzi
70c633121e Add bodhi-backend01.stg and adjust value01
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 16:00:08 -07:00
Kevin Fenzi
05311f97fc flatpak-cache01: use correct vmhost
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 11:21:31 -07:00
Kevin Fenzi
0228df9cd0 flatpak-cache01: add rdu3 host vars and install
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 11:16:50 -07:00
Kevin Fenzi
233ec96688 inventory: drop non existant machines
These are various machines that are not yet deployed, or no longer exist
in rdu3 (though they did in iad2). This should clean up nagios
a fair bit and when/if we redeploy these we can add them back in.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 10:26:51 -07:00
Aurélien Bompard
cf00289c06 Add a Nagios check to monitor IPA ID ranges
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-09 17:27:19 +02:00
Aurélien Bompard
4272c8aa77 proxy04 and proxy12 are reachable again
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-09 16:16:12 +02:00
Aurélien Bompard
02abc93d59 Add a script to send queue metrics to CentOS's Zabbix
Fixes: https://pagure.io/fedora-infrastructure/issue/12617

Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-09 15:13:32 +02:00
Yaakov Selkowitz
cddfb47925 releng: tweak eln compose schedule
Signed-off-by: Yaakov Selkowitz <yselkowi@redhat.com>
2025-07-08 23:28:08 +00:00
Nils Philippsen
3f2138029b Add myself to sysadmin-openshift
Signed-off-by: Nils Philippsen <nils@redhat.com>
2025-07-08 23:21:29 +00:00
Nils Philippsen
59e778fa99 ansible-lint: Name ocp4-sysadmin-openshift play
Signed-off-by: Nils Philippsen <nils@redhat.com>
2025-07-08 23:21:29 +00:00