Commit Graph

9853 Commits

Author SHA1 Message Date
Greg Sutcliffe
0a53bacdce Zabbix: ignore internal vnet interfaces when doing network monitoring for vm hosts too
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-23 14:18:36 +01:00
Greg Sutcliffe
a11cf08594 Zabbix: ignore internal vnet interfaces when doing network monitoring for bvm hosts
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-23 13:24:44 +01:00
Greg Sutcliffe
ffbb5e8777 Zabbix: Override package list for Zabbix server
The server uses the upstream release RPM, not EPEL, so
the package names are different. Our pattern for OS vars
override host_vars, so we have to explictly set an override var

Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-23 11:12:34 +01:00
Greg Sutcliffe
f54dabecf5 Zabbix: Only monitor ping for STGmgmt IPs
Turns out we don't have 80/443 access from .stg to .mgmt so
we can only monitor ping for these hosts

Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-23 10:56:46 +01:00
Greg Sutcliffe
0f7d5cb568 Zabbix: Revert change to agent path, settling on EPEL agent packages
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-23 10:38:55 +01:00
Greg Sutcliffe
76b7e1289e Zabbix: use EPEL/Fedora package names for the zabbix-agent
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-23 10:12:42 +01:00
Kevin Fenzi
807e80d0c3 memcached02.stg.rdu3.fedoraproject.org is in staging
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-22 18:43:19 -07:00
Kevin Fenzi
27c2c0b2b3 inventory: debuginfod01.stg is in the staging group
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-22 17:04:32 -07:00
Kevin Fenzi
ab4e2e9cee buildvm-a64-40: this vm is on bvmhost-a64-04
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-22 15:40:37 -07:00
Kevin Fenzi
6c759ec459 buildhw-x86-04: fill in mac addresses
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-22 15:37:35 -07:00
Greg Sutcliffe
fe81326dc4 Zabbix: add BMC host_vars to monitor BMC interfaces on the host entry 2025-07-22 16:23:21 +01:00
Kevin Fenzi
a3dd45c855 buildvm: all these are f42 already, just adjust the more generic groups files
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-21 17:31:11 -07:00
Kevin Fenzi
ea119bea09 ibiblio05: try and drop auto_gateway: no
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-21 15:33:26 -07:00
Kevin Fenzi
86eb4f3e9d ibiblio05: adjust network for new vlan
ibiblio wants to move us to a new vlan. They have already setup things
so we can tag into that vlan on their side, so this just configures a
br1 bridge with that vlan tagged. The existing vm's should be able to be
on the existing vlan for now, but this will let us provision on that
network/vlan.

Also, it seems that I didn't set these up correctly network wise.
They are just using the interfaces directly instead of using a bond over
them. This configuration does that correctly.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-21 15:09:28 -07:00
Kevin Fenzi
b44c28e08d inventory: add some buildhw's to inventory/nagios
We want to monitor the a64 and x86 buildhw devices too.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-21 14:36:43 -07:00
Kevin Fenzi
ff15dbd044 buildhw-x86-04: provision
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-21 14:17:04 -07:00
Kevin Fenzi
2f4f0e8354 buildvm-a64: off by one error
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-21 13:11:50 -07:00
Kevin Fenzi
30dac8c0ae bvmhost-a64-4*: fix up some ips
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-21 12:52:54 -07:00
Kevin Fenzi
7d98656f2b buildvm-a64-41 to 48: add 8 more aarch64 buildvms
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-21 12:20:41 -07:00
Aurélien Bompard
a51c0ea353 RabbitMQ: setup sending the queue metrix to CentOS
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-21 15:23:42 +02:00
Kevin Fenzi
f6e453b0ff bvmhost-a64-05: add another aarch64 bvmhost
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-18 12:14:52 -07:00
Kevin Fenzi
a601c59604 bvmhost-p10-01: bump procs warning a bit
This is alerting a lot at 4000, bump it to 4500.
It has a lot of processes running on it.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-18 09:58:48 -07:00
Michal Konecny
163446f4dc [ipa] Add ipa machines to VPN
The IPA machines are currently not reachable through VPN. This is
because they are missing firewall rules for VPN as they need to
have vpn variable set to include them.
2025-07-18 10:44:39 +02:00
Greg Sutcliffe
7f877e95ee Zabbix: Add prod Matrix room ID - take2
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-17 17:16:13 +01:00
Greg Sutcliffe
32f290bf88 Zabbix: Add prod Matrix room ID
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-17 17:14:20 +01:00
Michal Konecny
daafbfc969 [pagure] Add staging machine to staging group
The pagure staging machine is not in staging group and instead uses prod
variables. Let's fix that.
2025-07-17 12:18:59 +02:00
Kevin Fenzi
20f9de9a38 bvmhost-p10-01: increase procs warning and crit thresholds
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-16 21:00:47 -07:00
Kevin Fenzi
97dab9dcaf iscsi_client: readd role, apply to power10 host and switch guests to use it
This re-adds a iscsi_client role we had in iad2 back in in rdu3.
When then apply it to bvmhost-p10-01 to login and use a iscsi lun from
the rdu3 netapp. We then move the buildvm-ppc64le vm's to use this iscsi
volume instead of local storage.

As we reinstall those builders they will use the iscsi volume.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-16 15:19:56 -07:00
Kevin Fenzi
4f7b2ef98d inventory: clean up some duplicate variables
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 17:18:12 -07:00
Adam Williamson
75cebd40eb Really drop ns03 from openQA worker DNS config
Signed-off-by: Adam Williamson <awilliam@redhat.com>
2025-07-15 15:18:11 -07:00
Adam Williamson
e9435623a1 Drop broken ns03 from openQA worker network config temporarily
This server isn't working and we can't figure out why not. It's
a problem for openQA because we copy the host's DNS config into
'advanced networking' openQA guests, and then when we do a
FreeIPA deployment test, it picks up both DNS servers, tries to
confirm both work, and fails. So we need to take ns03 out until
it's fixed.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2025-07-15 15:11:58 -07:00
Kevin Fenzi
1a41934f52 ns03: fix another copy pasta
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 12:32:05 -07:00
Kevin Fenzi
989b73537d proxy01: do not give proxy01 ns01s ip
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 12:19:49 -07:00
Kevin Fenzi
dce80c9d1a sign-vault02: provision in rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-15 09:34:56 -07:00
Kevin Fenzi
11c4c4f211 inventory: add ipv6 addresses to various hosts that need them
We don't have ipv6 routing setup yet, but are scheduled to work on that
soon. To get ready for that, lets add ipv6 addresses to the (few)
machines that will actually need them.

We do not want to add ipv6 to all hosts. The vast majority of them never
need to talk to the outside world directly and shouldn't have a ipv6
address that can do this.

These few hosts are ones with external nat mappings where it is
desireable that they be able to handle ipv6 connections.

Note that we also do NOT want to add any of these to dns until
they are known working. We also will likely have to adjust nftables
to allow the services on ipv6 that we do on ipv4 (if they make sense).

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 15:02:06 -07:00
Kevin Fenzi
be410884f9 kernel02: this is using a bond/bridge now
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 13:34:58 -07:00
Kevin Fenzi
23f98071f8 kernel02 for rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 13:31:50 -07:00
Kevin Fenzi
4f01c21e72 bvmhost-p09-05: fix mac3 address
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 12:07:41 -07:00
Kevin Fenzi
b04d0d372f readd bvmhost-p09-05 in rdu3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-14 11:19:30 -07:00
Michal Konecny
7ff5ac563e [mailman3] Remove the authentication options 2
Remove the authentication options also from group vars.
2025-07-14 15:41:49 +02:00
Greg Sutcliffe
0d71c0bce0 Nagios: remove http check on p10 mgmt interface
Signed-off-by: Greg Sutcliffe <fedora@emeraldreverie.org>
2025-07-11 20:06:33 +00:00
Kevin Fenzi
a64ef334cc ns02.rdu3 becomes ns03.rdu3.
This is to disambiguate 'ns02'. Right now we have ns02.fedoraproject.org
and also ns02.rdu3.fedoraproject.org. After this we will just have a
ns02 and a ns03.rdu3 server.

This will also allow us to more easily change whois/glue records.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-10 10:53:19 -07:00
Kevin Fenzi
434f2f9405 inventory: add bodhi-backend01.stg to staging
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 16:06:44 -07:00
Kevin Fenzi
70c633121e Add bodhi-backend01.stg and adjust value01
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 16:00:08 -07:00
Kevin Fenzi
05311f97fc flatpak-cache01: use correct vmhost
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 11:21:31 -07:00
Kevin Fenzi
0228df9cd0 flatpak-cache01: add rdu3 host vars and install
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 11:16:50 -07:00
Kevin Fenzi
233ec96688 inventory: drop non existant machines
These are various machines that are not yet deployed, or no longer exist
in rdu3 (though they did in iad2). This should clean up nagios
a fair bit and when/if we redeploy these we can add them back in.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-09 10:26:51 -07:00
Aurélien Bompard
4272c8aa77 proxy04 and proxy12 are reachable again
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2025-07-09 16:16:12 +02:00
Kevin Fenzi
9053bd61a4 mailman01.stg: this host should be in the staging group
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-08 14:07:43 -07:00
Kevin Fenzi
3a9405cd77 bvirthost: increase default number of processes before alerting
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-07-06 09:37:40 -07:00