9236 Commits

Author SHA1 Message Date
Adam Williamson
c60a3afe99 relvalconsumer: update AMI consumer routing keys
This should cover both before and after
https://pagure.io/cloud-image-uploader/pull-request/28 if it
gets merged.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-09-12 16:58:36 -07:00
David Kirwan
6354a6bd38 communishift: add communishift-commops-analytics project
Signed-off-by: David Kirwan <davidkirwanirl@gmail.com>
2024-09-12 13:42:39 +01:00
Pavel Raiskup
79ee807af5 vmhost-x86-copr01: update mac addresses
https://pagure.io/fedora-infrastructure/issue/11950
2024-09-12 07:48:26 +02:00
David Kirwan
1764f3f86f communishift: add communishift-fossology
Signed-off-by: David Kirwan <davidkirwanirl@gmail.com>
2024-09-11 15:39:48 +01:00
David Kirwan
d2920ad85d zabbix: add templates group var to releng_compose_eln
Signed-off-by: David Kirwan <davidkirwanirl@gmail.com>
2024-09-09 17:05:54 +01:00
Adam Williamson
01aa61f145 decommission openqa-a64-worker01 for now
It seems to have memory issues. Comment it out and promote 03 to
createhdds and tap1 duties.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-09-05 15:28:09 -07:00
Adam Williamson
33ff6fda44 openqa: as an experiment, cut workers on a64 prod to 20
This is to see if it reduces the number of test flakes; I want
to see if we're running too many workers for the system to cope
with (maybe the storage).

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-09-05 14:16:13 -07:00
Kevin Fenzi
969024df88 communishift: add weekly-bootc. ticket 12156
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-09-03 15:10:16 -07:00
Aurélien Bompard
f26baf7de2 Deploy webhook-to-fedora-messaging to prod
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2024-09-03 09:57:57 +02:00
Adam Williamson
c6ad51abc0 openqa/worker: bump thresholds higher
sigh, we're still hitting it on a64-worker04.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-08-31 00:01:10 -07:00
Adam Williamson
2dbf99e280 openqa/worker: bump load average threshold for big worker hosts
This is a new feature in openQA that prevents worker hosts
picking up new jobs if their load average is above a certain
threshold. It defaults to 40. Our big worker hosts tend to run
above this, so let's bump it on those.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-08-30 23:27:48 -07:00
Kevin Fenzi
4f020d47a5 Add communishift-ocm group (ticket 12138)
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-28 17:07:36 -07:00
Kevin Fenzi
069e2cbc9f nagios: clear old retired hosts mgmt
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-27 12:04:23 -07:00
Michal Srb
a876382717 retrace03: Add f41 repos
Signed-off-by: Michal Srb <michal@redhat.com>
2024-08-26 12:22:21 +00:00
Kevin Fenzi
92aeb18e12 inventory: remove stray zvm group
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-22 15:08:58 -07:00
Kevin Fenzi
19f3868519 builders: all the builders should be f40 now
We moved them to f40 via upgrade, so sync up ansible to match so that when
we reinstall them they will get 40 instead of 39.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-22 14:54:40 -07:00
Kevin Fenzi
0bdf132aa6 download: add mirror.twds.com.tw ticket 12129
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-19 13:11:59 -07:00
Kevin Fenzi
0dfa11a6eb fedimg: signing off...
Thanks for all the uploads fedimg.
You go to a far far better place I'm sure.

There's no point in keeping it around now, as it's actually not working
and the replacement (cloud-image-uploader) should work soon.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-13 16:40:01 -07:00
Samyak Jain
21bf0e3794 Enabling koji back to stage
Signed-off-by: Samyak Jain <samyak.jn11@gmail.com>
2024-08-14 00:53:16 +05:30
Samyak Jain
9465ff33a1 Enabling koji for internal machines
Signed-off-by: Samyak Jain <samyak.jn11@gmail.com>
2024-08-13 21:25:31 +05:30
Kevin Fenzi
c4024c4aa4 pdc: fare thee well!
This commit retires pdc from ansible.
The website should get redirected to a wiki page about the retirement.
If for some reason we need to bring things back, the vm's will still
have their disks and xml saved off so we can bring it back.
Would need to revert this, run proxy playbooks and do a little cleanup
on the redirect, then bring the vm's back up.
Hopefully we don't have to.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-02 13:39:15 -07:00
Kevin Fenzi
88ebe715b5 vmhost-x86-cc02: fix up host vars, with the host vars file this time
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-02 13:14:19 -07:00
Kevin Fenzi
9860e06e58 vmhost-x86-cc02: fix up host vars
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-02 13:13:57 -07:00
Kevin Fenzi
d6ecf4c07d virthost-cc-rdu02/rhel7 becomes vmhost-x86-cc02/rhel9
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-02 11:53:18 -07:00
Kevin Fenzi
6ea92d7154 noc-cc01: setup internal interface
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-02 08:58:45 -07:00
Kevin Fenzi
f8ae207321 noc-cc01: put mgmt on same ip as old noc
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 14:51:00 -07:00
Kevin Fenzi
03c1c2b5bf cloud-noc-os01: retire
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 14:44:19 -07:00
Kevin Fenzi
9eb49ee0d5 noc-cc01: adjust kickstart for new anaconda inst.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 13:46:36 -07:00
Kevin Fenzi
9ccd3ba16e rdu-cc: adjust some dns searches
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 13:43:20 -07:00
Kevin Fenzi
1a82c3d6c4 noc-cc01: fix typo in host vars
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 13:40:40 -07:00
Kevin Fenzi
f0a562a8e9 noc-cc01: add new rhel9 noc in rdu-cc named better
The old cloud-noc-os01 was for the old openstack we used to have and
wanted to re-setup in rdu, but never did.

So, let's just move this to be more in line with our normal convention.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 13:38:19 -07:00
Kevin Fenzi
867139da37 vmhost-x86-cc01: add bridge to mgmt for noc vm here too
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-01 13:00:23 -07:00
Kevin Fenzi
7355a9349b logdetective01: add to cloud_aws group to get correct nagios checks
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-29 14:21:01 -07:00
Kevin Fenzi
974428680d vmhost-x86-07: move vm's off and retire
vmhost-x86-07 is a ~6 yr old server that we need to move off of.
So, move the guests and retire it.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-29 13:31:36 -07:00
Kevin Fenzi
4b21fd0ffe bvmhost-x86-08: remove from ansible to retire
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-29 11:23:48 -07:00
Kevin Fenzi
484ba1b632 odcs-backend-releng01: move to another vmhost so we can retire bvmhost-x86-08
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-29 11:12:07 -07:00
David Kirwan
765e619525 communishift: gitlabce
Signed-off-by: David Kirwan <davidkirwanirl@gmail.com>
2024-07-29 08:29:59 +01:00
Adam Williamson
4743c3fdce openqa/worker: transition all tap workers to NM-based setup
This seems to be working fine in testing, so let's deploy it
everywhere.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-07-25 14:54:03 -07:00
Adam Williamson
690a5eb951 openqa/worker: add NM-based tap setup and test on p09-worker01
network-scripts-openvswitch was removed in f40 and network-scripts
is going away in f41; we really need to get off using them.
This attempts to implement the same setup using NetworkManager,
based on a few different NM/ovs references, and the source of
openQA upstream's os-autoinst-setup-multi-machine. It might
need a bit of tweaking, so for now, we make it a separate task
and use it only on p09-worker01 for testing. This doesn't handle
tearing down the old network-scripts-based config as that's
pretty complex and will only need to happen once; I'll do it
manually before trying this out.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-07-25 13:50:39 -07:00
Adam Williamson
2ea8ffa760 Ugh, fixup for previous commit
Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-07-25 13:27:44 -07:00
Adam Williamson
ba9e0f04a0 Update openqa-p09-worker01 host vars
The interface name changed (thanks, 'predictable' names...sigh)
and this box *is* encrypted currently.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-07-25 13:25:36 -07:00
Miroslav Suchý
c591394e66 set ansible user for logdetective01
2024-07-25 13:30:49 +02:00
Miroslav Suchý
cd280f399e set python path for logdetective
trying to address
[WARNING]: Unhandled error in Python interpreter discovery for host logdetective01.fedorainfracloud.org: unexpected output from Python interpreter discovery
[WARNING]: Platform unknown on host logdetective01.fedorainfracloud.org is using the discovered Python interpreter at /usr/bin/python, but future installation of another Python interpreter could change the meaning of that path. See https://docs.ansible.com/ansible-core/2.14/reference_appendices/interpreter_discovery.html for more information.
fatal: [logdetective01.fedorainfracloud.org]: FAILED! => {"ansible_facts": {"discovered_interpreter_python": "/usr/bin/python"}, "changed": false, "module_stderr": "", "module_stdout": "Please login as the user \"fedora\" rather than the user \"root\".\n\n", "msg": "MODULE FAILURE\nSee stdout/stderr for the exact error", "rc": 142}
2024-07-25 13:27:23 +02:00
Miroslav Suchý
9f29a94193 add playbook for logdetective
https://pagure.io/fedora-infrastructure/issue/12021
2024-07-24 16:45:19 +02:00
Nils Philippsen
c901eae7ae rabbitmq: Fix typo making sudo groups ineffective
This amends commit dbbf94a411.

Signed-off-by: Nils Philippsen <nils@redhat.com>
2024-07-23 19:25:32 +02:00
Ryan Lerch
cebe9b9cb7 add communishift-forgejo project
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2024-07-23 08:50:26 +10:00
Kevin Fenzi
81a9f2ceaf bastion: add sysadmin-eln
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-22 07:54:37 -07:00
Adam Williamson
295c0ccb25 openqa: run aarch64 updates on prod too
Seems to be working fine on stg.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-07-19 17:22:59 -07:00
Adam Williamson
27ed0ce621 openqa: test running update tests on aarch64 on stg
We really ought to do this. Capacity and reliability are issues,
so I'm going to try it with a small set of core tests at first.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2024-07-19 09:59:51 -07:00
Kevin Fenzi
64c216b79d compose-eln01: not external
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-17 17:19:20 -07:00