This is VASTLY better than the hack we have in base now to try and setup
ifcfg files. It uses a standard role that has lots of options and does
the right thing with NetworkManager. Ideally we would switch everything
to this, but lets try it here first to see. It should work with bridges,
etc as well.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
... it is down on staging. So the whole role/messaging/base is broken,
and we can not depend on it:
TASK [rabbit/user : Create the user in RabbitMQ] ***********************
Tuesday 11 August 2020 05:37:34 +0000 (0:00:00.121) 0:05:22.244 *
Tuesday 11 August 2020 05:37:34 +0000 (0:00:00.121) 0:05:22.243 *
[WARNING]: Unhandled error in Python interpreter discovery for host
rabbitmq01.stg.iad2.fedoraproject.org: Failed to connect to the host via
ssh: ssh: connect to host rabbitmq01.stg.iad2.fedoraproject.org port 22:
Connection timed out
fatal: [copr-be-dev.aws.fedoraproject.org]: UNREACHABLE! => {"changed":
false, "msg": "Data could not be sent to remote host
\"rabbitmq01.stg.iad2.fedoraproject.org\". Make sure this host can be
reached over ssh: ssh: connect to host rab
bitmq01.stg.iad2.fedoraproject.org port 22: Connection timed out\r\n",
"unreachable": true}
So there's now a new toggle named "copr_messaging", turned on only for
production copr instance.
db-koji01 has been running with this since before the mass rebuild, and
it seems to make it have a higher load, but process faster and without
stalling when doing backups or when long/bad koji-gc queries for old
versions of texlive hit it.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
THe cache host needs to accept port 80 requests for varnish.
MOve that from the old 07 zvm instance to the new 24 kvm one.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
We already use this in other builders. The worst that can happen is that
the vm becomes corrupt from a hard power off or the like. If so, we blow
it away and make a new one. This should end up making builds a lot
faster.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
The virthost has 32 cpus, and we have ~11 guests.
If we use 4 cpus per we oversubscribe the host.
So, go down to 3 for now and see if it helps things.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
We are disabling the autosign01.iad2.fp.o so that nagios wont complain
about it with ssh turned off and to disable someone running something
like 'ansible something all'
This reverts commit 91671f86cc.