fedora-infra_ansible/roles/openqa/worker/files
Adam Williamson 666196bbed openqa/worker: don't start worker unless NFS mount is up
There's an annoying pattern where the NFS mount fails on boot,
the worker services all start up anyway and take jobs, and the
jobs instafail because the share isn't there.

Ideally we could handle this very easily with Restart= directives
but systemd has...*opinions* about this:

https://github.com/systemd/systemd/issues/4468
https://github.com/systemd/systemd/issues/1312

so we have to do some fairly awkward hacks to just express:

* Retry the NFS mount if it fails
* Don't start the workers unless the NFS mount is up
* Retry the workers after a while if they were blocked
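
For illustration only, the last two points could be sketched as a
drop-in for the worker unit. This is not necessarily what this commit
ships: the drop-in path, the share path /var/lib/openqa/share and the
60 second delay are all assumptions. It deliberately avoids a hard
dependency on the mount unit, since the linked issues suggest Restart=
does not help when a dependency fails; instead the start attempt itself
fails, which Restart=on-failure does cover. Retrying the mount is not
shown here.

```ini
# Hypothetical drop-in, e.g.
# /etc/systemd/system/openqa-worker@.service.d/nfs-check.conf
# (file name and location are assumptions, not taken from the commit)

[Service]
# Fail this start attempt unless the share is an active mount point.
# /var/lib/openqa/share is an assumed path; adjust to the real share.
ExecStartPre=/usr/bin/mountpoint -q /var/lib/openqa/share

# A failed ExecStartPre= counts as a service failure, so systemd
# retries the worker later instead of leaving it dead.
Restart=on-failure
# Assumed delay between attempts; long enough to stay under the
# default start rate limit.
RestartSec=60
```

With something like this in place, a blocked worker should show up as
"activating (auto-restart)" in systemctl status until the mount
appears.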

It's ugly, but in testing this same config on one worker it seems
to work...

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2025-07-10 19:07:54 -07:00