Commit Graph

108 Commits

Author SHA1 Message Date
Kevin Fenzi
d42014c740 httpd: set a few conf files as optional for initial installs
Due to ordering, these config files don't get copied over before the
first time httpd gets restarted. Setting them as optional should allow
initial runs to work better.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2025-05-28 19:49:32 -07:00
Michal Konecny
2ec055db6f Use first uppercase letter for all handlers
This will unify all the handlers to use first uppercase letter for
ansible-lint to stop complaining.

I went through all `notify:` occurrences and fixed them by running
```
set TEXT "text_to_replace"; set REPLACEMENT "replacement_text"; git grep
-rlz "$TEXT" . | xargs -0 sed -i "s/$TEXT/$REPLACEMENT/g"
```

Then I went through all the changes and removed the ones that wasn't
expected to be changed.

Fixes https://pagure.io/fedora-infrastructure/issue/12391

Signed-off-by: Michal Konecny <mkonecny@redhat.com>
2025-02-10 20:31:49 +00:00
Ryan Lerch
47c68f478d ansiblelint fixes - fqcn[action-core] - template to ansible.builtin.template
Replaces references to template: with ansible.builtin.template

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 11:30:29 +10:00
Ryan Lerch
62952df107 ansiblelint fixes-- fqcn[action-core] - file to ansible.builtin.file
Replaces many references to  file: with ansible.builtin.file

Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2025-01-15 10:41:52 +10:00
Ryan Lerch
89f6f1fc32 Fix majority of remaining yamllint warnings and errors
Signed-off-by: Ryan Lerch <rlerch@redhat.com>
2024-11-28 17:31:45 +10:00
Kevin Fenzi
c04dd50fbf certificates: move to new 2024 wildcard fedoraproject.org cert
The old one expires in about 57 days, but might as well just renew it
early and avoid problems later.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-10-03 16:50:01 -07:00
Mikolaj Izdebski
bb4d2c2456 Koschei: Add custom robots.txt
Related: fedora-infrastructure#12180
2024-09-13 22:15:14 +02:00
Kevin Fenzi
f40fef9316 resultsdb: crawlers go away
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-19 10:45:02 -07:00
Stephen Smoogen
7e426dbf37 Add more spiders which do not seem to honour robots.txt
I went through the last couple of logs afer the first round of 'turn
off the spiders' went out. I looked at the areas which the /robots.txt
disregard and then looked for the bots which ignored it and still
looked up stuff in 'accounts'. This may cut down CPU spikes as these
are looking at dynamic data which can 'blow' things up.

It might be good to add similar tooling to pagure and src since they
seem to be hit a lot in the logs also.

Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2024-07-09 09:18:56 +00:00
Kevin Fenzi
6c2f16eb9a src: adjust robots.txt and fix formatting
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-03 10:00:01 -07:00
Kevin Fenzi
2140cf2080 mailman: adjust the right robots.txt and fix formatting
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-07-03 09:58:18 -07:00
Kevin Fenzi
59829605c5 src.fedoraproject.org: adjust robots.txt
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-06-28 12:15:16 -07:00
Kevin Fenzi
6ac1be1f3a lists: block agreessive crawler
This crawler is doing about 2M requests a day to
lists.fedoraproject.org. This is causing db load on db01, causing
services to have issues.

So, block them here for now, but we may want to block them elsewhere
too.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-04-19 11:47:18 -07:00
Kevin Fenzi
bc745fe824 website: try tagging the letsencrypt include_role
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-03-27 13:37:21 -07:00
Francois Andrieu
4479551510 websites: Disable indexing of alt.fp-o/cloud & fix redirect
alt.fpo/en/cloud is still the first link to pop up on search engine and
the current redirect only works with /cloud.
This commit fix the redirect, and prevent this old page to be indexed.
2024-02-06 22:19:14 +01:00
Kevin Fenzi
aeea2c1d06 proxies / koji.fedoraproject.org: disallow robots on koji.fedoraproject.org
Turns out back in 2015 (in 623be6ff73) I added a robots.txt on the koji
hubs to disallow crawling, but since it's behind the proxies, no one
ever saw that robots.txt. Instead they got the default one that just had
a 'crawl-delay 1' in it.

So, lets tell robots to go away for real now, as they are causing load
problems on the koji database and I don't think it's particularly useful
for koji to be indexed. It's a lot of dynamic content anyhow.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-11-06 15:48:09 +00:00
Kevin Fenzi
47cf07184e wildcard-2023.fedoraproject.org: new wildcard ssl cert
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-11 08:48:40 -08:00
Francois Andrieu
f9f0942988 docs: disallow indexing of old-docs 2022-04-03 17:54:25 +00:00
Kevin Fenzi
1e712cdc30 Revert "Revert "wildcard-2022.fedoraproject.org cert""
This reverts commit 4430178b29.

It's time to put this back before the cert expires and before we go into
Beta freeze. Hopefully the odd issue with armv7 qemu guests having a
time behind real time is not still happening.
2022-02-21 10:19:17 -08:00
Kevin Fenzi
4430178b29 Revert "wildcard-2022.fedoraproject.org cert"
This reverts commit 57f0d4fdb6.

For an anoying reason, armv7 image builds come up with the time as 10
days ago, which makes this cert invalid. So, move back to the old cert
for a week or so and then switch to the new one again. ;(
2022-01-31 12:39:49 -08:00
Kevin Fenzi
57f0d4fdb6 wildcard-2022.fedoraproject.org cert
This is the renewed version of the old 2020 cert.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-01-27 10:37:27 -08:00
Stephen Smoogen
5db5700936 This will allow for X forwarded to be set for hosts which need it. [Currently only debuginfod] 2021-04-06 12:25:44 -04:00
Nick Bebout
849bcf0514 Create /srv/web 2020-06-19 12:42:56 -05:00
Kevin Fenzi
5e10e633fd proxies / lists: name the robots files correctly
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-29 22:56:01 +00:00
Kevin Fenzi
4949dd7baf proxies / lists: put lists robots.txt in a place where it actually is read
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-29 22:06:58 +00:00
Stephen Smoogen
37915eaf25 put in patches to use wildcard2020 2020-04-24 21:34:26 +02:00
Clement Verna
430daf7ff1 Bodhi: add a specific robots.txt
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:19 +02:00
Mikolaj Izdebski
a07dfe1c02 Remove koschei from robots.txt for apps.fp.o 2020-04-24 21:34:11 +02:00
Michael Scherer
474fac15f0 Add a option to bypass certgetter01 intercept
This permit to move, let's say,  blog to a provider
that do support LE natively, but without needing to
transfer the certificate or anything.

Just set "enable_certgetter: false", run the playbook,
do the http01 negociation, and then switch DNS.
2020-04-24 21:34:09 +02:00
Kevin Fenzi
d056090c7f websites: Disallow robots crawling fas.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-05-14 16:54:18 +00:00
Patrick Uiterwijk
0af40df768 Add Request ID for fun
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2019-04-13 19:18:26 +00:00
Patrick Uiterwijk
0c7449ea1d Add sslciphers tags
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-04-08 21:41:17 +02:00
Alessandro Lorenzi
3430d47412 Site src.fedoraproject.org expires
refs: #7629
2019-03-30 23:45:19 +00:00
Stephen Smoogen
11e2ff87a1 [proxies/robots.txt] Make it so that we force the proxy to use a local robots.txt
The various openshift tools get hit by various crawlers and do not send
a robots.txt. This seems to be due to the balancer code used to send
back to the nodes. This forces the proxy robots.txt to be honored
always.
2019-03-19 19:51:11 +00:00
Patrick Uiterwijk
bf6be45d70 Hope that curl fixed their GOAWAY HTTP/2 bug
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-02-25 16:47:23 +01:00
Rick Elrod
c16f040a40 this dir is needed for robots.txt stuff too
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-01-29 03:19:48 +00:00
Rick Elrod
62316d11b2 and make the template point to the new ones
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-01-29 02:00:39 +00:00
Rick Elrod
3c2e614eeb Make actual robots.txt files end with .txt so the mime-type is right since apache Alias will preserve the mime-type of the file it points to
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-01-29 01:49:44 +00:00
Rick Elrod
77644a5183 Attempt to make crawlers stop scanning stg.fp.o (infra #7514)
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-01-28 22:43:00 +00:00
Patrick Uiterwijk
f26ac060cb Only do OCSP stapling on the proxies
The actual cache is only set in the proxy HTTP config.
While we could set the cache path in the other servers' configs as well,
that would be a significantly larger change.

Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-09-25 21:52:12 +02:00
Stephen Smoogen
dde378de0a and this will start a long long long proxy push 2018-09-13 21:27:22 +00:00
Patrick Uiterwijk
4a385eadba Dont use h2 for (app.)os.stg.fedoraproject.org to fix websockets
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-09-12 01:27:39 +02:00
Patrick Uiterwijk
74502e1c52 Enable OCSP stapling on the proxies
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-28 23:01:20 +00:00
Rick Elrod
4a60ddc875 Nuke pkgdb some more... and probably break everything.
Signed-off-by: Rick Elrod <relrod@redhat.com>
2018-07-19 20:40:14 +00:00
Patrick Uiterwijk
26da0d1a0b For all proxied sites, all acme traffic will go to certgetter
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-17 00:12:39 +00:00
Patrick Uiterwijk
51cd5614be Use the ansible-installed hotspot.txt so we don't fail if web build fails
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-12 00:10:19 +00:00
Patrick Uiterwijk
9ade79edad For hotspot.txt, we want all the stats
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-12 02:01:01 +02:00
Patrick Uiterwijk
53b7a8ce3f Fix typo
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-11 21:30:16 +02:00
Patrick Uiterwijk
0fac774a60 Apply expires for hotspot.txt
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-11 21:22:08 +02:00
Patrick Uiterwijk
6d8240bd5b This is python-like, not C-like
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-11 19:08:56 +00:00