Aurélien Bompard
125d8a2655
nagios_server: update package command to not use items
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org >
2019-06-11 17:56:22 +02:00
Clement Verna
bb24183f46
oci-registry: Update nagios to monitor the correct directory for disk space
...
Signed-off-by: Clement Verna <cverna@tutanota.com >
2019-05-30 20:06:56 +02:00
Stephen Smoogen
40a819e1d5
[nagios/datanommer] this is what happens when you have 2 files which are supposeldy the same file. You edit one in nagios_server and miss the one in nagios_client. Bad nagios. Bad
2019-05-30 16:56:26 +00:00
Kevin Fenzi
b68a3cf906
nagios / bodhi: change masher to composer
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-05-29 02:57:01 +00:00
Randy Barlow
4cf1624c76
bodhi: Upgrade production to Bodhi 4.0.0.
...
Signed-off-by: Randy Barlow <randy@electronsweatshop.com >
2019-05-28 15:58:52 +00:00
Stephen Smoogen
b2599f8d2f
[nagios] try ang get groups working
2019-05-23 23:34:41 +00:00
Stephen Smoogen
af3def70a1
overzealous _ in group name for nagios group
2019-05-23 23:16:18 +00:00
Kevin Fenzi
ab4d457fec
nagios_server: correct some more templates for overzelous _ replacement
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-05-22 18:34:12 +00:00
Kevin Fenzi
52bb723fee
nagios_server: fix another _ case in group name
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-05-22 18:08:04 +00:00
Kevin Fenzi
15ebdb5233
nagios_server: fix autocloud-backend group to use _
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-05-22 17:24:40 +00:00
Patrick Uiterwijk
9b7882313f
More nagios fixes
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-05-22 17:17:21 +02:00
Patrick Uiterwijk
63fe73c878
More nagios file fixes
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-05-22 17:15:11 +02:00
Patrick Uiterwijk
c276097e32
nagios: fix smtp-mm
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-05-22 17:13:55 +02:00
Patrick Uiterwijk
c14b702513
Continue fixing nagios group names
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-05-22 17:12:23 +02:00
Patrick Uiterwijk
ab9cd48efe
nagios_server: Remove fedmsg checks from hotness01
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-05-22 16:40:56 +02:00
Kevin Fenzi
4b31ac5152
ansible: Change all our group names from foo-bar to foo_bar or foo-bar-baz to foo_bar_baz
...
In ansible 2.8 the - character isn't supposed to be valid in group names.
While we could override this, might has well just bite the bullet and change it.
So, just switch all group names to use _ instead of -
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-05-20 17:38:09 +00:00
Patrick Uiterwijk
30d9d21ceb
Use SSL for id.fp.o
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-05-09 20:37:17 +02:00
Patrick Uiterwijk
e67fc8ca91
Move ipsilon checks to proxies
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-05-09 20:35:05 +02:00
Clement Verna
e220bb4867
Fedbages: put a higher threshold to fedmsg hub backlog nagios alerts.
...
The fedbages consumer has to consume many more messages and the query to datanomer
are slower. We need to allow the consumer to have a bigger backlog before raising an
alert.
Generally above 35000 message in the backlog it will be difficult for the consumer
to catch up, in that case it might be better to flush the backlog and restart from 0
Signed-off-by: Clement Verna <cverna@tutanota.com >
2019-05-05 13:59:31 +02:00
Stephen Smoogen
c432675c74
[nagios] add checks for datanommer monitoring
2019-04-25 18:13:43 +00:00
Clement Verna
93d0eeaf54
Nagios: monitor that resultsdb sends messages on the bus
...
Signed-off-by: Clement Verna <cverna@tutanota.com >
2019-04-24 11:22:46 +02:00
Patrick Uiterwijk
0c7449ea1d
Add sslciphers tags
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-04-08 21:41:17 +02:00
Miroslav Suchý
1a1ca033b6
sent Copr nagios notifications to frostyx too
2019-04-01 10:13:00 +02:00
Miroslav Suchý
b7394d1c54
sent notifications to msuchy@ as clime@ does not work now
2019-04-01 10:10:33 +02:00
Stephen Smoogen
d9d24d08d9
[nagios_server] Add in certgetter test.
...
This was offered by Alessandro Lorenzi <alorenzi@alorenzi.eu > as a fix
to deal with our inability to monitor the certgetter after
reboots. Thank you very much for this work.
Signed-off-by: Stephen Smoogen <smooge@redhat.com >
2019-03-20 20:17:43 +00:00
Kevin Fenzi
625acabd00
nagios: monitoring resultsdb.ci.centos.org. Need to call it by it's internal fedora name here.
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-03-16 04:21:02 +00:00
Kevin Fenzi
39cee77465
nagios: monitor resultsdb.ci.centos.org. Ticket 7601.
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-03-15 20:49:49 +00:00
Kevin Fenzi
1a40dd5142
nagios: drop askbot fedmsg check.
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-02-21 15:34:31 +00:00
Kevin Fenzi
a6cceb3599
nagios: drop remnant of check_osbs_builds
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com >
2019-02-21 15:23:54 +00:00
Rick Elrod
8a4ed2358a
remove tummy01 everywhere
...
Signed-off-by: Rick Elrod <relrod@redhat.com >
2019-02-20 17:28:52 +00:00
Stephen Smoogen
06cc8ca030
and we have no bodhost
2019-02-14 21:11:50 +00:00
Rick Elrod
0b7bb3b5b3
prep for proxy03 move
...
Signed-off-by: Rick Elrod <relrod@redhat.com >
2019-02-11 23:14:27 +00:00
Rick Elrod
4c8cf933fc
make odcs-backend check for fedmsg-hub-3 instead (infra #7526 )
...
Signed-off-by: Rick Elrod <relrod@redhat.com >
2019-01-28 08:54:46 +00:00
Patrick Uiterwijk
18b0acc8f3
Monitor ostree summary on proxies
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2019-01-21 16:57:26 +01:00
Patrick Uiterwijk
2ded08f111
Add 24-hour check for bodhi compose start
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2018-12-20 20:44:10 +01:00
Stephen Smoogen
0267f916bd
This will fix nrpe on server
2018-12-19 13:44:03 +00:00
Stephen Smoogen
6c1357ff59
repospanner is only running on pkgs01 currently
2018-12-17 18:25:27 +00:00
Stephen Smoogen
418af8da3d
and now make it so nagios sees the file on the server
2018-12-17 17:00:13 +00:00
Stephen Smoogen
0819f469c0
this should allow noc01 to see nrpe commands
2018-12-17 16:42:16 +00:00
Stephen Smoogen
3bbc0031f4
This will add minimal monitoring for repospanner on pkgs01.stg. This only says it is running or not.
2018-12-17 15:44:31 +00:00
Kevin Fenzi
4125997ecc
fix the check_supybot_plugin to listen only for zodbot privmsg, not frigg
2018-12-15 20:52:54 +00:00
Stephen Smoogen
cc16f56e4f
maybe this will make qa02 easier?
2018-12-15 01:14:59 +00:00
Stephen Smoogen
c7deb6502a
somedays you just keep neanderthal-patching until it works
2018-11-20 00:13:00 +00:00
Randy Barlow
4422e2bb2d
Monitor fedmsg-hub-3 on Bodhi instead of fedmsg-hub.
...
Signed-off-by: Randy Barlow <randy@electronsweatshop.com >
2018-11-19 21:33:37 +00:00
Randy Barlow
ce86a667b7
Configure check_fedmsg_cp_bodhi_backend02_hub to use fedmsg-hub-3.
...
Signed-off-by: Randy Barlow <randy@electronsweatshop.com >
2018-11-19 21:29:14 +00:00
Randy Barlow
2911286a3a
Rename check_fedmsg_masher_proc to check_fedmsg_composer_proc and have it check fedmsg-hub-3.
...
Signed-off-by: Randy Barlow <randy@electronsweatshop.com >
2018-11-19 21:20:15 +00:00
Patrick Uiterwijk
505d8bbf8c
Set COPR backend to send notifs to copr
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
5fa3c6e53d
Add contact info for clime
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
52d42114c4
Allow clime to ack services
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
3fc57e699b
Enable nagios checks for ticketkey, and stop emailing puiterwijk
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org >
2018-10-20 15:36:00 +02:00