[00:00:19] [02miraheze/services] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwEj [00:00:21] [02miraheze/services] 07MirahezeSSLBot 033d03bdc - BOT: Updating services config for wikis [00:02:53] RECOVERY - cp3 Disk Space on cp3 is OK: DISK OK - free space: / 3815 MB (15% inode=93%); [00:03:44] [02miraheze/puppet] 07paladox pushed 031 commit to 03paladox-patch-2 [+0/-0/±1] 13https://git.io/JJwuk [00:03:45] [02miraheze/puppet] 07paladox 038ce8acb - Change paradox ssh key [00:03:47] [02puppet] 07paladox created branch 03paladox-patch-2 - 13https://git.io/vbiAS [00:03:48] [02puppet] 07paladox opened pull request 03#1465: Change paradox ssh key - 13https://git.io/JJwuI [00:03:59] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 25.67, 22.90, 17.71 [00:04:56] PROBLEM - tla.awiki.org - reverse DNS on sslhost is WARNING: rDNS WARNING - reverse DNS entry for tla.awiki.org could not be found [00:05:44] [02puppet] 07paladox closed pull request 03#1465: Change paradox ssh key - 13https://git.io/JJwuI [00:05:46] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwuY [00:05:47] [02miraheze/puppet] 07paladox 03de71c7a - Change paradox ssh key (#1465) [00:05:49] [02puppet] 07paladox deleted branch 03paladox-patch-2 - 13https://git.io/vbiAS [00:05:50] [02miraheze/puppet] 07paladox deleted branch 03paladox-patch-2 [00:08:48] PROBLEM - bacula2 Bacula Private Git on bacula2 is WARNING: WARNING: Full, 4480 files, 13.03MB, 2020-07-26 00:06:00 (1.1 weeks ago) [00:09:57] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 19.46, 22.28, 19.03 [00:13:58] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 38.74, 27.47, 21.46 [00:14:52] [ANNOUNCEMENT] MirahezeBot v8.0 is now released! This is our biggest release yet. Thank you all for your continued support. -The MirahezeBot team [00:15:57] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 16.17, 22.95, 20.56 [00:15:59] <-CloudGuy38-> What's the notable changes with Mirahezebot v8.0? [00:17:56] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 11.56, 18.71, 19.28 [00:42:08] > What's the notable changes with Mirahezebot v8.0? @-CloudGuy38- RhinosF1 will probably be too happy to explain the changes in v8.0, unless @Zppix wants to. Not going to ping him, though, since he's sleeping. 🙂 [00:42:34] <-CloudGuy38-> It's near my sleeping time [00:42:57] > It's near my sleeping time @-CloudGuy38- Oh, okay, sounds good. Tomorrow then? [00:43:19] <-CloudGuy38-> yes, but I will still be here until I go offline [00:46:56] okay, that's cool [01:14:01] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 23.96, 23.52, 19.72 [01:17:59] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 18.54, 19.89, 19.04 [01:20:19] [02miraheze/services] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwgt [01:20:20] [02miraheze/services] 07MirahezeSSLBot 034e77770 - BOT: Updating services config for wikis [01:41:00] RECOVERY - tla.awiki.org - LetsEncrypt on sslhost is OK: OK - Certificate 'tla.awiki.org' will expire on Sat 31 Oct 2020 11:20:36 GMT +0000. [02:05:07] Hello Wagglie! If you have any questions, feel free to ask and someone should answer soon. [02:07:13] do not ban me from this server rhinosf1 [02:15:55] ^ @Zppix [02:16:33] Hola Doug [02:16:54] Please, request global ban to joaquinito01 [02:18:26] Wagglie, your name is awfully similar to Waggle from earlier today. [02:18:47] So no one told you life was gonna be this way [02:19:27] The behavioural characteristics between your messages and Waggle's are uncanny. [02:19:53] 🦆 [02:39:09] Hello RhinoF1! If you have any questions, feel free to ask and someone should answer soon. [02:41:31] @CVT that makes two 🦆 s of an LTA, and I suspect I know who it is [02:55:43] @doug only stewards and sysadmins have op in irc [02:57:35] @Zppix Yeah, I know, but all sysadmins are GS at present, so that's why I used that [02:57:51] Not true [02:57:55] Pioneer isnt [02:58:04] no that's not what I meant [02:58:06] sorry [02:58:14] I meant all GS are sysadmins [02:58:29] Not true [02:58:36] Pioneer isnt a sysadmin [02:59:00] Me and reception are the only CVT members that are sysadmins [03:35:12] [02miraheze/services] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwwC [03:35:13] [02miraheze/services] 07MirahezeSSLBot 0320524cb - BOT: Updating services config for wikis [03:42:13] > Me and reception are the only CVT members that are sysadmins @Zppix Yeah, I know, but I think of CVT as an umbrella term that encompasses both stewards and GS. Yeah, The Pioneer is absolutely a steward and thus part of CVT, but only you and Reception123 are GS and sysadmins. [03:43:08] Plus, I didn't want to ping the system administrators group because I didn't want to wake RhinosF1 as I know he's sleeping. So thought I'd just ping the CVT group and someone who was also a sysadmin would answer that. [03:43:19] So that's my thought process. Hope that clarifies. [03:44:22] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 20.79, 18.44, 15.71 [03:46:22] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 11.69, 15.77, 15.06 [03:54:50] PROBLEM - wiki.apap04.com - LetsEncrypt on sslhost is WARNING: WARNING - Certificate 'wiki.apap04.com' expires in 15 day(s) (Wed 19 Aug 2020 03:46:05 GMT +0000). [03:57:20] PROBLEM - ping6 on cp3 is CRITICAL: PING CRITICAL - Packet loss = 100% [03:59:22] RECOVERY - ping6 on cp3 is OK: PING OK - Packet loss = 0%, RTA = 265.39 ms [03:59:41] [02miraheze/ssl] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwwH [03:59:42] [02miraheze/ssl] 07MirahezeSSLBot 033aacd68 - Bot: Update SSL cert for wiki.apap04.com [04:15:12] RECOVERY - wiki.apap04.com - LetsEncrypt on sslhost is OK: OK - Certificate 'wiki.apap04.com' will expire on Sun 01 Nov 2020 02:59:34 GMT +0000. [04:24:28] PROBLEM - mw4 Current Load on mw4 is CRITICAL: CRITICAL - load average: 8.20, 5.92, 4.64 [04:26:28] RECOVERY - mw4 Current Load on mw4 is OK: OK - load average: 4.42, 5.19, 4.53 [04:32:42] !log reception@jobrunner1:/srv/mediawiki/w/extensions/ManageWiki/maintenance$ sudo -u www-data php populateGroupPermissionsWithDefaults.php --wiki=xhoriawiki --overwrite [04:32:49] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [04:37:29] [02miraheze/ssl] 07Reception123 pushed 031 commit to 03master [+1/-0/±1] 13https://git.io/JJwr6 [04:37:30] [02miraheze/ssl] 07Reception123 03d4ac398 - add wiki.lct.app cert [04:39:09] !log reception@jobrunner1:~$ sudo -u www-data php /srv/mediawiki/w/maintenance/importDump.php --wiki=vilaryawiki /home/reception/Wikipedia-20200802155856.xml [04:39:18] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [04:40:20] PROBLEM - oecumene.org - LetsEncrypt on sslhost is WARNING: WARNING - Certificate 'oecumene.org' expires in 15 day(s) (Wed 19 Aug 2020 04:35:14 GMT +0000). [04:45:12] [02miraheze/services] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwrQ [04:45:13] [02miraheze/services] 07MirahezeSSLBot 035ff69d4 - BOT: Updating services config for wikis [04:45:55] [02miraheze/ssl] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwrd [04:45:57] [02miraheze/ssl] 07MirahezeSSLBot 03bd5d509 - Bot: Update SSL cert for oecumene.org [04:47:35] Wow, that was fast at getting a custom domain set up, @Reception123. I literally just approved lctwiki a couple hours ago, and didn't even know they wanted a custom domain. They must be familiar with getting custom domains set up. 😛 [04:47:55] @Doug well the process isn't that difficult (at least for users :P) [04:48:15] they just need to read the domain page and point their domain to us and they're done (if they choose Lets Encrypt) [04:48:16] yeah, that's true...it's well explained 🙂 [04:48:25] yep [04:48:40] there's a few more steps for us sysadmins but you get used to it [04:48:45] and it's quick [04:49:24] Looked at what you do on your end, is that literally all you have to do is just generate the LetsEncrypt certificate, modify that certs config file, and update the server settings in ManageWiki? [04:49:27] yeah [04:49:49] there's also the private key that has to be copied to puppet2 [04:49:56] ah, yes [04:49:59] (which is why only puppet-users can fully do certs) [04:50:01] that's the puppet part then [04:50:28] but before when I was mw-admin I did certs but just had to have an ops member (what SRE was called at the time) do the private cert [04:51:05] Oh and if the custom domain is pointed to us via nameservers we also need to add it to the DNS repo [04:51:08] ah, yeah, so now it's simplified since mw-admins can request puppet access [04:51:28] > Oh and if the custom domain is pointed to us via nameservers we also need to add it to the DNS repo ah, I see [04:52:03] > ah, yeah, so now it's simplified since mw-admins can request puppet access @Doug yeah when I was mw-admin I worked on getting that done [04:52:17] before the ssl GitHub repository was part of puppet but we split it so that puppet-users can handle certs too [04:53:06] ah, cool [04:53:26] there's also a group called "cache-admins" that can get access to cache proxy servers but it's not used atm as it's not very necessary [04:53:39] PROBLEM - jobrunner1 Current Load on jobrunner1 is CRITICAL: CRITICAL - load average: 9.82, 9.02, 6.20 [04:54:12] yeah... I could see that, as there's always usually one SRE online and you've got multiple cache proxy servers [04:54:28] if one fails, I assume traffic is just rerouted through the others? [04:54:29] and in theory if we had a lot of sysadmins there could be more groups for individual access to different areas (like phab-admins, mail-admins, redis-admins, etc.) [04:54:48] > if one fails, I assume traffic is just rerouted through the others? @Doug yeah normally it should depool itself and everything goes through the others [04:55:03] yeah, mail-admins wouldn't be necessary since only staff and board have e-mail, and I doubt Miraheze wants to get into e-mail hosting [04:55:09] but it might need a reboot (which cache-admins couldn't do anyway) [04:55:36] PROBLEM - jobrunner1 Current Load on jobrunner1 is WARNING: WARNING - load average: 4.95, 7.67, 6.06 [04:55:46] > yeah, mail-admins wouldn't be necessary since only staff and board have e-mail, and I doubt Miraheze wants to get into e-mail hosting @Doug yeah the WMF have all sorts of different groups but for now we definitely don't need more [04:57:35] PROBLEM - jobrunner1 Current Load on jobrunner1 is CRITICAL: CRITICAL - load average: 6.39, 8.04, 6.43 [04:59:32] RECOVERY - jobrunner1 Current Load on jobrunner1 is OK: OK - load average: 3.73, 6.48, 6.04 [05:00:34] RECOVERY - oecumene.org - LetsEncrypt on sslhost is OK: OK - Certificate 'oecumene.org' will expire on Sun 01 Nov 2020 03:45:49 GMT +0000. [05:05:21] PROBLEM - jobrunner1 Current Load on jobrunner1 is CRITICAL: CRITICAL - load average: 6.24, 8.70, 7.26 [05:07:19] PROBLEM - jobrunner1 Current Load on jobrunner1 is WARNING: WARNING - load average: 3.33, 6.90, 6.77 [05:12:45] yeah...well I'm going to head to bed now, a bit earlier than usual but am tired 😛 [05:13:12] PROBLEM - jobrunner1 Current Load on jobrunner1 is CRITICAL: CRITICAL - load average: 10.21, 8.51, 7.50 [05:21:03] PROBLEM - jobrunner1 Current Load on jobrunner1 is WARNING: WARNING - load average: 5.92, 7.55, 7.61 [05:22:34] PROBLEM - meta.orain.org - reverse DNS on sslhost is WARNING: rDNS WARNING - reverse DNS entry for meta.orain.org could not be found [05:23:00] PROBLEM - jobrunner1 Current Load on jobrunner1 is CRITICAL: CRITICAL - load average: 8.55, 7.95, 7.74 [05:24:58] PROBLEM - jobrunner1 Current Load on jobrunner1 is WARNING: WARNING - load average: 6.38, 7.92, 7.79 [05:26:56] PROBLEM - jobrunner1 Current Load on jobrunner1 is CRITICAL: CRITICAL - load average: 6.32, 8.37, 8.03 [05:29:29] RECOVERY - meta.orain.org - reverse DNS on sslhost is OK: rDNS OK - meta.orain.org reverse DNS resolves to cp7.miraheze.org [05:34:46] PROBLEM - jobrunner1 Current Load on jobrunner1 is WARNING: WARNING - load average: 5.30, 7.66, 8.00 [05:36:45] PROBLEM - jobrunner1 Current Load on jobrunner1 is CRITICAL: CRITICAL - load average: 6.40, 8.35, 8.29 [05:50:45] PROBLEM - jobrunner1 Current Load on jobrunner1 is WARNING: WARNING - load average: 4.57, 6.33, 7.86 [05:52:44] Reception123: what's on jobrunner1 causing load? [05:52:45] PROBLEM - jobrunner1 Current Load on jobrunner1 is CRITICAL: CRITICAL - load average: 9.86, 7.69, 8.16 [05:54:43] RhinosF1: well I'm running an import but it shouldn't cause such a high load [05:55:04] Reception123: check htop/top [05:56:13] RhinosF1: seems to be the import yeah [05:57:33] !log nice -5 for the import logged above [05:57:39] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [05:59:48] !log re-run GDPR scripts [05:59:52] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [06:00:46] PROBLEM - jobrunner1 Current Load on jobrunner1 is WARNING: WARNING - load average: 5.25, 7.08, 7.99 [06:06:56] @Zppix Thanks for your response on my talk, I was asleep. [06:10:33] Also miraheze.org doesn't work for me. [06:17:22] PROBLEM - mail1 Puppet on mail1 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Package[php7.3-apcu] [06:18:23] @MrJaroslavik: just seen [06:18:30] !sre Miraheze.org is down [06:18:49] RhinosF1: urgh, probably a syntax error [06:19:08] Reception123: can you look? Not even out of bed yet [06:19:18] yeah I'll take a look in a few [06:20:45] RECOVERY - jobrunner1 Current Load on jobrunner1 is OK: OK - load average: 5.62, 6.13, 6.65 [06:25:23] RECOVERY - mail1 Puppet on mail1 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [06:48:38] RhinosF1: I doubt the period is supposed to be on the right side of the ''... https://github.com/miraheze/landing/commit/67570adb286ca186720070672c32a0335db7c169#diff-8fc97db3f912c2edd5adacb64fc8b3acR172 [06:48:39] [ Add Czech Translation (2) (#40) · miraheze/landing@67570ad · GitHub ] - github.com [06:48:52] but it seems to be done like that above too [06:50:39] Reception123: hmm [06:55:54] RhinosF1: oh it seems to be the lack of period that's problematic [06:56:00] I didn't know those worked with php [06:56:23] Ok [06:56:30] [02miraheze/landing] 07Reception123 pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwik [06:56:32] [02miraheze/landing] 07Reception123 034712eb6 - attempt to fix syntax error [06:57:19] miraheze/landing/master/4712eb6 - Reception123 The build was fixed. https://travis-ci.org/miraheze/landing/builds/714372829 [06:57:40] ^ RhinosF1 yay! [07:00:26] RhinosF1: I don't see why travis wasn't checking in the first place though [07:02:40] It doesn't seem back yet [07:05:11] Reception123: hmm [07:05:19] RhinosF1: back now [07:05:35] But I still don't see what happened with Travis before the merge [07:05:40] Ok [07:05:54] We need to make sure it works [07:05:58] Travis seems to be playing up on the Miraheze org [07:10:19] * Reception123 wonders why [07:10:41] But remember I saw it happen on mw-config too [07:15:01] Reception123: it's only the Miraheze org [07:15:11] I guess we need to refresh it [07:15:49] It was only slow on MirahezeBots because it's has 4 test builds to run and then it has to wait for them before doing the deploy build [07:17:39] Ok then [07:17:45] Do you know how to refresh it? [07:20:00] Let me ask [07:22:01] This happens with the sopel devs often [07:40:14] ok [07:42:45] PROBLEM - mw5 Current Load on mw5 is WARNING: WARNING - load average: 7.97, 5.82, 4.95 [07:44:46] PROBLEM - mw5 Current Load on mw5 is CRITICAL: CRITICAL - load average: 9.10, 7.13, 5.55 [07:46:46] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 3.43, 5.74, 5.24 [07:54:30] [02miraheze/mediawiki] 07Reception123 pushed 031 commit to 03REL1_34 [+1/-0/±1] 13https://git.io/JJwXW [07:54:31] [02miraheze/mediawiki] 07Reception123 03f7bc202 - Install MobileTagsPlugin ext. [07:54:33] PROBLEM - mw4 Current Load on mw4 is CRITICAL: CRITICAL - load average: 10.43, 7.18, 5.57 [07:54:45] PROBLEM - mw5 Current Load on mw5 is CRITICAL: CRITICAL - load average: 8.63, 7.26, 6.00 [07:56:33] RECOVERY - mw4 Current Load on mw4 is OK: OK - load average: 5.44, 6.50, 5.53 [07:56:45] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 5.25, 6.61, 5.93 [08:02:31] PROBLEM - mw4 Current Load on mw4 is WARNING: WARNING - load average: 7.78, 6.49, 5.72 [08:02:44] PROBLEM - mw5 Current Load on mw5 is CRITICAL: CRITICAL - load average: 8.30, 6.83, 6.10 [08:03:05] Reception123: what's up with load? [08:03:13] I'm not doing anything [08:03:28] PROBLEM - jobrunner1 Puppet on jobrunner1 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [08:03:46] Reception123: https://grafana.miraheze.org/d/W9MIkA7iz/miraheze-cluster?viewPanel=287&orgId=1&var-job=node&var-node=mw5.miraheze.org&var-port=9100 [08:03:46] PROBLEM - cloud1 Current Load on cloud1 is WARNING: WARNING - load average: 22.71, 19.37, 16.96 [08:03:47] [ Grafana ] - grafana.miraheze.org [08:03:52] RhinosF1: yeah it must be puppet being annoying when adding a new submodule [08:03:52] What the **** [08:03:54] you know the drill [08:04:02] Reception123: no that disk usage [08:04:13] That's a huge spike [08:04:31] PROBLEM - mw4 Current Load on mw4 is CRITICAL: CRITICAL - load average: 8.32, 7.46, 6.19 [08:04:35] hm that's very odd [08:04:42] Reception123: very [08:04:44] PROBLEM - mw5 Current Load on mw5 is WARNING: WARNING - load average: 6.67, 6.98, 6.26 [08:04:53] Something doesn't add up with this load [08:05:09] it makes no sense [08:05:19] why is it that I can never do one thing without being bothered by icinga-miraheze [08:05:28] PROBLEM - mw5 Puppet on mw5 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 2 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [08:05:41] Reception123: can you scan top + access logs to make sure everything looks good [08:05:46] RECOVERY - cloud1 Current Load on cloud1 is OK: OK - load average: 15.89, 18.23, 16.86 [08:05:52] RhinosF1: yeah was doing that [08:06:28] Good [08:06:29] PROBLEM - mw4 Puppet on mw4 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [08:06:31] RECOVERY - mw4 Current Load on mw4 is OK: OK - load average: 5.17, 6.58, 6.02 [08:06:52] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 4.59, 6.05, 6.01 [08:07:05] ^ RhinosF1 that seems to be fine, was almost definitely puppet [08:07:09] but the space I don't get [08:07:23] Reception123: it's usage [08:07:35] Something was trying to use the disk a lot [08:11:58] PROBLEM - test2 Puppet on test2 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 2 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [08:12:33] PROBLEM - mw6 Puppet on mw6 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 2 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [08:12:42] PROBLEM - mw7 Puppet on mw7 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [08:13:41] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 20.74, 20.18, 16.42 [08:15:33] RECOVERY - mw5 Puppet on mw5 is OK: OK: Puppet is currently enabled, last run 49 seconds ago with 0 failures [08:15:38] RECOVERY - jobrunner1 Puppet on jobrunner1 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [08:15:41] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 17.12, 19.09, 16.48 [08:16:28] RECOVERY - mw4 Puppet on mw4 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [08:19:42] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 19.74, 24.10, 19.57 [08:21:43] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 14.31, 20.41, 18.78 [08:23:44] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 11.56, 17.95, 18.11 [08:23:47] PROBLEM - mw5 Current Load on mw5 is WARNING: WARNING - load average: 7.71, 5.74, 5.47 [08:24:31] RECOVERY - mw6 Puppet on mw6 is OK: OK: Puppet is currently enabled, last run 16 seconds ago with 0 failures [08:25:47] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 4.49, 5.39, 5.39 [08:26:01] RECOVERY - test2 Puppet on test2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [08:28:42] RECOVERY - mw7 Puppet on mw7 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [08:28:44] [02puppet] 07RhinosF1 opened pull request 03#1466: 403 clients with no UserAgent - 13https://git.io/JJwM2 [08:28:51] Reception123: ^ [08:30:40] [02puppet] 07Reception123 commented on pull request 03#1466: 403 clients with no UserAgent - 13https://git.io/JJwM6 [08:30:56] [02puppet] 07Reception123 closed pull request 03#1466: 403 clients with no UserAgent - 13https://git.io/JJwM2 [08:30:58] [02miraheze/puppet] 07Reception123 pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwMP [08:30:59] [02miraheze/puppet] 07RhinosF1 03076f218 - 429 clients with no UserAgent (#1466) [08:33:45] PROBLEM - mw5 Current Load on mw5 is WARNING: WARNING - load average: 7.69, 5.97, 5.44 [08:33:59] [02miraheze/mw-config] 07Reception123 pushed 031 commit to 03Reception123-patch-1 [+0/-0/±4] 13https://git.io/JJwM5 [08:34:01] [02miraheze/mw-config] 07Reception123 03fce5110 - Add MobileTabsPlugin ext. T5999 [08:34:02] [02mw-config] 07Reception123 created branch 03Reception123-patch-1 - 13https://git.io/vbvb3 [08:34:22] [02mw-config] 07Reception123 opened pull request 03#3202: Add MobileTabsPlugin ext. T5999 - 13https://git.io/JJwMd [08:35:15] [02puppet] 07RhinosF1 commented on pull request 03#1466: 403 clients with no UserAgent - 13https://git.io/JJwMp [08:35:44] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 3.58, 5.20, 5.24 [08:44:41] New accounts flood? [08:44:42] https://cdn.discordapp.com/attachments/435711390544560128/739765673365995580/IMG_20200803_104419.jpg [08:45:07] !log sudo service nginx reload on cp3 [08:45:43] @MrJaroslavik hmm, if we see another probably need to block for them to calm down [08:46:04] PROBLEM - mw4 MediaWiki Rendering on mw4 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.011 second response time [08:46:05] PROBLEM - mw5 MediaWiki Rendering on mw5 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.007 second response time [08:46:17] PROBLEM - gluster1 Puppet on gluster1 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 2 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[/mnt/mediawiki-static] [08:46:27] PROBLEM - test2 MediaWiki Rendering on test2 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.005 second response time [08:46:43] Reception123: revert ^ [08:46:46] PROBLEM - mw6 MediaWiki Rendering on mw6 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.105 second response time [08:46:53] PROBLEM - cp7 HTTPS on cp7 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.007 second response time [08:47:08] [02miraheze/puppet] 07Reception123 pushed 031 commit to 03revert-1466-patch-15 [+0/-0/±1] 13https://git.io/JJwyL [08:47:10] [02miraheze/puppet] 07Reception123 030ea76b2 - Revert "429 clients with no UserAgent (#1466)" This reverts commit 076f218106bed668f827724be4be0de766d3b96c. [08:47:11] [02puppet] 07Reception123 created branch 03revert-1466-patch-15 - 13https://git.io/vbiAS [08:47:13] [02puppet] 07Reception123 opened pull request 03#1467: Revert "403 clients with no UserAgent" - 13https://git.io/JJwyq [08:47:18] [02puppet] 07Reception123 closed pull request 03#1467: Revert "403 clients with no UserAgent" - 13https://git.io/JJwyq [08:47:19] [02miraheze/puppet] 07Reception123 pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwym [08:47:21] [02miraheze/puppet] 07Reception123 035cdca2b - Revert "429 clients with no UserAgent (#1466)" (#1467) This reverts commit 076f218106bed668f827724be4be0de766d3b96c. [08:47:24] PROBLEM - jobrunner1 MediaWiki Rendering on jobrunner1 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.008 second response time [08:47:29] PROBLEM - mw7 MediaWiki Rendering on mw7 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.004 second response time [08:47:40] Reception123: force puppet everywhere please [08:47:44] yes [08:47:47] PROBLEM - cp6 HTTPS on cp6 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.006 second response time [08:47:56] PROBLEM - cp9 HTTPS on cp9 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 0.332 second response time [08:48:47] PROBLEM - cp3 HTTPS on cp3 is WARNING: HTTP WARNING: HTTP/1.1 403 Forbidden - 338 bytes in 1.006 second response time [08:51:38] Reception123: has puppet ran and nginx been reloaded? If so, can we force icinga to recheck? [08:51:54] yes will try to force [08:52:58] still running on cp3 since it's slow [08:54:04] RECOVERY - mw4 MediaWiki Rendering on mw4 is OK: HTTP OK: HTTP/1.1 200 OK - 18546 bytes in 0.019 second response time [08:54:05] RECOVERY - mw5 MediaWiki Rendering on mw5 is OK: HTTP OK: HTTP/1.1 200 OK - 18546 bytes in 0.007 second response time [08:54:17] RECOVERY - jobrunner1 MediaWiki Rendering on jobrunner1 is OK: HTTP OK: HTTP/1.1 200 OK - 18545 bytes in 0.009 second response time [08:54:19] there it goes [08:54:21] RECOVERY - cp6 HTTPS on cp6 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 1936 bytes in 0.004 second response time [08:54:22] RECOVERY - mw7 MediaWiki Rendering on mw7 is OK: HTTP OK: HTTP/1.1 200 OK - 18546 bytes in 0.006 second response time [08:54:23] RECOVERY - mw6 MediaWiki Rendering on mw6 is OK: HTTP OK: HTTP/1.1 200 OK - 18546 bytes in 0.006 second response time [08:54:24] RECOVERY - cp7 HTTPS on cp7 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 1936 bytes in 0.021 second response time [08:54:27] RECOVERY - test2 MediaWiki Rendering on test2 is OK: HTTP OK: HTTP/1.1 200 OK - 18546 bytes in 0.007 second response time [09:01:25] RECOVERY - bacula2 Bacula Private Git on bacula2 is OK: OK: Full, 4526 files, 13.14MB, 2020-08-03 09:00:00 (1.4 minutes ago) [09:05:24] RECOVERY - cp9 HTTPS on cp9 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 1936 bytes in 0.332 second response time [09:09:23] RECOVERY - cp3 HTTPS on cp3 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 1935 bytes in 1.024 second response time [09:21:46] [02puppet] 07doraemonfan2020 opened pull request 03#1468: Add new headers - 13https://git.io/JJw9n [10:26:26] PROBLEM - wiki.ciptamedia.org - LetsEncrypt on sslhost is WARNING: WARNING - Certificate 'wiki.ciptamedia.org' expires in 15 day(s) (Wed 19 Aug 2020 10:21:38 GMT +0000). [10:33:42] [02miraheze/ssl] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwdp [10:33:43] [02miraheze/ssl] 07MirahezeSSLBot 03674673b - Bot: Update SSL cert for wiki.ciptamedia.org [10:34:37] PROBLEM - adadevelopersacademy.wiki - LetsEncrypt on sslhost is WARNING: WARNING - Certificate 'adadevelopersacademy.wiki' expires in 15 day(s) (Wed 19 Aug 2020 10:32:02 GMT +0000). [10:41:21] [02miraheze/ssl] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwFn [10:41:23] [02miraheze/ssl] 07MirahezeSSLBot 033034ddd - Bot: Update SSL cert for adadevelopersacademy.wiki [10:43:36] PROBLEM - wiki.gesamtschule-nordkirchen.de - LetsEncrypt on sslhost is WARNING: WARNING - Certificate 'wiki.gesamtschule-nordkirchen.de' expires in 15 day(s) (Wed 19 Aug 2020 10:34:54 GMT +0000). [10:43:54] [02miraheze/ssl] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwFB [10:43:55] [02miraheze/ssl] 07MirahezeSSLBot 032e89153 - Bot: Update SSL cert for wiki.gesamtschule-nordkirchen.de [10:46:37] RECOVERY - wiki.ciptamedia.org - LetsEncrypt on sslhost is OK: OK - Certificate 'wiki.ciptamedia.org' will expire on Sun 01 Nov 2020 09:33:35 GMT +0000. [10:55:12] RECOVERY - adadevelopersacademy.wiki - LetsEncrypt on sslhost is OK: OK - Certificate 'adadevelopersacademy.wiki' will expire on Sun 01 Nov 2020 09:41:15 GMT +0000. [10:57:06] RECOVERY - wiki.gesamtschule-nordkirchen.de - LetsEncrypt on sslhost is OK: OK - Certificate 'wiki.gesamtschule-nordkirchen.de' will expire on Sun 01 Nov 2020 09:43:47 GMT +0000. [10:58:46] [02miraheze/mw-config] 07Reception123 pushed 031 commit to 03Reception123-patch-2 [+0/-0/±1] 13https://git.io/JJwb3 [10:58:48] [02miraheze/mw-config] 07Reception123 0344770ab - add wikiapiary footer for circleyversewiki per req requested via Discord [10:58:49] [02mw-config] 07Reception123 created branch 03Reception123-patch-2 - 13https://git.io/vbvb3 [10:58:51] [02mw-config] 07Reception123 opened pull request 03#3203: add wikiapiary footer for circleyversewiki per req - 13https://git.io/JJwbs [10:59:07] ^ RhinosF1 yup annoying travis is gone [10:59:13] now that it's gone I miss it [10:59:21] Heh [11:02:07] [02mw-config] 07Reception123 synchronize pull request 03#3203: add wikiapiary footer for circleyversewiki per req - 13https://git.io/JJwbs [11:02:08] [02miraheze/mw-config] 07Reception123 pushed 031 commit to 03Reception123-patch-2 [+0/-0/±1] 13https://git.io/JJwbC [11:02:10] [02miraheze/mw-config] 07Reception123 03b9065ce - fix link [11:02:28] [02mw-config] 07Reception123 closed pull request 03#3203: add wikiapiary footer for circleyversewiki per req - 13https://git.io/JJwbs [11:02:29] [02miraheze/mw-config] 07Reception123 pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJwbW [11:02:31] [02miraheze/mw-config] 07Reception123 037f4ad08 - add wikiapiary footer for circleyversewiki per req (#3203) * add wikiapiary footer for circleyversewiki per req requested via Discord * fix link [11:02:32] [02mw-config] 07Reception123 deleted branch 03Reception123-patch-2 - 13https://git.io/vbvb3 [11:02:34] [02miraheze/mw-config] 07Reception123 deleted branch 03Reception123-patch-2 [11:02:44] RhinosF1: a repo without travis is not very good, did you get a chance to ask the sopel folks? [11:03:05] * Reception123 remembers the dark times before we had travis [11:03:49] miraheze/mw-config/Reception123-patch-2/b9065ce - Reception123 The build has errored. https://travis-ci.org/miraheze/mw-config/builds/714429902 [11:04:02] uh that's not what my linter said... [11:04:35] RhinosF1: now travis is lying to me [11:27:17] Reception123: ha! [11:28:01] [02miraheze/puppet] 07Reception123 pushed 031 commit to 03Reception123-patch-1 [+0/-0/±1] 13https://git.io/JJwNF [11:28:02] [02miraheze/puppet] 07Reception123 03d1b7fe7 - whitelist wikiapiary.com ran by trusted MediaWiki contributor Kghbln, needs one more SRE approval [11:28:04] [02puppet] 07Reception123 created branch 03Reception123-patch-1 - 13https://git.io/vbiAS [11:28:08] [02puppet] 07Reception123 opened pull request 03#1469: whitelist wikiapiary.com - 13https://git.io/JJwNb [12:14:40] PROBLEM - mw5 Current Load on mw5 is WARNING: WARNING - load average: 7.83, 5.75, 4.53 [12:16:40] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 3.83, 4.85, 4.34 [13:53:55] [02puppet] 07paladox closed pull request 03#1468: Add new headers - 13https://git.io/JJw9n [13:55:23] paladox: what would that have even done? [13:57:43] I would guess break it, this is the same user that keeps creating pulls for varnish and then we close them. So I think it’s JoeA. [13:59:48] paladox: oh [14:00:06] paladox: did you see the tech@ email [14:00:55] Reception123: how did we not spot who that is [14:01:40] What email? [14:04:03] paladox: high load + disk earlier and a lot of traffic from what looks to be bad bot/crawler with UA of - attempt to block it failed and had to be reverted [14:04:13] Any way to handle it? [14:06:05] You would just block it’s UA [14:06:11] that's the thing [14:06:13] see -staff [14:09:54] paladox: see my commit to puppet, we tried, it blocked icinga and many users [14:20:05] var ~ "-" looks to be a regex match, which matches any string including the "-" character. To match only that character using ~, the expression must be "^-$" [14:22:44] ^ [14:22:45] RhinosF1: Reception123 try ^ [14:22:47] You can test using nginx [14:22:48] On test2 [14:23:10] Reception123: ^ [14:23:18] paladox: how would I test exactly? [14:57:11] Reception123 you would just edit the mediawiki file under nginx on test2 [14:57:13] and restart it [14:58:03] paladox: OK, and then? [14:58:44] wait a bit, as test2 is in varnish [14:58:51] so if it fails you will get the same issue [15:01:23] Ok [15:51:30] [02miraheze/MirahezeMagic] 07RhinosF1 pushed 031 commit to 03RhinosF1-patch-2 [+0/-0/±1] 13https://github.com/miraheze/MirahezeMagic/commit/5636b02cbe15 [15:51:31] [02miraheze/MirahezeMagic] 07RhinosF1 035636b02 - Update removePII.php [15:51:33] [02MirahezeMagic] 07RhinosF1 created branch 03RhinosF1-patch-2 - 13https://git.io/fQRGX [15:51:34] [02MirahezeMagic] 07RhinosF1 opened pull request 03#140: Update removePII.php - 13https://git.io/JJrPK [15:52:30] [02MirahezeMagic] 07paladox edited pull request 03#140: Use old name when deleting user page - 13https://git.io/JJrPK [15:52:31] miraheze/MirahezeMagic/RhinosF1-patch-2/5636b02 - RhinosF1 The build passed. https://travis-ci.com/miraheze/MirahezeMagic/builds/178282075 [15:53:13] [02MirahezeMagic] 07paladox closed pull request 03#140: Use old name when deleting user page - 13https://git.io/JJrPK [15:53:14] [02miraheze/MirahezeMagic] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrX3 [15:53:16] [02miraheze/MirahezeMagic] 07RhinosF1 0388d6171 - Update removePII.php (#140) [15:53:18] [02miraheze/MirahezeMagic] 07paladox deleted branch 03RhinosF1-patch-2 [15:53:19] [02MirahezeMagic] 07paladox deleted branch 03RhinosF1-patch-2 - 13https://git.io/fQRGX [15:53:32] [02miraheze/puppet] 07Reception123 pushed 031 commit to 03revert-1467-revert-1466-patch-15 [+0/-0/±1] 13https://git.io/JJrXl [15:53:34] [02miraheze/puppet] 07Reception123 03be0c861 - Revert "Revert "429 clients with no UserAgent (#1466)" (#1467)" This reverts commit 5cdca2bf2033856255beef9d844408c41c47adb5. [15:53:36] [02puppet] 07Reception123 created branch 03revert-1467-revert-1466-patch-15 - 13https://git.io/vbiAS [15:53:37] [02puppet] 07Reception123 opened pull request 03#1470: Revert "Revert "403 clients with no UserAgent"" - 13https://git.io/JJrX4 [15:54:35] [02puppet] 07Reception123 synchronize pull request 03#1470: Revert "Revert "403 clients with no UserAgent"" - 13https://git.io/JJrX4 [15:54:37] [02miraheze/puppet] 07Reception123 pushed 031 commit to 03revert-1467-revert-1466-patch-15 [+0/-0/±1] 13https://git.io/JJrXj [15:54:38] [02miraheze/puppet] 07Reception123 0387c74b9 - Update mediawiki.conf [15:54:40] [02puppet] 07paladox closed pull request 03#1469: whitelist wikiapiary.com - 13https://git.io/JJwNb [15:54:41] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJr1f [15:54:43] [02miraheze/puppet] 07Reception123 036e9eb28 - whitelist wikiapiary.com (#1469) ran by trusted MediaWiki contributor Kghbln, needs one more SRE approval [15:54:45] [02miraheze/puppet] 07paladox deleted branch 03Reception123-patch-1 [15:54:46] [02puppet] 07paladox deleted branch 03Reception123-patch-1 - 13https://git.io/vbiAS [15:55:10] [02puppet] 07paladox closed pull request 03#1463: check_reverse_dns: handle an exception - 13https://git.io/JJ29M [15:55:12] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJr1q [15:55:13] [02miraheze/puppet] 07RhinosF1 03b706397 - check_reverse_dns: handle an exception (#1463) * check_reverse_dns: handle an exception should make https://icinga.miraheze.org/monitoring/event/show?type=notify&id=134112 more clear, a bit of a proof of concept without tests but it works in my sleepy head. * check_reverse_dns: exit on NoNameservers [15:55:14] [ Icinga Web 2 Login ] - icinga.miraheze.org [15:59:41] PROBLEM - cp7 Stunnel Http for test2 on cp7 is CRITICAL: HTTP CRITICAL - No data received from host [15:59:54] PROBLEM - test2 HTTPS on test2 is CRITICAL: connect to address 51.77.107.211 and port 443: Connection refusedHTTP CRITICAL - Unable to open TCP socket [15:59:55] Reception123: ? [16:00:04] PROBLEM - cp6 Stunnel Http for test2 on cp6 is CRITICAL: HTTP CRITICAL - No data received from host [16:00:16] RhinosF1: yes that's me on test2 [16:00:24] see -staff [16:00:42] PROBLEM - cp3 Stunnel Http for test2 on cp3 is CRITICAL: HTTP CRITICAL - No data received from host [16:01:25] PROBLEM - cp9 Stunnel Http for test2 on cp9 is CRITICAL: HTTP CRITICAL - No data received from host [16:01:38] Reception123: i'll push MM update, can you please rerun PII script? [16:01:47] k [16:02:43] RECOVERY - cp3 Stunnel Http for test2 on cp3 is OK: HTTP OK: HTTP/1.1 200 OK - 15655 bytes in 1.019 second response time [16:03:26] RECOVERY - cp9 Stunnel Http for test2 on cp9 is OK: HTTP OK: HTTP/1.1 200 OK - 15655 bytes in 0.320 second response time [16:03:41] RECOVERY - cp7 Stunnel Http for test2 on cp7 is OK: HTTP OK: HTTP/1.1 200 OK - 15669 bytes in 0.007 second response time [16:03:54] RECOVERY - test2 HTTPS on test2 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 561 bytes in 0.006 second response time [16:05:22] PROBLEM - gluster1 Current Load on gluster1 is WARNING: WARNING - load average: 7.22, 4.96, 3.09 [16:06:02] [02miraheze/puppet] 07paladox pushed 031 commit to 03paladox-patch-2 [+0/-0/±1] 13https://git.io/JJr1x [16:06:03] RECOVERY - cp6 Stunnel Http for test2 on cp6 is OK: HTTP OK: HTTP/1.1 200 OK - 15655 bytes in 0.005 second response time [16:06:04] [02miraheze/puppet] 07paladox 03da1f22f - 403 clients with no UserAgent [16:06:05] [02puppet] 07paladox created branch 03paladox-patch-2 - 13https://git.io/vbiAS [16:06:07] [02puppet] 07paladox opened pull request 03#1471: 403 clients with no UserAgent - 13https://git.io/JJr1p [16:06:38] paladox: it's not blank though, just - [16:07:22] RECOVERY - gluster1 Current Load on gluster1 is OK: OK - load average: 3.68, 4.35, 3.09 [16:08:54] [02puppet] 07Reception123 commented on pull request 03#1470: Revert "Revert "403 clients with no UserAgent"" - 13https://git.io/JJrMk [16:08:56] [02puppet] 07Reception123 closed pull request 03#1470: Revert "Revert "403 clients with no UserAgent"" - 13https://git.io/JJrX4 [16:08:57] [02puppet] 07Reception123 deleted branch 03revert-1467-revert-1466-patch-15 - 13https://git.io/vbiAS [16:08:59] [02miraheze/puppet] 07Reception123 deleted branch 03revert-1467-revert-1466-patch-15 [16:09:08] [02puppet] 07paladox closed pull request 03#1471: 403 clients with no UserAgent - 13https://git.io/JJr1p [16:09:10] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrML [16:09:11] [02miraheze/puppet] 07paladox 03ef3920f - 403 clients with no UserAgent (#1471) [16:09:13] [02miraheze/puppet] 07paladox deleted branch 03paladox-patch-2 [16:09:14] [02puppet] 07paladox deleted branch 03paladox-patch-2 - 13https://git.io/vbiAS [16:11:16] PROBLEM - cp9 Puppet on cp9 is WARNING: WARNING: Puppet is currently disabled, message: paladox, last run 7 minutes ago with 0 failures [16:12:01] PROBLEM - cp7 Puppet on cp7 is WARNING: WARNING: Puppet is currently disabled, message: paladox, last run 8 minutes ago with 0 failures [16:12:14] PROBLEM - cp6 Puppet on cp6 is WARNING: WARNING: Puppet is currently disabled, message: paladox, last run 8 minutes ago with 0 failures [16:22:15] [02miraheze/mediawiki] 07RhinosF1 pushed 031 commit to 03REL1_34 [+0/-0/±1] 13https://git.io/JJrD0 [16:22:16] [02miraheze/mediawiki] 07RhinosF1 03340cb61 - MM [16:22:20] Reception123: ^ [16:22:32] will ru [16:22:34] *run [16:22:37] !log re-run the GDPR scripts [16:22:44] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:24:59] [02miraheze/puppet] 07paladox pushed 031 commit to 03paladox-patch-2 [+0/-0/±1] 13https://git.io/JJrDo [16:25:01] [02miraheze/puppet] 07paladox 038e19175 - Fix user agent [16:25:02] [02puppet] 07paladox created branch 03paladox-patch-2 - 13https://git.io/vbiAS [16:25:04] [02puppet] 07paladox opened pull request 03#1472: Fix user agent - 13https://git.io/JJrDK [16:25:49] Reception123 you updated mediawiki did you first on the mw*? [16:26:06] [02puppet] 07paladox closed pull request 03#1472: Fix user agent - 13https://git.io/JJrDK [16:26:08] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrDD [16:26:09] [02miraheze/puppet] 07paladox 03a06fa58 - Fix user agent (#1472) [16:26:10] [02puppet] 07paladox deleted branch 03paladox-patch-2 - 13https://git.io/vbiAS [16:26:12] [02miraheze/puppet] 07paladox deleted branch 03paladox-patch-2 [16:26:21] paladox: hm? I didn't update mediawiki RhinosF1 did [16:26:41] Reception123: on jobrunner1 [16:26:45] You ran puppet? [16:26:54] it was still running so I'm running again [16:27:02] Ok [16:27:17] RECOVERY - cp9 Puppet on cp9 is OK: OK: Puppet is currently enabled, last run 2 minutes ago with 0 failures [16:28:01] RECOVERY - cp7 Puppet on cp7 is OK: OK: Puppet is currently enabled, last run 2 minutes ago with 0 failures [16:28:13] RECOVERY - cp6 Puppet on cp6 is OK: OK: Puppet is currently enabled, last run 2 minutes ago with 0 failures [16:30:42] PROBLEM - mw7 Puppet on mw7 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 2 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [16:31:24] PROBLEM - test2 Puppet on test2 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 2 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [16:31:27] PROBLEM - mw5 Puppet on mw5 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 2 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [16:32:28] PROBLEM - mw4 Puppet on mw4 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [16:32:31] PROBLEM - mw6 Puppet on mw6 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [16:32:31] PROBLEM - mw4 Current Load on mw4 is CRITICAL: CRITICAL - load average: 8.78, 6.59, 4.93 [16:33:27] PROBLEM - mw5 Current Load on mw5 is CRITICAL: CRITICAL - load average: 8.29, 6.64, 5.17 [16:34:17] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 23.17, 19.81, 16.31 [16:34:27] RECOVERY - mw4 Puppet on mw4 is OK: OK: Puppet is currently enabled, last run 14 seconds ago with 0 failures [16:34:31] RECOVERY - mw4 Current Load on mw4 is OK: OK - load average: 5.36, 6.08, 4.95 [16:35:25] RECOVERY - test2 Puppet on test2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [16:35:26] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 4.02, 5.76, 5.04 [16:35:28] RECOVERY - mw5 Puppet on mw5 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [16:36:15] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 14.86, 18.42, 16.25 [16:38:30] RECOVERY - mw6 Puppet on mw6 is OK: OK: Puppet is currently enabled, last run 51 seconds ago with 0 failures [16:38:42] RECOVERY - mw7 Puppet on mw7 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [16:38:44] PROBLEM - cp3 HTTPS on cp3 is CRITICAL: connect to address 128.199.139.216 and port 443: Connection refusedHTTP CRITICAL - Unable to open TCP socket [16:39:10] aware [16:40:45] RECOVERY - cp3 HTTPS on cp3 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 1946 bytes in 1.009 second response time [16:40:53] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJryV [16:40:55] [02miraheze/puppet] 07paladox 030390e9b - Block empty user agent [16:43:27] PROBLEM - mw5 Current Load on mw5 is CRITICAL: CRITICAL - load average: 10.63, 6.88, 5.46 [16:45:28] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 5.60, 6.45, 5.49 [16:47:04] [02miraheze/dns] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJryS [16:47:05] [02miraheze/dns] 07paladox 03ce5aa36 - Add jobrunner2 to dns [17:10:21] [02miraheze/services] 07MirahezeSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJr9U [17:10:22] [02miraheze/services] 07MirahezeSSLBot 0312db2e7 - BOT: Updating services config for wikis [17:14:32] PROBLEM - mw4 Current Load on mw4 is WARNING: WARNING - load average: 7.22, 5.77, 5.13 [17:16:33] RECOVERY - mw4 Current Load on mw4 is OK: OK - load average: 3.82, 4.96, 4.91 [17:19:43] [02miraheze/puppet] 07paladox pushed 031 commit to 03paladox-patch-2 [+0/-0/±1] 13https://git.io/JJr92 [17:19:44] [02miraheze/puppet] 07paladox 036065e39 - Add jobrunner2 [17:19:46] [02puppet] 07paladox created branch 03paladox-patch-2 - 13https://git.io/vbiAS [17:19:47] [02puppet] 07paladox opened pull request 03#1473: Add jobrunner2 - 13https://git.io/JJr9a [17:20:49] [02miraheze/puppet] 07paladox pushed 031 commit to 03paladox-patch-2 [+1/-0/±0] 13https://git.io/JJr9K [17:20:51] [02miraheze/puppet] 07paladox 0304fde4f - Create jobrunner2.yaml [17:20:52] [02puppet] 07paladox synchronize pull request 03#1473: Add jobrunner2 - 13https://git.io/JJr9a [17:21:58] [02puppet] 07paladox closed pull request 03#1473: Add jobrunner2 - 13https://git.io/JJr9a [17:22:00] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+1/-0/±1] 13https://git.io/JJr91 [17:22:02] [02miraheze/puppet] 07paladox 031943069 - Add jobrunner2 (#1473) [17:22:03] [02miraheze/puppet] 07paladox deleted branch 03paladox-patch-2 [17:22:05] [02puppet] 07paladox deleted branch 03paladox-patch-2 - 13https://git.io/vbiAS [17:27:31] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJr9x [17:27:32] [02miraheze/puppet] 07paladox 030a812e0 - Update site.pp [17:31:28] PROBLEM - cp3 Disk Space on cp3 is WARNING: DISK WARNING - free space: / 2650 MB (10% inode=93%); [17:33:41] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 28.19, 23.22, 18.69 [17:33:43] Reception123: I wonder why ^ [17:33:59] cloud2 must be because of paladox installing jobrunner2 [17:34:02] PROBLEM - jobrunner2 HTTPS on jobrunner2 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [17:34:06] PROBLEM - jobrunner2 JobChron Service on jobrunner2 is CRITICAL: PROCS CRITICAL: 0 processes with args 'redisJobChronService' [17:34:27] Reception123: I meant cp3 disk space is warning [17:35:37] oh [17:35:42] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 18.73, 21.56, 18.64 [17:39:39] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 16.62, 19.40, 18.47 [17:43:24] [02miraheze/MirahezeMagic] 07paladox pushed 031 commit to 03paladox-patch-1 [+0/-0/±1] 13https://git.io/JJrHw [17:43:25] [02miraheze/MirahezeMagic] 07paladox 030d5958e - Fix deleting user page [17:43:27] [02MirahezeMagic] 07paladox created branch 03paladox-patch-1 - 13https://git.io/fQRGX [17:43:30] [02MirahezeMagic] 07paladox opened pull request 03#141: Fix deleting user page - 13https://github.com/miraheze/MirahezeMagic/pull/141 [17:44:36] PROBLEM - jobrunner2 Puppet on jobrunner2 is CRITICAL: CRITICAL: Puppet has 4 failures. Last run 2 minutes ago with 4 failures. Failed resources (up to 3 shown): File[/usr/lib/php/20170718/luasandbox.so],Exec[nginx-reload],Exec[ExtensionMessageFiles],Mount[/mnt/mediawiki-static] [17:45:25] [02MirahezeMagic] 07paladox synchronize pull request 03#141: Fix deleting user page - 13https://git.io/JJrHr [17:45:26] [02miraheze/MirahezeMagic] 07paladox pushed 031 commit to 03paladox-patch-1 [+0/-0/±1] 13https://git.io/JJrH6 [17:45:28] [02miraheze/MirahezeMagic] 07paladox 0372e3b88 - Update removePII.php [17:45:34] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 18.91, 20.71, 19.41 [17:46:06] [02miraheze/MirahezeMagic] 07translatewiki pushed 031 commit to 03master [+0/-0/±2] 13https://git.io/JJrHi [17:46:08] [02miraheze/MirahezeMagic] 07translatewiki 037e36152 - Localisation updates from https://translatewiki.net. [17:46:08] [ Main page - translatewiki.net ] - translatewiki.net [17:46:09] [02miraheze/CreateWiki] 07translatewiki pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrHP [17:46:11] [02miraheze/CreateWiki] 07translatewiki 03dd6f5d6 - Localisation updates from https://translatewiki.net. [17:46:12] [ Main page - translatewiki.net ] - translatewiki.net [17:46:12] [02MirahezeMagic] 07paladox closed pull request 03#141: Fix deleting user page - 13https://git.io/JJrHr [17:46:14] [02miraheze/MirahezeMagic] 07paladox pushed 033 commits to 03master [+0/-0/±3] 13https://git.io/JJrHX [17:46:15] [02miraheze/MirahezeMagic] 07paladox 039714bfe - Fix deleting user page (#141) * Fix deleting user page * Update removePII.php [17:46:17] [02MirahezeMagic] 07paladox deleted branch 03paladox-patch-1 - 13https://git.io/fQRGX [17:46:18] [02miraheze/MirahezeMagic] 07paladox deleted branch 03paladox-patch-1 [17:46:38] miraheze/MirahezeMagic/paladox-patch-1/72e3b88 - paladox The build has errored. https://travis-ci.com/miraheze/MirahezeMagic/builds/178296671 [17:47:32] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 16.42, 18.73, 18.82 [17:49:47] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrQf [17:49:48] [02miraheze/puppet] 07paladox 030ad0d7c - gluster: Add jobrunner2 to firewall [17:51:13] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrQU [17:51:14] [02miraheze/puppet] 07paladox 03edac317 - db: add jobrunner2 [17:53:29] RECOVERY - jobrunner2 JobChron Service on jobrunner2 is OK: PROCS OK: 1 process with args 'redisJobChronService' [17:54:05] Yey! [17:54:28] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrQs [17:54:29] [02miraheze/puppet] 07paladox 034ec7d27 - varnish: Add jobrunner1 and 2 to firewall [17:55:10] >  [miraheze/puppet] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JJrQU Yeah. Innocuous little commit that deserves a bit more attention. 😛 [17:55:12] [ Comparing 0ad0d7c7b2e6...edac317303e8 · miraheze/puppet · GitHub ] - git.io [17:55:23] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 19.78, 21.50, 19.99 [17:57:20] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 17.02, 19.38, 19.37 [17:57:38] [02miraheze/mediawiki] 07paladox pushed 031 commit to 03REL1_34 [+0/-0/±1] 13https://git.io/JJrQ4 [17:57:39] [02miraheze/mediawiki] 07paladox 038cdee07 - Update MirahezeMagic [18:05:04] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrQD [18:05:05] [02miraheze/puppet] 07paladox 03d648beb - varnish: Add jobrunner2 [18:07:47] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrQd [18:07:49] [02miraheze/puppet] 07paladox 03ddfa619 - Remove /usr/lib/php/20170718/luasandbox.so [18:10:02] [02miraheze/puppet] 07Southparkfan pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrQp [18:10:04] [02miraheze/puppet] 07Southparkfan 03f1653a9 - Automate firewall rules for redis [18:11:05] PROBLEM - mw6 Puppet on mw6 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [18:11:13] PROBLEM - mw7 Puppet on mw7 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Exec[git_pull_MediaWiki core] [18:13:01] RECOVERY - mw6 Puppet on mw6 is OK: OK: Puppet is currently enabled, last run 16 seconds ago with 0 failures [18:13:12] RECOVERY - mw7 Puppet on mw7 is OK: OK: Puppet is currently enabled, last run 23 seconds ago with 0 failures [18:15:59] PROBLEM - cp6 Puppet on cp6 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 3 minutes ago with 1 failures. Failed resources (up to 3 shown): Service[varnish] [18:16:00] !log deleted firewall rule 'allow from any to 6379' from rdb[12] [18:16:03] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [18:17:21] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 27.65, 23.07, 20.57 [18:19:20] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 12.90, 19.63, 19.65 [18:19:54] RECOVERY - jobrunner2 Puppet on jobrunner2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [18:23:59] RECOVERY - cp6 Puppet on cp6 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [18:25:40] [02miraheze/puppet] 07paladox pushed 031 commit to 03paladox-patch-2 [+0/-0/±1] 13https://git.io/JJr7r [18:25:41] [02miraheze/puppet] 07paladox 032b23420 - Revert blocking empty user agents [18:25:43] [02puppet] 07paladox created branch 03paladox-patch-2 - 13https://git.io/vbiAS [18:25:44] [02puppet] 07paladox opened pull request 03#1474: Revert blocking empty user agents - 13https://git.io/JJr7o [18:25:54] [02puppet] 07paladox closed pull request 03#1474: Revert blocking empty user agents - 13https://git.io/JJr7o [18:25:56] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJr7K [18:25:57] [02miraheze/puppet] 07paladox 03f6ab30a - Revert blocking empty user agents (#1474) [18:25:59] [02puppet] 07paladox deleted branch 03paladox-patch-2 - 13https://git.io/vbiAS [18:26:00] [02miraheze/puppet] 07paladox deleted branch 03paladox-patch-2 [18:32:56] PROBLEM - mw5 Current Load on mw5 is CRITICAL: CRITICAL - load average: 9.74, 6.41, 4.78 [18:34:53] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 5.59, 6.20, 4.93 [18:40:31] [02miraheze/puppet] 07paladox pushed 031 commit to 03revert-1474-paladox-patch-2 [+0/-0/±1] 13https://git.io/JJr7j [18:40:32] [02miraheze/puppet] 07paladox 034317133 - Revert "Revert blocking empty user agents (#1474)" This reverts commit f6ab30a04fd1d17cbca067755d4a24dab0b21bac. [18:40:34] [02puppet] 07paladox created branch 03revert-1474-paladox-patch-2 - 13https://git.io/vbiAS [18:40:35] [02puppet] 07paladox opened pull request 03#1475: Revert "Revert blocking empty user agents" - 13https://git.io/JJr5e [18:40:40] [02puppet] 07paladox closed pull request 03#1475: Revert "Revert blocking empty user agents" - 13https://git.io/JJr5e [18:40:41] [02miraheze/puppet] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJr5v [18:40:43] [02miraheze/puppet] 07paladox 03b62eb34 - Revert "Revert blocking empty user agents (#1474)" (#1475) This reverts commit f6ab30a04fd1d17cbca067755d4a24dab0b21bac. [18:40:44] [02puppet] 07paladox deleted branch 03revert-1474-paladox-patch-2 - 13https://git.io/vbiAS [18:40:46] [02miraheze/puppet] 07paladox deleted branch 03revert-1474-paladox-patch-2 [18:50:41] PROBLEM - phab1 Puppet on phab1 is WARNING: WARNING: Puppet is currently disabled, message: paladox, last run 8 minutes ago with 0 failures [19:14:05] !log upgrade phabricator [19:14:09] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [19:24:51] [02miraheze/mw-config] 07paladox pushed 031 commit to 03master [+0/-0/±1] 13https://git.io/JJrFY [19:24:53] [02miraheze/mw-config] 07paladox 03791a4b0 - Update LocalSettings.php [19:25:58] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 34.21, 23.53, 18.68 [19:29:53] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 23.60, 22.96, 19.51 [19:33:27] PROBLEM - mw4 Current Load on mw4 is WARNING: WARNING - load average: 7.02, 5.87, 5.03 [19:33:47] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 27.06, 24.18, 20.68 [19:35:27] RECOVERY - mw4 Current Load on mw4 is OK: OK - load average: 3.59, 4.98, 4.80 [19:35:43] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 19.17, 22.50, 20.50 [19:37:39] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 12.67, 19.46, 19.64 [19:43:26] PROBLEM - mw4 Current Load on mw4 is WARNING: WARNING - load average: 7.75, 7.00, 5.69 [19:45:26] RECOVERY - mw4 Current Load on mw4 is OK: OK - load average: 4.13, 5.84, 5.41 [19:46:36] PROBLEM - cp9 Puppet on cp9 is CRITICAL: CRITICAL: Puppet has 7 failures. Last run 2 minutes ago with 7 failures. Failed resources (up to 3 shown) [19:47:59] paladox: what's failed ^ [19:48:08] me restarting puppetserver [19:48:26] !log restarted puppetserver [19:48:36] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [19:48:42] PROBLEM - cp3 Puppet on cp3 is CRITICAL: CRITICAL: Puppet has 149 failures. Last run 3 minutes ago with 149 failures. Failed resources (up to 3 shown): File[wiki.casadocarvalho.net_private],File[wiki.landan.ca],File[wiki.landan.ca_private],File[bebaskanpengetahuan.id] [19:48:54] paladox: ack [19:54:11] PROBLEM - mw5 Current Load on mw5 is WARNING: WARNING - load average: 7.31, 6.38, 5.55 [19:54:35] RECOVERY - cp9 Puppet on cp9 is OK: OK: Puppet is currently enabled, last run 17 seconds ago with 0 failures [19:56:10] RECOVERY - mw5 Current Load on mw5 is OK: OK - load average: 4.61, 5.65, 5.38 [19:56:39] RECOVERY - cp3 Puppet on cp3 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:00:22] !log reduce puppet2 cores by 1 (so 2 cores instead of 3) [20:00:32] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [20:02:43] PROBLEM - bacula2 Puppet on bacula2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:02:48] PROBLEM - rdb2 Puppet on rdb2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:02:52] PROBLEM - db12 Puppet on db12 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:02:52] PROBLEM - rdb1 Puppet on rdb1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:02:54] PROBLEM - mon1 Puppet on mon1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:02:59] PROBLEM - cloud3 Puppet on cloud3 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:00] PROBLEM - cloud2 Puppet on cloud2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:04] PROBLEM - puppet2 puppetserver on puppet2 is CRITICAL: connect to address 51.89.160.129 and port 8140: Connection refused [20:03:12] PROBLEM - mw7 Puppet on mw7 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:12] PROBLEM - cp7 Puppet on cp7 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:13] PROBLEM - ns1 Puppet on ns1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:14] PROBLEM - mw5 Puppet on mw5 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:14] PROBLEM - jobrunner1 Puppet on jobrunner1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:17] PROBLEM - db11 Puppet on db11 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:21] PROBLEM - jobrunner2 Puppet on jobrunner2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:22] PROBLEM - services1 Puppet on services1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:24] PROBLEM - services2 Puppet on services2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:27] PROBLEM - cloud1 Puppet on cloud1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:30] PROBLEM - db7 Puppet on db7 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:33] PROBLEM - puppet2 Puppet on puppet2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:35] PROBLEM - misc1 Puppet on misc1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:41] PROBLEM - ns2 Puppet on ns2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:42] PROBLEM - gluster2 Puppet on gluster2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:50] PROBLEM - db13 Puppet on db13 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:53] PROBLEM - cp6 Puppet on cp6 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:03:54] PROBLEM - ldap1 Puppet on ldap1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:04:04] PROBLEM - mail1 Puppet on mail1 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:04:09] PROBLEM - mw6 Puppet on mw6 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:04:35] PROBLEM - cp9 Puppet on cp9 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:04:39] PROBLEM - cp3 Puppet on cp3 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:04:42] PROBLEM - mw4 Puppet on mw4 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:04:46] PROBLEM - test2 Puppet on test2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [20:05:00] RECOVERY - puppet2 puppetserver on puppet2 is OK: TCP OK - 0.000 second response time on 51.89.160.129 port 8140 [20:06:36] !log revert decrease [20:06:40] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [20:08:22] !log upgrade puppet2 to debian 10.5 [20:08:27] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [20:08:32] PROBLEM - ns1 APT on ns1 is CRITICAL: connect to address 192.184.82.120 port 5666: Connection refusedconnect to host 192.184.82.120 port 5666: Connection refused [20:08:40] PROBLEM - ns1 Disk Space on ns1 is CRITICAL: connect to address 192.184.82.120 port 5666: Connection refusedconnect to host 192.184.82.120 port 5666: Connection refused [20:09:00] PROBLEM - puppet2 puppetserver on puppet2 is CRITICAL: connect to address 51.89.160.129 and port 8140: Connection refused [20:09:01] !log reboot puppet2 [20:09:08] PROBLEM - ns1 GDNSD Datacenters on ns1 is CRITICAL: connect to address 192.184.82.120 port 5666: Connection refusedconnect to host 192.184.82.120 port 5666: Connection refused [20:09:10] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [20:09:57] PROBLEM - ns1 Current Load on ns1 is CRITICAL: connect to address 192.184.82.120 port 5666: Connection refusedconnect to host 192.184.82.120 port 5666: Connection refused [20:10:02] PROBLEM - ns1 NTP time on ns1 is CRITICAL: connect to address 192.184.82.120 port 5666: Connection refusedconnect to host 192.184.82.120 port 5666: Connection refused [20:12:55] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 20.64, 18.64, 18.40 [20:13:00] RECOVERY - puppet2 puppetserver on puppet2 is OK: TCP OK - 0.001 second response time on 51.89.160.129 port 8140 [20:13:26] RECOVERY - cloud1 Puppet on cloud1 is OK: OK: Puppet is currently enabled, last run 51 seconds ago with 0 failures [20:14:51] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 24.94, 21.08, 19.33 [20:15:00] !log set cpu type to host for puppet2 [20:15:07] RECOVERY - ns1 GDNSD Datacenters on ns1 is OK: OK - all datacenters are online [20:15:08] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [20:15:17] RECOVERY - db11 Puppet on db11 is OK: OK: Puppet is currently enabled, last run 5 seconds ago with 0 failures [20:15:23] RECOVERY - db7 Puppet on db7 is OK: OK: Puppet is currently enabled, last run 18 seconds ago with 0 failures [20:15:36] RECOVERY - misc1 Puppet on misc1 is OK: OK: Puppet is currently enabled, last run 22 seconds ago with 0 failures [20:15:40] RECOVERY - ns2 Puppet on ns2 is OK: OK: Puppet is currently enabled, last run 33 seconds ago with 0 failures [20:15:47] RECOVERY - db13 Puppet on db13 is OK: OK: Puppet is currently enabled, last run 35 seconds ago with 0 failures [20:15:56] PROBLEM - mw4 Current Load on mw4 is CRITICAL: CRITICAL - load average: 8.09, 7.39, 6.46 [20:15:58] RECOVERY - ns1 Current Load on ns1 is OK: OK - load average: 0.01, 0.15, 0.19 [20:16:00] RECOVERY - ldap1 Puppet on ldap1 is OK: OK: Puppet is currently enabled, last run 51 seconds ago with 0 failures [20:16:04] RECOVERY - mail1 Puppet on mail1 is OK: OK: Puppet is currently enabled, last run 44 seconds ago with 0 failures [20:16:09] RECOVERY - ns1 NTP time on ns1 is OK: NTP OK: Offset -0.004383265972 secs [20:16:35] RECOVERY - ns1 APT on ns1 is OK: APT OK: 22 packages available for upgrade (0 critical updates). [20:16:40] RECOVERY - ns1 Disk Space on ns1 is OK: DISK OK - free space: / 11353 MB (88% inode=92%); [20:16:45] RECOVERY - bacula2 Puppet on bacula2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:16:46] RECOVERY - test2 Puppet on test2 is OK: OK: Puppet is currently enabled, last run 14 seconds ago with 0 failures [20:16:48] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 20.48, 21.07, 19.56 [20:16:48] RECOVERY - rdb2 Puppet on rdb2 is OK: OK: Puppet is currently enabled, last run 57 seconds ago with 0 failures [20:16:52] RECOVERY - rdb1 Puppet on rdb1 is OK: OK: Puppet is currently enabled, last run 58 seconds ago with 0 failures [20:16:52] RECOVERY - db12 Puppet on db12 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:16:52] RECOVERY - mon1 Puppet on mon1 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:16:57] RECOVERY - cloud2 Puppet on cloud2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:16:58] RECOVERY - cloud3 Puppet on cloud3 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:17:11] RECOVERY - mw7 Puppet on mw7 is OK: OK: Puppet is currently enabled, last run 24 seconds ago with 0 failures [20:17:11] RECOVERY - cp7 Puppet on cp7 is OK: OK: Puppet is currently enabled, last run 42 seconds ago with 0 failures [20:17:15] RECOVERY - mw5 Puppet on mw5 is OK: OK: Puppet is currently enabled, last run 32 seconds ago with 0 failures [20:17:16] RECOVERY - jobrunner1 Puppet on jobrunner1 is OK: OK: Puppet is currently enabled, last run 42 seconds ago with 0 failures [20:17:21] RECOVERY - jobrunner2 Puppet on jobrunner2 is OK: OK: Puppet is currently enabled, last run 52 seconds ago with 0 failures [20:17:22] RECOVERY - services1 Puppet on services1 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:17:24] RECOVERY - services2 Puppet on services2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:17:33] RECOVERY - puppet2 Puppet on puppet2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:17:39] RECOVERY - gluster2 Puppet on gluster2 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:17:52] RECOVERY - mw4 Current Load on mw4 is OK: OK - load average: 4.47, 6.50, 6.25 [20:17:58] RECOVERY - cp6 Puppet on cp6 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:18:10] RECOVERY - mw6 Puppet on mw6 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:18:35] RECOVERY - cp9 Puppet on cp9 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:18:42] RECOVERY - mw4 Puppet on mw4 is OK: OK: Puppet is currently enabled, last run 2 minutes ago with 0 failures [20:20:42] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 25.05, 21.73, 20.06 [20:23:11] RECOVERY - ns1 Puppet on ns1 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:24:37] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 21.13, 22.57, 20.92 [20:26:33] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 30.55, 25.83, 22.31 [20:26:42] RECOVERY - cp3 Puppet on cp3 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [20:32:31] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 21.42, 23.19, 22.15 [20:36:25] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 27.59, 23.88, 22.55 [20:38:22] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 16.74, 21.07, 21.69 [20:41:39] paladox: what's with load? [20:42:14] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 12.92, 16.33, 19.57 [20:46:09] PROBLEM - cloud2 Current Load on cloud2 is CRITICAL: CRITICAL - load average: 24.30, 20.52, 20.52 [20:48:06] PROBLEM - cloud2 Current Load on cloud2 is WARNING: WARNING - load average: 20.68, 20.26, 20.41 [20:50:02] RECOVERY - cloud2 Current Load on cloud2 is OK: OK - load average: 12.48, 17.07, 19.21 [21:01:31] looks like jobrunner2 [21:01:57] (since it's now uses 4 more cores) [21:09:48] paladox: can we balance stuff out? [21:45:38] No. Apart from removing jobrunner2 [21:46:43] urgh [22:19:02] !log populate content_models for wizpedia101 [22:19:05] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:19:45] !log INSERT INTO slot_roles (role_id, role_name) VALUES (1, 'main'); on same wiki [22:19:49] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:20:49] paladox: ^ worked [22:22:10] Ok [23:04:18] :)