[00:05:41] (CR) Volans: [C: +1] "LGTM! But please re-test it against af-netbox (add/modify/delete instances) and potential error cases." [software/netbox-deploy] - https://gerrit.wikimedia.org/r/492007 (https://phabricator.wikimedia.org/T215229) (owner: CRusnov)
[00:09:47] (PS5) Tim Eulitz: Set up exceptions for rollback confirmation [mediawiki-config] - https://gerrit.wikimedia.org/r/494270 (https://phabricator.wikimedia.org/T217436)
[00:10:10] (PS2) Tim Eulitz: Add default user config for rollback confirmation [mediawiki-config] - https://gerrit.wikimedia.org/r/495667 (https://phabricator.wikimedia.org/T217436)
[00:14:05] (PS12) CRusnov: Add system timer for running ganeti->netbox sync. [puppet] - https://gerrit.wikimedia.org/r/493774 (https://phabricator.wikimedia.org/T215229)
[00:16:20] PROBLEM - citoid endpoints health on scb1001 is CRITICAL: /api (bad URL) timed out before a response was received
[00:17:28] RECOVERY - citoid endpoints health on scb1001 is OK: All endpoints are healthy
[00:35:24] (PS1) Bstorm: toolforge: limit file size to 50 GB [puppet] - https://gerrit.wikimedia.org/r/496082 (https://phabricator.wikimedia.org/T122508)
[02:33:19] (PS1) EBernhardson: Allow apifeatureusage to work in beta cluster [puppet] - https://gerrit.wikimedia.org/r/496093 (https://phabricator.wikimedia.org/T183156)
[02:36:06] (CR) Alex Monk: Allow apifeatureusage to work in beta cluster (1 comment) [puppet] - https://gerrit.wikimedia.org/r/496093 (https://phabricator.wikimedia.org/T183156) (owner: EBernhardson)
[02:36:23] (CR) Alex Monk: [C: -1] "new host does not exist" [puppet] - https://gerrit.wikimedia.org/r/496093 (https://phabricator.wikimedia.org/T183156) (owner: EBernhardson)
[02:44:00] (PS2) EBernhardson: Allow apifeatureusage to work in beta cluster [puppet] - https://gerrit.wikimedia.org/r/496093 (https://phabricator.wikimedia.org/T183156)
[02:55:40] Operations, Parsoid-PHP: Install PHP7 on
scandium - https://phabricator.wikimedia.org/T213493 (ssastry)
[04:00:04] kart_: I seem to be stuck in Groundhog week. Sigh. Time for (yet another) deploy. (https://wikitech.wikimedia.org/wiki/Deployments#deploycal-item-20190313T0400).
[04:11:07] !log Started manual run of unpublished ContentTranslation draft purge script (T217818)
[04:11:10] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[04:11:10] T217818: Run unpublished draft purge script for CX (Week of 03/10) - https://phabricator.wikimedia.org/T217818
[05:04:34] (PS7) Ammarpad: Create new protection levels for dewiktionary [mediawiki-config] - https://gerrit.wikimedia.org/r/495918 (https://phabricator.wikimedia.org/T216885)
[05:06:46] (PS8) Ammarpad: Create new protection levels for dewiktionary [mediawiki-config] - https://gerrit.wikimedia.org/r/495918 (https://phabricator.wikimedia.org/T216885)
[06:11:37] (PS1) Marostegui: db-eqiad.php: Depool db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496098
[06:12:42] (PS1) BryanDavis: Add helpers for local testing [docker-images/toollabs-images] - https://gerrit.wikimedia.org/r/496099
[06:12:44] (PS1) BryanDavis: Tweak build reporting output [docker-images/toollabs-images] - https://gerrit.wikimedia.org/r/496100
[06:12:46] (PS1) BryanDavis: all: install locales used in Toolforge grid [docker-images/toollabs-images] - https://gerrit.wikimedia.org/r/496101 (https://phabricator.wikimedia.org/T218072)
[06:12:48] (PS1) BryanDavis: php72: tideways & switch to component/php72 [docker-images/toollabs-images] - https://gerrit.wikimedia.org/r/496102 (https://phabricator.wikimedia.org/T202825)
[06:12:50] (PS1) BryanDavis: php: install php5-xdebug [docker-images/toollabs-images] - https://gerrit.wikimedia.org/r/496103 (https://phabricator.wikimedia.org/T202825)
[06:12:52] (PS1) BryanDavis: python35: new images [docker-images/toollabs-images] - https://gerrit.wikimedia.org/r/496104
[06:12:54] (PS1) BryanDavis: node10: new images [docker-images/toollabs-images] - https://gerrit.wikimedia.org/r/496105 (https://phabricator.wikimedia.org/T195103)
[06:12:56] (CR) Marostegui: [C: +2] db-eqiad.php: Depool db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496098 (owner: Marostegui)
[06:13:55] (Merged) jenkins-bot: db-eqiad.php: Depool db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496098 (owner: Marostegui)
[06:15:21] !log marostegui@deploy1001 Synchronized wmf-config/db-eqiad.php: Depool db1096 (duration: 01m 07s)
[06:15:26] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[06:21:03] !log Testing snapshotting on db1117:3321 to > dbstore1001 - T210292
[06:21:05] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[06:21:06] T210292: Implement a proof of concept of a snapshot cycle automation for a mediawiki section database - https://phabricator.wikimedia.org/T210292
[06:22:33] (CR) jenkins-bot: db-eqiad.php: Depool db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496098 (owner: Marostegui)
[06:24:27] !log Finished manual run of unpublished ContentTranslation draft purge script (T217818)
[06:24:32] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[06:24:32] T217818: Run unpublished draft purge script for CX (Week of 03/10) - https://phabricator.wikimedia.org/T217818
[06:30:09] PROBLEM - puppet last run on ganeti1006 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 4 minutes ago with 1 failures. Failed resources (up to 3 shown): File[/usr/local/bin/puppet-enabled]
[06:30:35] PROBLEM - puppet last run on phab1002 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 4 minutes ago with 1 failures. Failed resources (up to 3 shown): File[/usr/local/bin/puppet-enabled]
[06:31:17] PROBLEM - puppet last run on acmechief1001 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 5 minutes ago with 1 failures.
Failed resources (up to 3 shown): File[/etc/ferm/conf.d/00_main]
[06:31:31] PROBLEM - puppet last run on labmon1002 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 5 minutes ago with 1 failures. Failed resources (up to 3 shown): File[/etc/profile.d/field.sh]
[06:40:44] !log Stop MySQL on db1096 for upgrade
[06:40:45] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[06:52:13] (PS1) Marostegui: db-eqiad.php: Slowly repool db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496108
[06:53:50] (CR) Marostegui: [C: +2] db-eqiad.php: Slowly repool db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496108 (owner: Marostegui)
[06:54:47] (Merged) jenkins-bot: db-eqiad.php: Slowly repool db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496108 (owner: Marostegui)
[06:56:00] (CR) jenkins-bot: db-eqiad.php: Slowly repool db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496108 (owner: Marostegui)
[06:56:00] !log marostegui@deploy1001 Synchronized wmf-config/db-eqiad.php: Slowly repool db1096 (duration: 00m 55s)
[06:56:01] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[06:56:07] RECOVERY - puppet last run on ganeti1006 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures
[06:56:31] RECOVERY - puppet last run on phab1002 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures
[06:57:11] RECOVERY - puppet last run on acmechief1001 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures
[06:57:23] RECOVERY - puppet last run on labmon1002 is OK: OK: Puppet is currently enabled, last run 2 minutes ago with 0 failures
[06:58:11] (PS2) Elukey: Set notifications disabled for analytics-tool1004 [puppet] - https://gerrit.wikimedia.org/r/495876
[06:58:37] (PS1) Marostegui: db-eqiad.php: More traffic to db1096 [mediawiki-config] - https://gerrit.wikimedia.org/r/496109
[06:58:59] !log
Upgrade MySQL and kernel on db2036
[06:59:00] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[06:59:38] Operations, Traffic, VisualEditor, Wikimedia-Apache-configuration: Visual Editor gets stuck opening article (net::ERR_SPDY_PROTOCOL_ERROR 200/Loading failed for the