[01:12:26] PROBLEM - HHVM rendering on mw2244 is CRITICAL: CRITICAL - Socket timeout after 10 seconds
[01:12:26] PROBLEM - HHVM rendering on mw2180 is CRITICAL: CRITICAL - Socket timeout after 10 seconds
[01:13:16] RECOVERY - HHVM rendering on mw2180 is OK: HTTP OK: HTTP/1.1 200 OK - 78591 bytes in 0.302 second response time
[01:13:17] RECOVERY - HHVM rendering on mw2244 is OK: HTTP OK: HTTP/1.1 200 OK - 78591 bytes in 0.333 second response time
[02:17:56] PROBLEM - HHVM rendering on mw1289 is CRITICAL: HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - 1308 bytes in 0.001 second response time
[02:18:06] PROBLEM - Nginx local proxy to apache on mw1289 is CRITICAL: HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - 1308 bytes in 0.006 second response time
[02:18:37] PROBLEM - Apache HTTP on mw1289 is CRITICAL: HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - 1308 bytes in 0.001 second response time
[02:18:56] RECOVERY - HHVM rendering on mw1289 is OK: HTTP OK: HTTP/1.1 200 OK - 78559 bytes in 0.342 second response time
[02:19:06] RECOVERY - Nginx local proxy to apache on mw1289 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 617 bytes in 0.033 second response time
[02:19:37] RECOVERY - Apache HTTP on mw1289 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 617 bytes in 0.158 second response time
[02:42:27] PROBLEM - HHVM rendering on mw2171 is CRITICAL: CRITICAL - Socket timeout after 10 seconds
[02:43:17] RECOVERY - HHVM rendering on mw2171 is OK: HTTP OK: HTTP/1.1 200 OK - 78583 bytes in 0.296 second response time
[02:46:26] (PS1) Madhuvishy: public_dumps: Remove module path and rename to distribution [puppet] - https://gerrit.wikimedia.org/r/401198 (https://phabricator.wikimedia.org/T171539)
[02:50:41] (CR) Madhuvishy: [C: 2] public_dumps: Remove module path and rename to distribution [puppet] - https://gerrit.wikimedia.org/r/401198 (https://phabricator.wikimedia.org/T171539) (owner: Madhuvishy)
[03:25:16] PROBLEM - MariaDB Slave Lag: s1 on dbstore1002 is CRITICAL: CRITICAL slave_sql_lag Replication lag: 851.38 seconds
[03:49:16] RECOVERY - MariaDB Slave Lag: s1 on dbstore1002 is OK: OK slave_sql_lag Replication lag: 123.98 seconds
[12:13:53] (CR) TerraCodes: [C: 1] Add patrol to Image-reviewer on Commons [mediawiki-config] - https://gerrit.wikimedia.org/r/401160 (https://phabricator.wikimedia.org/T183835) (owner: Revi)
[20:14:05] Operations, monitoring, Patch-For-Review: Uninstall ganglia from the fleet - https://phabricator.wikimedia.org/T177225#3865946 (Quiddity)
[20:51:02] * addshore is gonna backport https://gerrit.wikimedia.org/r/#/c/399627/
[20:58:48] * addshore does so
[20:59:38] !log addshore@tin Synchronized php-1.31.0-wmf.12/extensions/WikimediaEvents/WikimediaEventsHooks.php: [[gerrit:399627|Add onBeforeInitializeWMDECampaign]] T182797 T182794 PT 1/2 (duration: 00m 55s)
[20:59:51] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[20:59:52] T182797: prepare patch for tracking user registrations and guided tours - https://phabricator.wikimedia.org/T182797
[20:59:52] T182794: Deploy 'hack' patch & logging for tracking user registrations and guided tour - https://phabricator.wikimedia.org/T182794
[21:01:10] !log addshore@tin Synchronized php-1.31.0-wmf.12/extensions/WikimediaEvents: [[gerrit:399627|Add onBeforeInitializeWMDECampaign]] T182797 T182794 PT 2/2 (duration: 00m 53s)
[21:01:22] Logged the message at https://wikitech.wikimedia.org/wiki/Server_Admin_Log
[21:01:28] and that is all
[22:10:08] Puppet, Cloud-Services, Toolforge, Patch-For-Review: Make standalone puppetmasters optionally use PuppetDB - https://phabricator.wikimedia.org/T153577#2884609 (Paladox) I think on labs we can host it on the same instance as the puppetmaster. I got it working like that and reduces the complexity o...