[00:00:05] do I even need to sync that? [00:00:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:05:05] bd808|BUFFER: 'Started update apaches' sounds a bit funny [00:15:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 469866 bytes in 9.717 second response time [00:18:36] !log ori Finished scap: Cherry-pick Ibe8e67ebf for MobileFrontend on 1.23wmf22 and 1.24wmf1; add GlobalCssJs extension to 1.24wmf1 and 1.23wmf22 (duration: 32m 53s) [00:18:43] Logged the message, Master [00:18:54] RECOVERY - Varnishkafka Delivery Errors on cp3020 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0 [00:20:00] (03PS3) 10Ori.livneh: Enable GlobalCssJs on testwiki & test2wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127178 (owner: 10Legoktm) [00:20:02] (03CR) 10Ori.livneh: [C: 032] Enable GlobalCssJs on testwiki & test2wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127178 (owner: 10Legoktm) [00:20:04] RECOVERY - Varnishkafka Delivery Errors on cp3019 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0 [00:20:11] (03Merged) 10jenkins-bot: Enable GlobalCssJs on testwiki & test2wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127178 (owner: 10Legoktm) [00:20:52] !log ori updated /a/common to {{Gerrit|Ie9b265be9}}: Enable GlobalCssJs on testwiki & test2wiki [00:20:58] Logged the message, Master [00:21:20] !log ori synchronized wmf-config/InitialiseSettings.php 'Ie9b265be9: Enable GlobalCssJs on testwiki & test2wiki (1/2)' [00:21:26] Logged the message, Master [00:21:37] !log ori synchronized wmf-config/CommonSettings.php 'Ie9b265be9: Enable GlobalCssJs on testwiki & test2wiki (2/2)' [00:21:43] Logged the message, Master [00:23:06] mwalker|away: I'm not sure what needs to be done to deploy Ib984e9820, so I'm skipping it, sorry. [00:23:37] [00:27:54] PROBLEM - Varnishkafka Delivery Errors on cp3020 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 1433.133301 [00:28:05] PROBLEM - Varnishkafka Delivery Errors on cp3019 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 1877.800049 [00:28:54] RECOVERY - Varnishkafka Delivery Errors on cp3020 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0 [00:29:05] RECOVERY - Varnishkafka Delivery Errors on cp3019 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0 [00:39:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:43:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 471791 bytes in 9.731 second response time [00:53:10] ori, *nods* I was distracted by gwicke :p [00:53:14] I'll deploy it monday [00:53:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:56:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 471127 bytes in 9.782 second response time [00:59:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:00:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 471123 bytes in 9.885 second response time [01:05:04] PROBLEM - Puppet freshness on db1056 is CRITICAL: Last successful Puppet run was Wed 16 Apr 2014 06:54:47 AM UTC [01:10:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:12:44] (03CR) 10Jeremyb: "(in reply to Dzahn 04-15 15:49)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/111386 (owner: 10Jeremyb) [01:12:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 470425 bytes in 9.847 second response time [01:15:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:16:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 470425 bytes in 9.792 second response time [01:46:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:47:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 467693 bytes in 9.595 second response time [01:56:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:02:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 465980 bytes in 9.826 second response time [02:11:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:12:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 463858 bytes in 9.753 second response time [02:13:04] PROBLEM - Disk space on virt0 is CRITICAL: DISK CRITICAL - free space: /a 3082 MB (3% inode=99%): [02:18:54] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:19:04] PROBLEM - Disk space on virt0 is CRITICAL: DISK CRITICAL - free space: /a 3748 MB (3% inode=99%): [02:19:54] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 463855 bytes in 9.627 second response time [02:30:04] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:38:04] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 463857 bytes in 9.781 second response time [02:39:53] !log LocalisationUpdate completed (1.23wmf22) at 2014-04-18 02:39:51+00:00 [02:40:01] Logged the message, Master [03:01:04] RECOVERY - Disk space on virt0 is OK: DISK OK [03:06:08] !log LocalisationUpdate completed (1.24wmf1) at 2014-04-18 03:06:06+00:00 [03:06:15] Logged the message, Master [03:24:04] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:30:04] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 454319 bytes in 9.749 second response time [03:37:04] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:38:04] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 453369 bytes in 9.883 second response time [03:51:04] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:53:04] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 452141 bytes in 9.688 second response time [03:56:04] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:59:04] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 452140 bytes in 9.603 second response time [04:04:04] mwalker: I am told that we have the correct puppet/ruby packages for trusty already in the repo so you should be able to spin up a labs instance on it without hassle [04:04:26] !log LocalisationUpdate ResourceLoader cache refresh completed at Fri Apr 18 04:04:21 UTC 2014 (duration 4m 20s) [04:04:32] Logged the message, Master [04:06:04] PROBLEM - Puppet freshness on db1056 is CRITICAL: Last successful Puppet run was Wed 16 Apr 2014 06:54:47 AM UTC [04:12:40] (03CR) 10MaxSem: "Note that $wgMFRemovableClasses doesn't control extracts anymore, so this change needs to adappt to post https://gerrit.wikimedia.org/r/12" (032 comments) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/126226 (owner: 10Prtksxna) [04:13:39] apergos, do you know when those became available? [04:13:59] I'm just trying to figure out why Ryan would have been unable to resolve the conflicts he was seeing when he tried [04:14:25] I think he wasn't relying on those packages [04:19:09] on copper (running trusty) I see puppet 2.7.11 and ruby 1.8 which is consistent with our precise setup [04:19:51] so it should 'just work' [04:28:04] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:29:05] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 454558 bytes in 9.275 second response time [04:43:04] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:44:04] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 454736 bytes in 9.869 second response time [04:49:14] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:50:14] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 454729 bytes in 9.658 second response time [04:58:14] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:02:14] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 454730 bytes in 9.718 second response time [05:05:49] a typical retreival of the index page for gitblit (dynamically generated) takes between 9 and 12 seconds now it seems, the check_http(s) cutoff is 10 [05:06:32] having it retrive something a little lighter weight would be nice, if there is a good option [05:06:35] * apergos pokes around [05:08:14] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:11:14] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 454748 bytes in 9.823 second response time [05:13:09] (03PS1) 10ArielGlenn: change gitblit url check to retrieve something lightweight [operations/puppet] - 10https://gerrit.wikimedia.org/r/127204 [05:15:30] (03CR) 10Dzahn: [C: 031] change gitblit url check to retrieve something lightweight [operations/puppet] - 10https://gerrit.wikimedia.org/r/127204 (owner: 10ArielGlenn) [05:16:14] (03CR) 10ArielGlenn: [C: 032] change gitblit url check to retrieve something lightweight [operations/puppet] - 10https://gerrit.wikimedia.org/r/127204 (owner: 10ArielGlenn) [05:17:14] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:17:32] hush you [05:23:14] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 456057 bytes in 9.577 second response time [05:26:11] "GET /tree/mediawiki%2Fcore.git HTTP/1.1" 200 58374 T=0s [05:26:26] that's more like it [05:26:45] cool, yep [05:50:24] (03CR) 10Chad: "We could go even more lightweight than mw/core. How about operations/puppet?" [operations/puppet] - 10https://gerrit.wikimedia.org/r/127204 (owner: 10ArielGlenn) [06:01:32] ^demon|away: it already takes 0s, but change if you like :-) [06:08:54] (03CR) 10Dzahn: [C: 032] remove admins::restricted from lucene role [operations/puppet] - 10https://gerrit.wikimedia.org/r/126939 (owner: 10Dzahn) [06:29:34] (03PS2) 10Chad: New wikis done building [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/126806 [06:29:40] (03CR) 10Chad: [C: 032] New wikis done building [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/126806 (owner: 10Chad) [06:29:48] (03Merged) 10jenkins-bot: New wikis done building [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/126806 (owner: 10Chad) [06:31:06] !log demon synchronized wmf-config/InitialiseSettings.php 'Next round of wikis done building Cirrus indexes, throw into beta mode' [06:31:12] Logged the message, Master [06:33:18] springle: removing m1-master.pmtpa.wmnet is also harmless, right.. it pointed to db35 which is now down [06:33:59] there are still s1-secondary, s5-secondary, m2-secondary [06:34:12] all talking about the DNS entries [06:34:54] those are db63,db73,db48 [06:36:20] (03PS2) 10ArielGlenn: add wiktionary.eu, link to wiktionary.org [operations/dns] - 10https://gerrit.wikimedia.org/r/126932 (owner: 10Dzahn) [06:37:08] (03CR) 10ArielGlenn: [C: 032] add wiktionary.eu, link to wiktionary.org [operations/dns] - 10https://gerrit.wikimedia.org/r/126932 (owner: 10Dzahn) [06:37:19] :) [06:38:38] (03CR) 10Aklapper: [C: 031] "My guts say Yes, but any reference for "Mozilla recommends it"?" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 (owner: 10Dzahn) [06:39:34] (03CR) 10Dzahn: "Aklapper, reference: https://wiki.mozilla.org/Security/Server_Side_TLS" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 (owner: 10Dzahn) [06:40:46] (03CR) 10Dzahn: "well, what they do in the "Apache" section there." [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 (owner: 10Dzahn) [06:41:45] (03CR) 10Dzahn: "or even more strict as in https://www.insecure.ws/2013/10/11/ssltls-configuration-for-apache-mod_ssl/" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 (owner: 10Dzahn) [06:42:13] (03CR) 10Dzahn: "If you do not enable RC4 or 3DES (“old” clients may not be able to connect!):" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 (owner: 10Dzahn) [06:48:16] (03CR) 10ArielGlenn: [C: 032] Redirect wiktionary.eu to www.wiktionary.org [operations/apache-config] - 10https://gerrit.wikimedia.org/r/126937 (owner: 10Odder) [06:55:11] mutante: yes, we can remove those pmtpa dns cnames [06:56:58] (03CR) 10Springle: [C: 031] decom, remove db35,db38 [operations/dns] - 10https://gerrit.wikimedia.org/r/126972 (owner: 10Dzahn) [06:57:19] (03CR) 10Dzahn: [C: 032] decom, remove db35,db38 [operations/dns] - 10https://gerrit.wikimedia.org/r/126972 (owner: 10Dzahn) [06:57:49] springle: thx, done [06:59:43] (03CR) 10Dzahn: [C: 032] remove rendering.pmtpa,rendering.svc.pmtpa [operations/dns] - 10https://gerrit.wikimedia.org/r/126971 (owner: 10Dzahn) [07:06:41] PROBLEM - Puppet freshness on db1056 is CRITICAL: Last successful Puppet run was Wed 16 Apr 2014 06:54:47 AM UTC [07:07:33] (03PS3) 10Springle: Remove mysql client from bastionhost [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [07:16:40] (03CR) 10Dzahn: "works. testing 2 urls on 190 servers, totalling 380 requests" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/126937 (owner: 10Odder) [07:20:04] (03PS2) 10Dzahn: remove arptest [operations/dns] - 10https://gerrit.wikimedia.org/r/125950 [07:20:59] (03PS3) 10Dzahn: remove arptest [operations/dns] - 10https://gerrit.wikimedia.org/r/125950 [07:24:33] (03PS1) 10Dzahn: remove api.svc.pmtpa.wmnet [operations/dns] - 10https://gerrit.wikimedia.org/r/127209 [07:24:36] (03CR) 10jenkins-bot: [V: 04-1] remove api.svc.pmtpa.wmnet [operations/dns] - 10https://gerrit.wikimedia.org/r/127209 (owner: 10Dzahn) [07:25:03] (03PS2) 10Dzahn: remove api.svc.pmtpa.wmnet [operations/dns] - 10https://gerrit.wikimedia.org/r/127209 [07:32:19] (03PS1) 10Dzahn: remove Tampa appserver mgmt [operations/dns] - 10https://gerrit.wikimedia.org/r/127210 [07:37:39] (03CR) 10ArielGlenn: [C: 031] remove api.svc.pmtpa.wmnet [operations/dns] - 10https://gerrit.wikimedia.org/r/127209 (owner: 10Dzahn) [07:37:55] (03PS2) 10Dzahn: remove Tampa appserver reverse DNS and mgmt [operations/dns] - 10https://gerrit.wikimedia.org/r/127210 [07:43:24] (03CR) 10Dzahn: [C: 032] remove api.svc.pmtpa.wmnet [operations/dns] - 10https://gerrit.wikimedia.org/r/127209 (owner: 10Dzahn) [07:56:21] (03CR) 10Dzahn: [C: 032] remove arptest [operations/dns] - 10https://gerrit.wikimedia.org/r/125950 (owner: 10Dzahn) [07:57:25] !log DNS update - remove api.svc, arptest.pmtpa .. [07:57:31] Logged the message, Master [07:58:37] (03PS3) 10ArielGlenn: add ntp servers on eeden.esams, rubidium (rt #7101) [operations/puppet] - 10https://gerrit.wikimedia.org/r/125954 [08:00:10] (03CR) 10ArielGlenn: [C: 032] add ntp servers on eeden.esams, rubidium (rt #7101) [operations/puppet] - 10https://gerrit.wikimedia.org/r/125954 (owner: 10ArielGlenn) [08:06:23] (03CR) 10Springle: [C: 031] "I don't disagree with this since I either tunnel or use mysql directly on the db boxes themselves. However couple notes:" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [08:11:12] PROBLEM - NTP peers on dobson is CRITICAL: NTP CRITICAL: Server not synchronized, Offset unknown [08:13:12] RECOVERY - NTP peers on dobson is OK: NTP OK: Offset 0.000135 secs [08:20:38] (03PS2) 10Dzahn: Add ttf-kochi-mincho and ttf-kochi-gothic to imagescalers [operations/puppet] - 10https://gerrit.wikimedia.org/r/126729 (owner: 10Reedy) [08:21:01] PROBLEM - NTP peers on linne is CRITICAL: NTP CRITICAL: Server not synchronized, Offset unknown [08:23:01] RECOVERY - NTP peers on linne is OK: NTP OK: Offset -0.005693 secs [08:25:23] (03CR) 10Dzahn: [C: 032] Add ttf-kochi-mincho and ttf-kochi-gothic to imagescalers [operations/puppet] - 10https://gerrit.wikimedia.org/r/126729 (owner: 10Reedy) [08:27:06] (03CR) 10Dzahn: "notice: /Stage[main]/Imagescaler::Packages::Fonts/Package[ttf-kochi-gothic]/ensure: ensure changed 'purged' to 'latest'" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126729 (owner: 10Reedy) [08:30:16] Reedy: [08:30:19] root@mw1153:~# fc-match 'Kochi Micho' [08:30:19] DejaVuSans.ttf: "DejaVu Sans" "Book" [08:30:32] fc-match 'Kochi Gothic' [08:30:32] kochi-gothic-subst.ttf: "Kochi Gothic" "Regular" [08:31:20] ah, "micho" != "mincho" [08:31:27] fc-match 'Kochi Mincho' [08:31:27] kochi-mincho-subst.ttf: "Kochi Mincho" "Regular" [08:34:52] (03CR) 10Dzahn: "this would be good to have for "#5148: move Torrus away from manutius" one remaining Tampa blocker" [operations/puppet] - 10https://gerrit.wikimedia.org/r/108498 (owner: 10Matanya) [08:37:33] (03PS1) 10Springle: MHA site-switch templates are broken with less than two available DCs, and technically useless in this situation anyway. Disable them until we get a replacement for PMTPA that could actually handle a switch over. [operations/puppet] - 10https://gerrit.wikimedia.org/r/127212 [08:43:42] (03CR) 10Dzahn: "that file has been renamed by Coren in Change-Id: If985e506d5b1" [operations/puppet] - 10https://gerrit.wikimedia.org/r/106907 (owner: 10Stwalkerster) [08:45:01] (03PS1) 10Hashar: contint: apply beta natfix on Jenkins slaves [operations/puppet] - 10https://gerrit.wikimedia.org/r/127213 [08:46:33] (03CR) 10Dzahn: "the line is still " 88 proxy_set_header X-Forwarded-For $remote_addr;" though. just now it's in domainproxy.conf" [operations/puppet] - 10https://gerrit.wikimedia.org/r/106907 (owner: 10Stwalkerster) [08:47:49] (03Abandoned) 10Dzahn: nrpe: enable on virt0 [operations/puppet] - 10https://gerrit.wikimedia.org/r/107424 (owner: 10Gage) [08:51:35] (03CR) 10Hashar: [C: 031 V: 032] "Applied on contint puppetmaster. Both slaves are still reachable from gallium and they now manage to contact the beta cluster entries suc" [operations/puppet] - 10https://gerrit.wikimedia.org/r/127213 (owner: 10Hashar) [08:52:08] (03CR) 10Dzahn: [C: 04-2] "matanya, i think this can be abandoned then" [operations/puppet] - 10https://gerrit.wikimedia.org/r/119488 (owner: 10Matanya) [08:53:44] (03CR) 10Dzahn: [C: 04-2] "taking the liberty to abandon this, because Peter wrote it, Mark voted it down and Gabriel said he doesn't need it anymore" [operations/puppet] - 10https://gerrit.wikimedia.org/r/72653 (owner: 10Pyoungmeister) [08:54:00] (03Abandoned) 10Dzahn: proposal for allowing gabriel sudo access for varnishadm for parsoid caches [operations/puppet] - 10https://gerrit.wikimedia.org/r/72653 (owner: 10Pyoungmeister) [08:59:58] (03PS1) 10ArielGlenn: Revert "add ntp servers on eeden.esams, rubidium (rt #7101)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/127214 [09:00:15] (03PS2) 10ArielGlenn: Revert "add ntp servers on eeden.esams, rubidium (rt #7101)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/127214 [09:03:43] (03CR) 10Dzahn: [C: 031] toollabs: Add expect to exec nodes [operations/puppet] - 10https://gerrit.wikimedia.org/r/125201 (owner: 10Yuvipanda) [09:07:10] (03CR) 10ArielGlenn: [C: 032] Revert "add ntp servers on eeden.esams, rubidium (rt #7101)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/127214 (owner: 10ArielGlenn) [09:08:04] (03CR) 10Dzahn: [C: 04-1] "can we use networks from class network::constants here instead of listing networks?" [operations/puppet] - 10https://gerrit.wikimedia.org/r/117674 (owner: 10Matanya) [09:09:14] (03CR) 10Dzahn: [C: 031] Tools: Install package libxml2-utils for xmllint [operations/puppet] - 10https://gerrit.wikimedia.org/r/120187 (owner: 10Tim Landscheidt) [09:10:13] PROBLEM - NTP peers on dobson is CRITICAL: NTP CRITICAL: Server not synchronized, Offset unknown [09:12:03] PROBLEM - NTP peers on linne is CRITICAL: NTP CRITICAL: Server not synchronized, Offset unknown [09:12:54] they'll be back shortly [09:13:13] RECOVERY - NTP peers on dobson is OK: NTP OK: Offset -0.000587 secs [09:14:03] RECOVERY - NTP peers on linne is OK: NTP OK: Offset 0.003985 secs [09:15:24] (03CR) 10Dzahn: "please fix the path conflict" [operations/puppet] - 10https://gerrit.wikimedia.org/r/119438 (owner: 10Tim Landscheidt) [09:17:03] (03CR) 10Dzahn: [C: 04-2] cache: puppet 3 compatibility fix: fully qualify variable [operations/puppet] - 10https://gerrit.wikimedia.org/r/111787 (owner: 10Matanya) [09:19:48] (03CR) 10Dzahn: [C: 031] Describe Math related packages in a class [operations/puppet] - 10https://gerrit.wikimedia.org/r/115133 (owner: 10Hashar) [09:21:21] (03PS2) 10Springle: MHA site-switch templates are broken with less than two available DCs, and technically useless in this situation anyway. Disable them until we get a replacement for PMTPA that could actually handle a switch over. [operations/puppet] - 10https://gerrit.wikimedia.org/r/127212 [09:23:30] (03CR) 10Springle: [C: 032] MHA site-switch templates are broken with less than two available DCs, and technically useless in this situation anyway. Disable them until [operations/puppet] - 10https://gerrit.wikimedia.org/r/127212 (owner: 10Springle) [09:25:03] RECOVERY - Puppet freshness on db1056 is OK: puppet ran at Fri Apr 18 09:24:57 UTC 2014 [09:30:28] (03CR) 10ArielGlenn: "If these are node scope aren't they covered? See" [operations/puppet] - 10https://gerrit.wikimedia.org/r/111787 (owner: 10Matanya) [09:34:58] (03CR) 10Aklapper: [C: 031] bugzilla, use better SSL cipher suite [operations/puppet] - 10https://gerrit.wikimedia.org/r/126205 (owner: 10Dzahn) [09:50:18] (03PS1) 10Dzahn: add new Tech News atom feed to Planet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127222 [10:01:15] (03CR) 10Nemo bis: [C: 031] "Soon in your language!" [operations/puppet] - 10https://gerrit.wikimedia.org/r/127222 (owner: 10Dzahn) [10:06:22] !log Upgrading Jenkins to latest LTS version 1.532.3 [10:06:29] Logged the message, Master [10:10:05] !log Jenkins upgraded to 1.532.3. [10:10:11] Logged the message, Master [10:10:15] apergos: only 4 minutes \O/ Thank you very much. [10:10:26] yw [10:24:48] (03CR) 10Matanya: Pass puppet-lint on realm.pp (032 comments) [operations/puppet] - 10https://gerrit.wikimedia.org/r/127138 (owner: 10Hashar) [10:28:03] matanya: what do you mean by nesting at https://gerrit.wikimedia.org/r/#/c/127138/1/manifests/realm.pp ? [10:28:09] (03Abandoned) 10Matanya: openstack: qualify var [operations/puppet] - 10https://gerrit.wikimedia.org/r/119488 (owner: 10Matanya) [10:28:12] hi hashar [10:28:18] oh and hi :-] [10:28:24] on a very broken connection [10:29:08] שָׁלוֹם [10:29:31] my hebrew is as good as copy pasting from https://en.wikipedia.org/wiki/Jewish_greetings [10:35:12] sorry hashar did you see my reply ? [10:35:29] matanya: nop [10:35:44] https://dpaste.de/3Kwz [10:35:51] ack [10:36:02] more readable i think [10:36:27] ohh [10:37:20] (03CR) 10Hashar: Pass puppet-lint on realm.pp (032 comments) [operations/puppet] - 10https://gerrit.wikimedia.org/r/127138 (owner: 10Hashar) [10:37:38] matanya: thanks :] [10:37:42] (03PS2) 10Hashar: Pass puppet-lint on realm.pp [operations/puppet] - 10https://gerrit.wikimedia.org/r/127138 [11:02:00] (03CR) 10Matanya: puppet-lint role/nova.pp (0316 comments) [operations/puppet] - 10https://gerrit.wikimedia.org/r/127147 (owner: 10Hashar) [11:05:06] (03CR) 10Odder: [C: 031] add new Tech News atom feed to Planet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127222 (owner: 10Dzahn) [11:06:47] (03CR) 10Dzahn: [C: 032] add new Tech News atom feed to Planet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127222 (owner: 10Dzahn) [11:35:03] (03PS1) 10Hashar: zuul: compress log daily [operations/puppet] - 10https://gerrit.wikimedia.org/r/127230 [11:37:45] (03PS3) 10ArielGlenn: contint: gives access to Bryan Davis [operations/puppet] - 10https://gerrit.wikimedia.org/r/126155 (owner: 10Hashar) [11:39:33] (03CR) 10ArielGlenn: [C: 032] contint: gives access to Bryan Davis [operations/puppet] - 10https://gerrit.wikimedia.org/r/126155 (owner: 10Hashar) [11:40:35] (03PS2) 10Hashar: contint: compress Jenkins console logs once per day [operations/puppet] - 10https://gerrit.wikimedia.org/r/125991 [11:41:12] (03CR) 10Hashar: contint: compress Jenkins console logs once per day (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/125991 (owner: 10Hashar) [11:45:22] (03CR) 10JanZerebecki: [C: 031] "Refusing users that only have support for less secure protocols (like max. SSL3 for IE6 on Windows XP) can still be done in an additional " [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 (owner: 10Dzahn) [11:48:56] !log removing mw-jenkinsbot (the wikimedia jenkins installation) from #wikimedia-labs [11:49:02] Logged the message, Master [12:11:56] (03CR) 10Hoo man: "> Removing the mysql client, given it's merely a utility and not a service, won't really affect security, traffic, or load. Just saying." [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [12:26:41] (03CR) 10Hoo man: [C: 031] "One step at a time :) We maybe also need a new group which is like the old restricted (but below mortals)." [operations/puppet] - 10https://gerrit.wikimedia.org/r/126941 (owner: 10Dzahn) [12:31:18] (03CR) 10Dzahn: [C: 032] bugzilla, use better SSL cipher suite [operations/puppet] - 10https://gerrit.wikimedia.org/r/126205 (owner: 10Dzahn) [12:32:27] (03PS2) 10Dzahn: bugzilla, use SSLProtocol ALL -SSLv2 [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 [12:34:27] (03CR) 10Dzahn: [C: 032] bugzilla, use SSLProtocol ALL -SSLv2 [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 (owner: 10Dzahn) [12:35:38] (03CR) 10Hoo man: [C: 031] remove sudo::appserver from bastions [operations/puppet] - 10https://gerrit.wikimedia.org/r/126014 (owner: 10Dzahn) [12:41:21] (03CR) 10Dzahn: "TLS 1.2 Yes" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126206 (owner: 10Dzahn) [12:42:32] (03CR) 10Dzahn: "no more RC4" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126205 (owner: 10Dzahn) [12:43:50] (03CR) 10Springle: "> Well, it encourages users to misuse bastions, which *can* be quite risky if someone gains access (eg. publicly readable .my.cnf, passwor" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [12:44:26] (03PS1) 10ArielGlenn: turn off rsyncs to/from dataset2, prep for 12th floor move [operations/puppet] - 10https://gerrit.wikimedia.org/r/127235 [13:06:56] !log Bugzilla Apache, changed SSL cipher suite in I7e9adc182dc ,might cost a a few % performance but zirconium had plenty [13:07:02] Logged the message, Master [13:13:36] (03PS1) 10Hashar: Jenkins job validation (DO NOT SUBMIT) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 [13:14:11] (03CR) 10Hashar: "recheck" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [13:14:37] (03CR) 10Hashar: "recheck" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [13:15:49] (03CR) 10Hashar: "recheck" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [13:16:04] (03CR) 10Hashar: "recheck" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [13:16:08] (03PS1) 10Dzahn: remove all Tampa ms-be swift boxes from puppet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127237 [13:17:41] paravoid: ^ are they going to be reinstalled? [13:18:09] ms-be [13:24:32] (03PS1) 10Dzahn: remove ms-be-1-12 from DHCP, netboot [operations/puppet] - 10https://gerrit.wikimedia.org/r/127239 [13:25:41] they need wiping [13:25:41] (03PS2) 10Dzahn: remove all Tampa ms-be swift boxes from puppet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127237 [13:25:55] non-destructive wiping that is [13:26:21] (03CR) 10coren: [C: 032] "We need legal review only in cases where the exposed information contains or potentially contains non-public information. Inspection of t" [operations/software] - 10https://gerrit.wikimedia.org/r/118582 (owner: 10Aude) [13:27:16] haha [13:27:21] better specify now [13:28:40] paravoid: ok, they dont need DHCP for that, was just wondering about netboot, partman recipe and stuff [13:28:56] correc [13:28:59] correct [13:29:26] (03PS2) 10Prtksxna: TextExtracts: Add classes and elements to the exclusion list [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/126226 [13:30:05] cmjohnson1 asked me to do the decom asap.. are you still copying files? [13:30:16] no [13:30:16] should i simply shutdown as well [13:30:19] ok [13:30:27] I'm copying files from eqiad to esams [13:30:31] never used tampa for that [13:30:36] k [13:31:30] (03CR) 10Dzahn: [C: 032] remove all Tampa ms-be swift boxes from puppet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127237 (owner: 10Dzahn) [13:33:09] (03CR) 10Prtksxna: "I've adapted the changes to https://gerrit.wikimedia.org/r/127170 and added comments." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/126226 (owner: 10Prtksxna) [13:33:15] (03PS2) 10Dzahn: remove ms-be-1-12 from DHCP, netboot [operations/puppet] - 10https://gerrit.wikimedia.org/r/127239 [13:33:20] (03CR) 10Prtksxna: TextExtracts: Add classes and elements to the exclusion list (032 comments) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/126226 (owner: 10Prtksxna) [13:33:40] mutante: [13:33:42] also ms-fe [13:34:12] paravoid: ok [13:34:14] and also, ms-be.cfg & ms-be-ssd.cfg shouldn't be needed anymore [13:34:24] thanks [13:34:27] so ditch those two [13:34:39] off now, ttyl [13:34:46] cya [13:54:23] !log ms-be1-12 - removing from puppet,salt,icinga [13:54:29] Logged the message, Master [13:58:57] PROBLEM - Host ps1-a3-sdtpa is DOWN: PING CRITICAL - Packet loss = 100% [13:59:07] PROBLEM - Host ps1-a5-sdtpa is DOWN: PING CRITICAL - Packet loss = 100% [13:59:27] PROBLEM - Host ps1-a4-sdtpa is DOWN: PING CRITICAL - Packet loss = 100% [14:01:35] (03CR) 10Dzahn: [C: 032] remove ms-be-1-12 from DHCP, netboot [operations/puppet] - 10https://gerrit.wikimedia.org/r/127239 (owner: 10Dzahn) [14:12:27] (03PS1) 10Dzahn: remove ms-fe[14] from DHCP,remove partman recipes [operations/puppet] - 10https://gerrit.wikimedia.org/r/127244 [14:12:44] (03CR) 10jenkins-bot: [V: 04-1] remove ms-fe[14] from DHCP,remove partman recipes [operations/puppet] - 10https://gerrit.wikimedia.org/r/127244 (owner: 10Dzahn) [14:13:06] (03PS2) 10Dzahn: remove ms-fe[14] from DHCP,remove partman recipes [operations/puppet] - 10https://gerrit.wikimedia.org/r/127244 [14:16:21] (03CR) 10Dzahn: [C: 032] remove ms-fe[14] from DHCP,remove partman recipes [operations/puppet] - 10https://gerrit.wikimedia.org/r/127244 (owner: 10Dzahn) [14:19:15] (03PS1) 10Dzahn: remove ms-fe[14] from puppet, decom [operations/puppet] - 10https://gerrit.wikimedia.org/r/127245 [14:20:10] (03CR) 10Dzahn: [C: 032] remove ms-fe[14] from puppet, decom [operations/puppet] - 10https://gerrit.wikimedia.org/r/127245 (owner: 10Dzahn) [14:24:17] !log ms-fe[14] - stop puppet,revoke certs,remove icinga [14:24:22] Logged the message, Master [14:25:11] (03PS1) 10ArielGlenn: move the puppet snmptrap into a class so it can be run in last stage [operations/puppet] - 10https://gerrit.wikimedia.org/r/127246 [14:27:05] manybubbles: mornin! [14:27:27] ottomata: morning! [14:27:31] time for 1007 & 1008 [14:27:32] ? [14:31:25] PROBLEM - Host es6 is DOWN: PING CRITICAL - Packet loss = 100% [14:31:34] PROBLEM - Host es5 is DOWN: PING CRITICAL - Packet loss = 100% [14:33:24] (03PS1) 10Matanya: swift: remove swift role from tampa [operations/puppet] - 10https://gerrit.wikimedia.org/r/127247 [14:33:40] (03CR) 10jenkins-bot: [V: 04-1] swift: remove swift role from tampa [operations/puppet] - 10https://gerrit.wikimedia.org/r/127247 (owner: 10Matanya) [14:36:02] manybubbles: 1007 & 1008? shall I start? [14:36:11] sure! [14:36:15] was just in a meeting but donw now [14:36:18] done now [14:36:19] ah ok, moving shards off [14:37:22] !log ms-be 1-12, Tampa Swift boxes, shutdown [14:37:27] Logged the message, Master [14:38:04] grr..es5 and es6 are throwing icinga msgs ...should've have been decom'd [14:41:40] !log disabling puppet on stat1 for decom [14:41:46] Logged the message, Master [14:43:13] !log ms-fe[14] - shutting down [14:43:19] Logged the message, Master [14:46:57] (03PS1) 10Ottomata: Removing references to stat1, adding stat1 to decomissioning.pp [operations/puppet] - 10https://gerrit.wikimedia.org/r/127250 [14:47:26] mutante: you saw my reply on ssl chain? [14:48:00] jeremyb: no [14:48:28] ottomata: while I have you, elastic1001 is reporting down in ganglia [14:48:39] mutante: https://gerrit.wikimedia.org/r/111386 [14:50:49] (03CR) 10Ottomata: [C: 032 V: 032] Removing references to stat1, adding stat1 to decomissioning.pp [operations/puppet] - 10https://gerrit.wikimedia.org/r/127250 (owner: 10Ottomata) [14:51:38] there it goes.... [14:51:45] !log powering down stat1 for decom [14:51:51] Logged the message, Master [14:54:05] !log es5,es6 - revoke puppet certs, salt keys, icinga [14:54:11] Logged the message, Master [14:54:18] ottomata: I'm going to step out for about 45 minutes. ping you can call me if anything blows up but I think its all pretty normal stuff. BTW, the current cluster master is 1002. 1001 was the master when you restarted it so I don't even think we'll get another master election during this process [14:54:47] ACKNOWLEDGEMENT - Host es5 is DOWN: PING CRITICAL - Packet loss = 100% daniel_zahn RT #6266 [14:54:48] ACKNOWLEDGEMENT - Host es6 is DOWN: PING CRITICAL - Packet loss = 100% daniel_zahn RT #6266 [14:56:09] cool, yeha, no probs [14:56:13] wait, what? [14:56:19] 1001 came back as a master? [14:56:26] manybubbles|away: ^ [14:59:43] PROBLEM - Host labstore4 is DOWN: PING CRITICAL - Packet loss = 100% [14:59:55] jeremyb: https://gerrit.wikimedia.org/r/#/c/126008/1/manifests/certs.pp [15:01:30] ottomata: no, sorry, 1001 came back as non-master [15:01:32] its all good [15:01:38] when you bounced 1001 1002 took over [15:01:56] and now that it has taken over there should be no need for a master election as you bounce the other machines today [15:02:24] its just find [15:07:54] (03CR) 10Hashar: "random thoughts." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [15:08:12] (03CR) 10Hashar: "recheck" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [15:08:50] (03CR) 10Hashar: "the magic trick seems to work now :-]" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [15:09:39] (03CR) 10Hashar: "recheck" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [15:10:38] (03Abandoned) 10Hashar: Jenkins job validation (DO NOT SUBMIT) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127236 (owner: 10Hashar) [15:10:57] (03PS2) 10ArielGlenn: formey: decom [operations/puppet] - 10https://gerrit.wikimedia.org/r/126976 (owner: 10Matanya) [15:13:01] (03PS2) 10BBlack: Only tag 470-07 if going through proxy. [operations/puppet] - 10https://gerrit.wikimedia.org/r/125347 (owner: 10Dr0ptp4kt) [15:13:29] wait, but manybubbles|away [15:13:48] (03CR) 10ArielGlenn: [C: 032] formey: decom [operations/puppet] - 10https://gerrit.wikimedia.org/r/126976 (owner: 10Matanya) [15:13:48] so right now, 1002, 1007 and 1013 are masters, right? [15:13:52] (03PS1) 10JanZerebecki: bugzilla apache config: disable caching directives [operations/puppet] - 10https://gerrit.wikimedia.org/r/127254 [15:13:56] (03CR) 10Dzahn: [C: 032] remove lvs1-6 lvs1-6.wikimedia.org [operations/dns] - 10https://gerrit.wikimedia.org/r/126954 (owner: 10Dzahn) [15:14:04] and we are about to do the master dance with 1007 and 1008 [15:14:11] won't 1008 become the new master when we take down 1007? [15:14:49] (03PS3) 10BBlack: Only tag 470-07 if going through proxy. [operations/puppet] - 10https://gerrit.wikimedia.org/r/125347 (owner: 10Dr0ptp4kt) [15:14:57] (03CR) 10BBlack: [C: 032 V: 032] Only tag 470-07 if going through proxy. [operations/puppet] - 10https://gerrit.wikimedia.org/r/125347 (owner: 10Dr0ptp4kt) [15:15:03] !log DNS update - removing lvs1-6 [15:15:09] Logged the message, Master [15:16:41] (03PS1) 10Andrew Bogott: Remove labstore1 and 2 from puppet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127255 [15:19:05] (03PS2) 10Andrew Bogott: Remove labstore1 and 2 from puppet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127255 [15:20:39] (03PS2) 10BBlack: Set domain to TLD on GeoIP cookie [operations/puppet] - 10https://gerrit.wikimedia.org/r/127131 (owner: 10Ori.livneh) [15:21:42] (03PS3) 10Dzahn: remove Tampa appserver reverse DNS and mgmt [operations/dns] - 10https://gerrit.wikimedia.org/r/127210 [15:22:29] (03CR) 10Andrew Bogott: [C: 032] Remove labstore1 and 2 from puppet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127255 (owner: 10Andrew Bogott) [15:23:16] (03CR) 10Dzahn: [C: 032] remove Tampa appserver reverse DNS and mgmt [operations/dns] - 10https://gerrit.wikimedia.org/r/127210 (owner: 10Dzahn) [15:24:01] !log DNS update - removing all the Tampa mw/srv mgmt [15:24:06] Logged the message, Master [15:26:10] PROBLEM - Host labstore1 is DOWN: PING CRITICAL - Packet loss = 100% [15:26:10] PROBLEM - Host labstore2 is DOWN: PING CRITICAL - Packet loss = 100% [15:26:43] (03CR) 10Jgreen: [C: 031] Remove db48 and db49 from OTRS mail duties. db49 is decommissioned already so hasn't worked as a secondary for a while. [operations/puppet] - 10https://gerrit.wikimedia.org/r/126203 (owner: 10Springle) [15:28:03] (03PS2) 10ArielGlenn: formey:decom [operations/dns] - 10https://gerrit.wikimedia.org/r/126978 (owner: 10Matanya) [15:28:15] (03PS2) 10Springle: Remove db48 and db49 from OTRS mail duties. db49 is decommissioned already so hasn't worked as a secondary for a while. [operations/puppet] - 10https://gerrit.wikimedia.org/r/126203 [15:28:22] those warnings are my fault, puppet is taking forever on neon (as always) [15:28:29] (03CR) 10Springle: [C: 032] Remove db48 and db49 from OTRS mail duties. db49 is decommissioned already so hasn't worked as a secondary for a while. [operations/puppet] - 10https://gerrit.wikimedia.org/r/126203 (owner: 10Springle) [15:28:39] (03CR) 10ArielGlenn: [C: 032] formey:decom [operations/dns] - 10https://gerrit.wikimedia.org/r/126978 (owner: 10Matanya) [15:28:42] (03PS1) 10JanZerebecki: bugzilla: enable strict transport security [operations/puppet] - 10https://gerrit.wikimedia.org/r/127256 [15:32:25] (03CR) 10JanZerebecki: "Though probably correct, not actually tested." [operations/puppet] - 10https://gerrit.wikimedia.org/r/127256 (owner: 10JanZerebecki) [15:33:10] ottomata: 1002, 1007, and 1008 are master elgiible [15:33:13] but only 1002 is the master [15:33:28] sorry 1007 and 1013 can take over [15:33:34] but they won't unless 1001 goes down [15:33:51] the quorum that Elasticsearch needs is two out of three eligible masters online [15:36:51] paravoid: Jeff_Green was a outage report written for the fundraising banner issue from yesterday? [15:37:35] (03CR) 10BBlack: [C: 04-1] "A couple things:" [operations/puppet] - 10https://gerrit.wikimedia.org/r/127131 (owner: 10Ori.livneh) [15:38:24] (03PS1) 10Dzahn: remove ms-be/ms-fe Tampa boxes [operations/dns] - 10https://gerrit.wikimedia.org/r/127261 [15:38:29] !log switched mchenry to use db1048/db1049 for OTRS address lookups [15:38:34] Logged the message, Master [15:38:56] greg-g: afaik paravoid was working on a report but I haven't seen it yet [15:40:39] Jeff_Green: k, I'd love to chat with K4 about it today, and having something to point at would help, but no major rush (I see the flurry of tampa shutdown activity) [15:41:45] greg-g: sure, I would just track her down in the fundraising channel, the tampa stuff doesn't really affect fundraising anymore [15:42:15] (03PS2) 10Dzahn: remove ms-be/ms-fe Tampa boxes [operations/dns] - 10https://gerrit.wikimedia.org/r/127261 [15:43:22] Jeff_Green: what's that channel? [15:43:39] #wikimedia-fundraising [15:43:43] * greg-g prepares to go to window shortcut "G" [15:43:50] logical [15:44:40] (03CR) 10Dzahn: [C: 032] remove ms-be/ms-fe Tampa boxes [operations/dns] - 10https://gerrit.wikimedia.org/r/127261 (owner: 10Dzahn) [15:44:59] (03PS1) 10coren: Remove all traces of labstore[34] from puppet [operations/puppet] - 10https://gerrit.wikimedia.org/r/127262 [15:45:21] !log DNS update - removing Tampa msbe/msfe [15:45:27] Logged the message, Master [15:47:12] (03PS1) 10Springle: Remove db48 from m2. [operations/puppet] - 10https://gerrit.wikimedia.org/r/127263 [15:48:25] PROBLEM - MySQL Slave Delay on db1048 is CRITICAL: CRIT replication delay 336 seconds [15:49:25] PROBLEM - Varnish HTTP mobile-backend on cp3014 is CRITICAL: Connection refused [15:49:26] (03PS1) 10Dzahn: remove labstore 1-4 [operations/dns] - 10https://gerrit.wikimedia.org/r/127264 [15:49:54] hey manybubbles [15:50:02] yo [15:50:04] role/elasticsearch.pp says that 1008 is master eligible [15:50:05] not 1007 [15:50:08] is that correct? [15:50:27] ottomata: as in, right now. let me check [15:50:31] yeah [15:50:53] ottomata: that is correct [15:50:56] I remember! [15:51:01] 1007 was broken for a long time [15:51:10] it kept rebooting so we turned it off until it was fixes [15:51:12] ah right ok, well that's fine then, right? we just do the dance backwards? [15:51:14]