[00:18:27] $ dsh -g apaches -cM 'test -e /sys/fs/cgroup/memory || echo help'
[00:18:27] mw1035: help
[00:18:27] mw1145: help
[00:18:27] mw1078: help
[00:18:27] mw1152: help
[00:18:27] mw1150: help
[00:21:35] !log some apaches are missing /sys/fs/cgroup/* so shell execution from MW fails. Stopped apache on mw1035 during investigation
[00:21:45] Logged the message, Master
[00:23:39] PROBLEM - Apache HTTP on mw1035 is CRITICAL: Connection refused
[00:24:00] it's not mounted
[00:25:06] (CR) Reedy: "(1 comment)" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85165 (owner: Reedy)
[00:29:15] !log on mw1035,mw1145,mw1078,mw1152,mw1150: running "start cgconfig" to fix cgroups
[00:29:26] Logged the message, Master
[00:31:06] (PS1) Reedy: Rename www portals files to drop wikipedia from name [operations/apache-config] - https://gerrit.wikimedia.org/r/89760
[00:35:12] I love our apache configs
[00:35:14] #arbcom_nlwiki Wiki
[00:35:14]
[00:35:14] DocumentRoot "/usr/local/apache/common/docroot/arbcom_nlwiki"
[00:35:14] ServerName arbcom.de.wikimedia.org
[00:36:15] Already removed in https://gerrit.wikimedia.org/r/#/c/89329/
[00:37:19] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 00:37:16 UTC 2013
[00:37:24] more cgroup breakage: http://paste.tstarling.com/p/PBaCGB.html
[00:37:39] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[00:41:34] !log manually created MW cgroups on all apaches since apparently the init script is totally broken
[00:41:47] Logged the message, Master
[00:59:39] RECOVERY - Apache HTTP on mw1035 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.296 second response time
[01:03:46] springle, awake?
[01:05:27] andrewbogott: yup
[01:05:36] can you help me troubleshoot a broken mysql install?
[01:05:46] ok
[01:05:55] on labs instance wikimetrics
[01:06:10] I've modified things a bit and now mysql won't start. I think it's an apparmor issue but I can't pin it down.
[01:06:35] hm… probably I need to give you access to the system; what's your username on wikitech?
[01:06:49] Springle
[01:07:12] ok -- let me know if you can't access or can't sudo or anything.
[01:07:28] I think it has a public ip… probably you can just ssh wikimetrics.wmflabs.org
[01:08:02] hm, maybe not, I guess you need to go via bastion
[01:12:15] where is mysqld err going? syslog?
[01:12:33] /var/log/mysql
[01:20:13] definitely apparmor
[01:21:18] andrewbogott: what was the recent modification you made? my.cnf or apparmor mysqld profile?
[01:21:40] Well...
[01:21:59] Switching over to the mysql module. It didn't manage apparmor formerly, so I'm trying to get things caught up.
[01:22:02] So, both...
[01:22:20] But at this point it's the necessary apparmor changes that I'm curious about.
[01:28:01] springle, it seems like apparmor is protecting the datadir, but the setting looks right to me...
[01:28:54] i just added the /mnt/mysql/** rwk, line
[01:28:59] seems happy now
[01:29:17] andrewbogott: ^
[01:30:55] hm, ok… I must be confused about where the proper datadir is.
[01:31:12] Anyway, thanks, I can sort it out.
[01:31:13] proper?
[01:32:22] /etc/mysql/my.cnf:20 is what i assumed the datadir to be :)
[01:32:24] (PS1) Ryan Lane: Fix docroot ownership for sartoris project [operations/puppet] - https://gerrit.wikimedia.org/r/89761
[01:33:24] springle: yeah, I was distracted by /var/lib/mysql
[01:34:32] :)
[01:37:29] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 01:37:22 UTC 2013
[01:37:39] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[01:47:44] (CR) Ryan Lane: [C: 2] Fix docroot ownership for sartoris project [operations/puppet] - https://gerrit.wikimedia.org/r/89761 (owner: Ryan Lane)
[01:53:33] !log rebooting mw1035 for init testing
[01:53:51] Logged the message, Master
[01:54:56] (PS1) Andrew Bogott: Further attempts at mysql module apparmor. [operations/puppet] - https://gerrit.wikimedia.org/r/89762
[01:56:49] PROBLEM - Host mw1035 is DOWN: PING CRITICAL - Packet loss = 100%
[01:57:01] (CR) Andrew Bogott: [C: 2] Further attempts at mysql module apparmor. [operations/puppet] - https://gerrit.wikimedia.org/r/89762 (owner: Andrew Bogott)
[01:57:19] RECOVERY - Host mw1035 is UP: PING OK - Packet loss = 0%, RTA = 2.43 ms
[01:59:39] PROBLEM - Apache HTTP on mw1035 is CRITICAL: Connection refused
[02:01:39] RECOVERY - Apache HTTP on mw1035 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.093 second response time
[02:01:50] (PS1) Andrew Bogott: More app-armor floundering! [operations/puppet] - https://gerrit.wikimedia.org/r/89763
[02:02:56] (CR) Andrew Bogott: [C: 2] More app-armor floundering! [operations/puppet] - https://gerrit.wikimedia.org/r/89763 (owner: Andrew Bogott)
[02:14:53] !log LocalisationUpdate completed (1.22wmf20) at Tue Oct 15 02:14:53 UTC 2013
[02:15:10] Logged the message, Master
[02:20:18] !log LocalisationUpdate completed (1.22wmf21) at Tue Oct 15 02:20:18 UTC 2013
[02:20:33] Logged the message, Master
[02:25:32] (PS1) Andrew Bogott: Switch labs instances to use the mysql module. [operations/puppet] - https://gerrit.wikimedia.org/r/89764
[02:37:25] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 02:37:21 UTC 2013
[02:37:39] !log LocalisationUpdate ResourceLoader cache refresh completed at Tue Oct 15 02:37:39 UTC 2013
[02:37:45] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[02:37:51] Logged the message, Master
[02:48:10] (PS1) MZMcBride: Minor tweaks to admins.pp. [operations/puppet] - https://gerrit.wikimedia.org/r/89765
[03:27:47] do we have generic names for the newer & older branches in production? like 'unstable' / 'stable', or something like that.
[03:28:43] there was some talk of having names for git-deploy to use. but those would be alternating. you couldn't know from the name which was newer
[03:29:26] hmm, thanks.
[03:30:55] i suppose you could do "which does the "php" symlink point to?" programatically
[03:31:11] and it's also easy to get a list of all current versions
[03:31:14] ori-l
[03:37:31] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 03:37:21 UTC 2013
[03:37:41] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[03:39:38] jeremyb: yes, i was just fishing for a less clumsy locution for "the newer of the two branches deployed to production at a given point in time"
[03:44:04] ori-l: oh, you're talking to a human not a machine, huh
[03:44:24] yes
[04:37:27] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 04:37:18 UTC 2013
[04:37:47] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[05:16:56] RECOVERY - Host cp1052 is UP: PING OK - Packet loss = 0%, RTA = 0.29 ms
[05:18:53] !log powercycled cp1052, it was unresponsive and unreachable via mgmt
[05:19:12] Logged the message, Master
[05:37:26] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 05:37:17 UTC 2013
[05:37:46] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[05:46:57] (PS4) TTO: Miscellaneous cleanup of InitialiseSettings [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/86415
[06:37:26] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 06:37:18 UTC 2013
[06:37:46] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[06:53:12] (PS1) ArielGlenn: depool db1022 to use as source for clone of other db [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89772
[06:55:06] (CR) ArielGlenn: [C: 2] depool db1022 to use as source for clone of other db [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89772 (owner: ArielGlenn)
[06:56:51] !log ariel synchronized wmf-config/db-eqiad.php 'db1022 (s6) depooled to use as clone source'
[06:57:05] Logged the message, Master
[07:24:15] cgroups again, sigh
[07:37:30] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 07:37:22 UTC 2013
[07:37:40] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[07:56:40] PROBLEM - RAID on ms-be1008 is CRITICAL: CRITICAL: 1 failed logical drive(s) (Offline)
[07:57:10] PROBLEM - Disk space on ms-be1008 is CRITICAL: DISK CRITICAL - /srv/swift-storage/sdc1 is not accessible: Input/output error
[08:00:12] RECOVERY - Disk space on ms-be1008 is OK: DISK OK
[08:11:01] (CR) Akosiaris: [C: 2] Minor tweaks to admins.pp. [operations/puppet] - https://gerrit.wikimedia.org/r/89765 (owner: MZMcBride)
[08:14:07] (CR) Akosiaris: [V: 2] Minor tweaks to admins.pp. [operations/puppet] - https://gerrit.wikimedia.org/r/89765 (owner: MZMcBride)
[08:14:53] (PS1) ArielGlenn: db1006 (s6) -> file_per_table, mariadb [operations/puppet] - https://gerrit.wikimedia.org/r/89776
[08:15:54] (CR) ArielGlenn: [C: 2] db1006 (s6) -> file_per_table, mariadb [operations/puppet] - https://gerrit.wikimedia.org/r/89776 (owner: ArielGlenn)
[08:17:50] who's got the changes to modules/mysql/templates/apparmor.template.usr.sbin.mysqld.erb unmerged?
[08:18:00] seems like andrewbogott
[08:18:06] i am looking at the same thing now
[08:18:22] worst thing is: git show 9e3493dc
[08:18:39] says it is not just those 2 lines ... but 5 in total
[08:18:42] andrewbogott_afk:
[08:19:04] yep
[08:19:54] i think we don't use the module however...
[08:20:01] at least not yet...
[08:23:42] so ... it's two different patchsets
[08:23:54] 9e3493dca52bdef910ca89801e2a7ea09fd67ecc and 1ed783d5aea635c6e6cce867f6110517db29f5ba
[08:24:06] I'm going to merge it (including the admin.pp changes)
[08:24:25] he is doing something and then partly undoing it...
[08:24:43] it probably won't hurt but i am unsure of what he was trying to achieve
[08:25:22] since it's not going to affect the production myqls (afaict)
[08:25:39] merged
[08:25:52] and I'll just check on a live db host anyways
[08:28:20] looks great
[08:28:29] seems like it will affect neon. For some reason icinga::monitor includes mysql, probably to have virtual resources realized
[08:28:51] mmmm
[08:29:02] well let's see what it does over there
[08:29:52] mm puppet takes a while to run there (but it does complete eventually)
[08:30:03] sometimes
[08:30:13] cause others it just says execution expired
[08:35:07] actually since reducing run frequency (and getting errors in the manifest fixed) neon has been completing runs consistently
[08:36:28] naggen neeeds a rewrite
[08:36:40] but maybe we can just ditch it and go with puppetdb
[08:37:29] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 08:37:22 UTC 2013
[08:49:04] anyways seems like that change didn't touch neon
[08:49:14] :-)
[08:51:36] do you guys have experience with puppetdb? if so, how was it?
[09:03:15] i don't have any... But it is something we should look at
[09:06:03] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[09:29:45] hello
[09:34:51] hashar: hello \O/
[09:36:10] (PS1) Ori.livneh: NavigationTiming: aggregate data by bits cache host; add 'totalPageLoadTime' [operations/puppet] - https://gerrit.wikimedia.org/r/89785
[09:36:16] anyone available to review?
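A minimal sketch of the two Python points raised in the review that follows (09:37 to 09:46): a format string with four placeholders applied to a three-element tuple, and whether meta.get('recvFrom', '').split('.')[0] can fail. Only those two expressions come from the chat. The helper name first_label, the sample dicts, and the hostname cp1056.eqiad.wmnet are invented for illustration.

```python
def first_label(meta):
    """Return the first dot-separated label of meta['recvFrom'], or ''."""
    return meta.get('recvFrom', '').split('.')[0]


# Hypothetical sample events; only the 'recvFrom' key name comes from the chat.
print(first_label({'recvFrom': 'cp1056.eqiad.wmnet'}))  # key present
print(repr(first_label({})))  # key absent: ''.split('.') is [''], so no IndexError

# A key that is present but holds None bypasses the default and raises.
try:
    first_label({'recvFrom': None})
except AttributeError as exc:
    print('AttributeError:', exc)

# Four %s placeholders with a three-element tuple raises TypeError.
try:
    'browser.%s.%s.%s:%s|ms' % ('metric', 'bits_cache', 1)
except TypeError as exc:
    print('TypeError:', exc)
```

The default passed to get() only applies when the key is missing, which is why an explicit None still reaches split() and fails.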
[09:37:23] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 09:37:21 UTC 2013
[09:37:51] ori-l: stat = 'browser.%s.%s.%s:%s|ms' % (metric, bits_cache, value)
[09:38:03] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[09:38:07] format string says 4, tuple is 3 elements
[09:38:07] god damn it, good eye
[09:38:21] ok, you pass the test
[09:38:32] :-)
[09:38:59] (PS2) Ori.livneh: NavigationTiming: aggregate data by bits cache host; add 'totalPageLoadTime' [operations/puppet] - https://gerrit.wikimedia.org/r/89785
[09:39:07] now the real commit
[09:41:00] just out of curiosity...
[09:41:01] bits_cache = meta.get('recvFrom', '').split('.')[0]
[09:41:09] this line will always succeed ?
[09:41:20] or can it be that there will be an indexError ?
[09:41:28] not sure what is does btw... :-)
[09:42:21] it can't be an index error
[09:42:53] ''.split('nonexistent separator')[0] = ''
[09:43:18] it gets the first part of the hostname
[09:44:24] ok, thanks. LGTM then. Want me to merge now ?
[09:44:38] sure, thanks!
[09:45:55] akosiaris: meta is a has, the get() methods attempt to retrieve the value for the key 'recvFrom', if there the key does not exist it returns an empty string.
[09:45:56] BUT
[09:46:02] if the key contains None, you get an error :-]
[09:46:22] >>> {'key': None}.get('key','').split('foo')
[09:46:22] hashar: SyntaxError: Unexpected token :
[09:46:23] AttributeError: 'NoneType' object has no attribute 'split'
[09:46:37] ecmabot-wm: learn python
[09:46:37] hashar: There is no command: learn python
[09:47:32] hashar: lol... yeah it makes sense. I suppose though that scenario will not show up ?
[09:48:02] cause you need code to explicitly set recvFrom to None
[09:48:33] which sounds bad from an architectural point of view... Just not having the key in the hash is safer
[09:50:29] (CR) Akosiaris: [C: 2] "After a quick chat with Ori on IRC about a question I had LGTM." [operations/puppet] - https://gerrit.wikimedia.org/r/89785 (owner: Ori.livneh)
[09:51:24] yes, it's not going to be None
[09:52:20] akosiaris: if it was for the space shuttle, I would catch the exception and run some fallback code :-]
[09:52:24] but he
[09:52:31] we are not in rocket science
[09:52:55] so yeah, as long as the 'recvFrom' is set to something having a split() method, we should be fine
[09:52:55] i it was a space shuttle i would require you to mathematically prove every line of code :P
[09:53:02] this is consumed from the validated event stream
[09:53:08] it is a string or its absent
[09:53:20] I have met someone working for Airbus
[09:53:49] I can't recall the detail, but basically they are using some language that let you mathematically prove that all possible code paths combinaisons are valid
[09:53:53] you can still have bugs though
[09:54:15] ATS or Haskell, usually
[09:54:18] but at least they are feature, not that much unexpected one
[09:54:42] sometimes ML
[09:54:48] ah ADA
[09:55:00] I think it was that one
[09:55:14] we got someone at the WMF that used to do avionic programming, not sure who though
[09:55:29] mwalker used to do embedded stuff IIRC
[09:58:13] well, puppet was pretty much blazing fast
[09:58:19] and i see that cpu usage on stafford is down too
[09:58:27] what was it? just the change to 60 mins rather than 30/
[10:00:40] I guess so
[10:00:52] I am not sure how puppet client /server works
[10:01:08] but it seems the client request a catalog using the unix epoch
[10:01:16] which would trigger a catalog recompilation on the server
[10:02:01] with 600 servers we have like 10 recompilations per minutes now instead of 20
[10:02:11] (pure speculations)
[10:03:10] I have seen some installations using something like `git rev-parse HEAD` for the catalog version
[10:03:27] and a friend just installed a varnish in front of puppet to cache catalogs :D
[10:03:45] some others are rsyncing the puppet files to all servers which run their own local puppetmaster
[10:03:48] but that is evil
[10:05:35] akosiaris: thank you for the merge of my evil git::clone( integration/jenkins ) ( https://gerrit.wikimedia.org/r/87058 )
[10:05:49] hashar: :-)
[10:06:06] /sbin/iptables -A tftp -s 198.35.26.0/22 -j ACCEPT
[10:06:11] I will eventually migrate it to Sartoris whenever I found out how to use it :-]
[10:06:13] * akosiaris sad....
[10:06:21] we don't even own /22... we own /23...
[10:06:52] we need to put all those things in one single place. That info is scattered all over the place.... :-(
[10:08:19] yes, just recuding the puppet run frequency had that impact on stafford
[10:08:22] *reducing
[10:13:57] ori-l: if still around I was wondering whether you were still maintaining deployment-fluoride.pmtpa.wmflabs . It is a receiver for some log2udp on ports 8423 and 8324 but there are no process listening on deployment-fluoride
[10:14:18] ori-l: not really important though
[10:26:16] (PS1) ArielGlenn: repool db1006 (s6) warming up, after upgrade/convert to mariadb [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89790
[10:27:43] (CR) ArielGlenn: [C: 2] repool db1006 (s6) warming up, after upgrade/convert to mariadb [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89790 (owner: ArielGlenn)
[10:29:18] !log ariel synchronized wmf-config/db-eqiad.php 'db1006 (s6) repooled, warming up after upgrade/conversion'
[10:29:34] Logged the message, Master
[10:33:27] (PS1) Akosiaris: Adding IPv4/6 networks in ferms defs [operations/puppet] - https://gerrit.wikimedia.org/r/89791
[10:37:24] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 10:37:20 UTC 2013
[10:37:54] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[10:47:44] RECOVERY - HTTPS on cp4003 is OK: OK - Certificate will expire on 01/20/2016 12:00.
[10:47:54] RECOVERY - HTTPS on cp4004 is OK: OK - Certificate will expire on 01/20/2016 12:00.
[10:48:04] RECOVERY - HTTPS on cp4002 is OK: OK - Certificate will expire on 01/20/2016 12:00.
[10:50:28] (CR) Mark Bergsma: [C: -1] "I don't like doing this in a temporary and not very well defined way, because it will get used all over the place before we ever get aroun" [operations/puppet] - https://gerrit.wikimedia.org/r/89791 (owner: Akosiaris)
[11:29:09] mark: You are right about this https://gerrit.wikimedia.org/r/#/c/89791. It's just that i needed it in an install-server module rewrite and it was an even worse idea to stick it in there. I suppose I can stall the module till I get the time to chase all this down and clear it up. Aside from this, any objections on the approach of using network.pp to document all networks and using it to populate firewall rules ?
[11:29:26] that's exactly what we planned
[11:29:36] that's the thing; we want ferm to finally do this the right way and fix all this
[11:29:43] by putting temporary measures in there too, it will never happen :)
[11:30:02] I'd much rather have yet another temp crappy old iptables rule in puppet than to do almost the same thing in ferm :)
[11:30:46] ok. Stalling the install-server module until we find the time to clean that thing up then
[11:30:54] thanks
[11:36:45] going to the ripe venue now
[11:37:24] !log jenkins refreshing all mediawiki extensions jobs so they log Junit results under /logs/
[11:37:25] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 11:37:20 UTC 2013
[11:37:37] Logged the message, Master
[11:37:55] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[12:09:26] (PS1) Lcarr: fixing rancid file with new routers [operations/puppet] - https://gerrit.wikimedia.org/r/89808
[12:11:10] (CR) Lcarr: [C: 2] fixing rancid file with new routers [operations/puppet] - https://gerrit.wikimedia.org/r/89808 (owner: Lcarr)
[12:37:25] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 12:37:19 UTC 2013
[12:37:54] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[13:23:10] (PS1) Lcarr: IPv6 blocks for ulsfo interconnects and loopbacks [operations/dns] - https://gerrit.wikimedia.org/r/89822
[13:23:57] (CR) Lcarr: [C: 2] IPv6 blocks for ulsfo interconnects and loopbacks [operations/dns] - https://gerrit.wikimedia.org/r/89822 (owner: Lcarr)
[13:26:34] (PS12) Reedy: WIP don't deduce sites based on docroot stuff [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85165
[13:26:38] (PS13) Reedy: Don't deduce sites based on docroot stuff [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85165
[13:28:56] (PS1) ArielGlenn: db1006 (s6) back to full weight in the pool [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89824
[13:29:45] (CR) ArielGlenn: [C: 2] db1006 (s6) back to full weight in the pool [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89824 (owner: ArielGlenn)
[13:31:27] !log ariel synchronized wmf-config/db-eqiad.php 'db1006 (s6) back to full weight in pool'
[13:31:40] Logged the message, Master
[13:36:12] (PS1) Reedy: All non wikipedias to 1.22wmf21 [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89825
[13:37:25] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 13:37:23 UTC 2013
[13:37:55] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[13:59:21] (PS1) Reedy: Add urls file for testing with apache-fast-test etc [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89828
[13:59:47] (CR) Reedy: [C: 2] Add urls file for testing with apache-fast-test etc [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89828 (owner: Reedy)
[14:00:17] (Merged) jenkins-bot: Add urls file for testing with apache-fast-test etc [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89828 (owner: Reedy)
[14:22:54] (PS14) Reedy: Don't deduce sites based on docroot stuff [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85165
[14:23:20] (CR) Reedy: [C: 2] Don't deduce sites based on docroot stuff [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85165 (owner: Reedy)
[14:23:29] (Merged) jenkins-bot: Don't deduce sites based on docroot stuff [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85165 (owner: Reedy)
[14:23:45] (PS7) Reedy: Update tests to remove docroot setting [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85172
[14:23:50] (CR) Reedy: [C: 2] Update tests to remove docroot setting [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85172 (owner: Reedy)
[14:24:01] (Merged) jenkins-bot: Update tests to remove docroot setting [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85172 (owner: Reedy)
[14:29:50] (PS1) Reedy: Correct a few minor url issues to skip 404s etc [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89830
[14:30:33] (CR) Reedy: [C: 2] Correct a few minor url issues to skip 404s etc [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89830 (owner: Reedy)
[14:32:01] (Merged) jenkins-bot: Correct a few minor url issues to skip 404s etc [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89830 (owner: Reedy)
[14:35:29] !log reedy synchronized multiversion/ 'Changing away from docroot based site deduction'
[14:35:41] Logged the message, Master
[14:35:57] Well, enwiki isn't broken! :D
[14:35:59] aude: ^^
[14:36:45] :)
[14:37:06] brave to refactor that
[14:37:30] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 14:37:22 UTC 2013
[14:38:00] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[14:47:06] (CR) Reedy: "(1 comment)" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85165 (owner: Reedy)
[14:53:59] (CR) Reedy: "I've deployed the PHP code and that seems to be working fine, so this should be good to be retested for deployment :)" [operations/apache-config] - https://gerrit.wikimedia.org/r/84707 (owner: Reedy)
[14:57:23] Anyone around who fancies merging and pushing some apache config cleanup for me please?
[15:04:51] !log reedy synchronized php-1.22wmf21/extensions/Wikibase
[15:05:03] Logged the message, Master
[15:08:36] Reedy: do you have +2 on puppets? :)
[15:08:45] I don't, sorry :(
[15:08:56] (PS1) Yurik: Allow ESI only if enable_esi flag is set in the puppet cluster options [operations/puppet] - https://gerrit.wikimedia.org/r/89836
[15:08:56] its ok :)
[15:09:11] Reedy: but do you think you could explain how extract2 works for me? :)
[15:09:23] i want to make something similar for zero, need your expertise
[15:09:37] Uh
[15:09:38] i think you were the one who wrote that thing in the beginning
[15:09:41] yeah
[15:10:06] Haha, no
[15:10:17] If it all blames to me, then I probably imported it into git
[15:10:31] mark or paravoid, could you +2 https://gerrit.wikimedia.org/r/89836 - its for beta cluster testing of ESI
[15:10:52] Reedy: do you know who is the one to blame (explain) that thing?
[15:10:55] it looks scarry
[15:11:31] hashar: actually that thing was your suggestion :) ^
[15:11:44] What's difficult about it? It's relatively simple..
[15:13:29] q
[15:13:31] gah
[15:14:24] yurik_: that's the thing that makes the main page of www.wikipedia.org?
[15:14:33] aude: yep
[15:14:38] that thing scares me
[15:14:49] i am trying to figure out how to make it so that m.wiki and zero.wiki could have a custom startup page that is generated by apache instead of being a hardcoded wiki page on meta
[15:14:52] it gets the templates from meta wiki
[15:14:57] exactly
[15:15:09] parse / get content and displays it
[15:15:17] amazingly simple and scary :)
[15:15:19] but i need to run some code to generate that page if the X-CS header is set and if the user comes to m. or zero.
[15:15:30] oh, no images
[15:15:43] aude: no images is fine
[15:15:51] is that the requirement?
[15:15:59] or more?
[15:15:59] but there are lots of weird dirs in the config
[15:16:19] ?
[15:16:20] (CR) Mark Bergsma: [C: -1] "(1 comment)" [operations/puppet] - https://gerrit.wikimedia.org/r/89836 (owner: Yurik)
[15:18:01] mark, but that feature is only for the beta cluster testing
[15:18:10] and nothing bad would happen if I manually set that header
[15:18:21] without coming from the -TEST range
[15:18:47] yurik_: not sure i really understand completely what you need
[15:19:35] yurik_: yup that would be fine :-]
[15:19:46] yurik_: this way you can enable_esi on the staging instance for mobile
[15:19:47] hashar: mark -1 it :(
[15:19:53] see his comment :-]
[15:19:56] and he is not here to answer
[15:19:59] he is the actual varnish expert hehe
[15:20:23] yeah, that's true
[15:20:34] one of those cases when its easier just do as asked :(
[15:23:29] yes
[15:23:31] (PS2) Yurik: Allow ESI only if enable_esi flag is set in the puppet cluster options [operations/puppet] - https://gerrit.wikimedia.org/r/89836
[15:24:24] (CR) Mark Bergsma: [C: 2] Allow ESI only if enable_esi flag is set in the puppet cluster options [operations/puppet] - https://gerrit.wikimedia.org/r/89836 (owner: Yurik)
[15:24:36] (CR) Yurik: "(1 comment)" [operations/puppet] - https://gerrit.wikimedia.org/r/89836 (owner: Yurik)
[15:27:04] aude: in case the X-CS header is set (meaning that the user comes from a known mobile partner network), any visit to m.wikipedia.org or zero.wikipedia.org (or any other m.* actually) should see a custom-generated page
[15:27:52] (CR) Mark Bergsma: "You're mixing tabs and spaces here" [operations/dns] - https://gerrit.wikimedia.org/r/89822 (owner: Lcarr)
[15:29:22] yurik_: ah ok
[15:29:27] that page is currently generated at en.m.wikipedia.org/wiki/Special:ZeroRatedMobileAccess but you can't view it unless you set header on your browser: X-CS:250-99
[15:29:40] but it shouldn't be part of "en"
[15:29:49] it should be in the language of the carrier
[15:31:16] yurik_: so have it on meta wiki?
[15:31:32] aude: what do you mean? its not a static page
[15:31:33] * aude really doesn't know the best place
[15:31:42] it's a special page
[15:31:54] have one on all wikipedias? or it just one place?
[15:32:21] meta contains the raw html of the page that extract2 yields in the place of "www.wikipedia.org"
[15:32:34] right
[15:32:43] what i need is something similar to extract2 that will yield a dynamic page
[15:32:49] instead of m & zero subdomains
[15:33:00] oh
[15:33:47] and instead of the special page?
[15:34:00] correct
[15:35:59] * aude sees apache rewrite rule for zero :)
[15:37:01] instead i assume it would redirect to the docroot page
[15:37:23] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 15:37:18 UTC 2013
[15:37:24] and then not quite sure how to find out the languages
[15:38:03] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours
[15:38:11] or be an alias or something
[15:38:40] (CR) Reedy: "Depends on https://gerrit.wikimedia.org/r/84707" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/84721 (owner: Reedy)
[15:38:47] aude: sorry, got disconnected, please repeat
[15:39:10] yurik_2: i see things are handled with apache redirect to the special page
[15:39:28] would it redirect instead (or alias) to something in docroot?
[15:40:10] * aude would like to do fancy lang redirects for wikidata (btw) although it's complicated
[15:42:14] yurik_: or maybe it's possible to obfusicate (w/ apache) the fact that users are being redirected to the special page?
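The requirement yurik_ states at 15:27 (X-CS header set, host is m.* or zero.*) can be read as a simple predicate. This is only a rough sketch of that condition: the function name wants_zero_landing is hypothetical, and treating an empty X-CS value as "not set" and matching only the first host label are assumptions, not anything the chat specifies.

```python
def wants_zero_landing(host, headers):
    """Return True when a request should get the carrier-specific page.

    Assumption: 'set' means the X-CS header is present and non-empty, and
    only hosts whose first label is 'm' or 'zero' qualify.
    """
    # HTTP header names are case-insensitive, so normalise before lookup.
    normalised = {name.lower(): value for name, value in headers.items()}
    carrier = normalised.get('x-cs', '').strip()
    first_label = host.lower().split('.')[0]
    return bool(carrier) and first_label in ('m', 'zero')


# The header value 250-99 is the example given in the chat.
print(wants_zero_landing('m.wikipedia.org', {'X-CS': '250-99'}))
print(wants_zero_landing('zero.wikipedia.org', {'x-cs': '250-99'}))
print(wants_zero_landing('m.wikipedia.org', {}))
```

Under these assumptions a language-specific host such as en.m.wikipedia.org does not match, which fits the point that the page should not be part of "en".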
[15:47:02] mark: ping
[15:47:09] pong
[15:47:55] mark, re cassandra testing: labs is very IO-limited, does not have SSDs etc
[15:47:55] you don't need that for functional testing
[15:47:55] so would not actually provide useful benchmark data
[15:48:00] I have done functional testing on my laptop
[15:48:13] which happens to have an SSD, which helps
[15:49:03] we actually want to push cassandra to see if it holds up, and how it performs with a big data set
[15:50:31] what dataset are you going to put in?
[15:50:55] initially a wikitext dump, then some old revisions and HTML
[15:51:15] if i remember right from when i last investigated cassandra, the compaction where it merges sstables is where its most likely to run into issues(especially with IO) so will want to make sure enough data is going in to simulate regular compaction
[15:51:34] yup
[15:51:57] in local testing I imported a part of a wikitext times times
[15:51:57] gwicke: who on Ops is working with you on that?
[15:51:57] *a wikitext dump
[15:52:17] mark: Ryan_Lane is providing moral support
[15:53:24] otherwise it is mainly a matter of trying it
[15:53:24] sounds like good stuff :) I liked cassandra, but was just leery about their php support and current unfinished transition to CQL3, but from an eng standpoint its a nice system
[15:53:24] installing cassandra is rather easy
[15:53:25] * mark sighs
[15:53:51] mark: I also offered to do the apt-get install via puppet, but without sudo tweaking the configs and simulating failures will be slower
[15:54:21] it might make more sense to figure that out without puppet, and then puppetize when the first testing was successful
[15:54:52] yes
[15:56:40] if anybody in ops is interested to join the experiment then that would be great too of course
[15:56:40] that would be a good idea
[15:56:40] or this will turn into another failed adoption
[15:56:41] i'll discuss it in our meetings later today
[15:57:33] mark: is that the Monday meeting moved to Tuesday?
[15:57:34] the internal ops meeting
[15:57:34] yes
[15:57:34] there's also a scrum of scrums before that
[15:57:34] ah, not the Monday morning Dev/Ops meeting
[15:57:34] it would be good to discuss it there too
[15:57:41] plenty of meetings today
[15:57:46] apparently
[15:58:09] my monday calls have been moved today
[15:59:00] 9pm till 1am despite today being my birthday hehe
[15:59:00] it's our surprise to you
[15:59:00] happy bday hashar, you can spend time with us
[15:59:00] ohh, happy birthday hashar
[15:59:12] :-]
[15:59:52] happy birthday!
[16:00:20] hashar: happy birthday!
[16:02:58] gwicke: for perf testing some acquaintance told me about http://tsung.erlang-projects.org/ apparently used to stress tests boxes
[16:03:35] gwicke: supports HTTP, WebDAV, SOAP, PostgreSQL, MySQL, LDAP and Jabber/XMPP
[16:04:41] hashar: sounds interesting
[16:04:51] I have node client code that parses an XML DOM and inserts that into cassandra
[16:05:05] would be easy to convert that to post it to the web service instead
[16:09:43] hashar: some of the features like reports about response times sound interesting, especially when not just testing for peak throughput
[16:10:07] gwicke: tip tip: he is a freelancer in France
[16:10:07] :]
[16:10:16] but yeah tsung looks promising.
[16:10:28] he has been using it to test Postgre backed website
[16:10:44] the 'XML configuration system' sounds a bit off-putting at first
[16:11:09] hopefully it is not full Java / Zope madness
[16:15:28] (CR) Mark Bergsma: "I don't mind the idea; I think that will provide very useful information." [operations/puppet] - https://gerrit.wikimedia.org/r/89359 (owner: Ori.livneh)
[16:15:48] gwicke: write it in json and convert to xml ! :-D
[16:16:23] aude: sorry, stepped away for a sec. I'm not sure really how current extract/docroot handles it all, that's why trying to find who initially wrote it :)
[16:16:28] ori-l: basically, I'd like to avoid new uses & extra instances of varnishncsa at this point, if we can roll this into the new varnishkafka plans/infrastructure then I'm all for it
[16:17:44] hashar: ;)
[16:18:09] yurik_: ok :)
[16:18:46] mark: yeah, I'm completely in favor of migrating to varnishkafka; been working on kafka integration in EL too: https://gerrit.wikimedia.org/r/#/c/85337/
[16:19:02] cool
[16:19:27] I believe paravoid has added udp output to varnishkafka for completion, but I think it would be better if you could just use the kafka stream
[16:19:37] yes, that's been the plan
[16:19:56] ok
[16:20:01] if yu respond that on the gerrit change I'll +1 it ;)
[16:20:33] i'm still a bit wary of starting another varnishncsa instance on the bits caches right now
[16:20:47] most bits servers don't have a lot of memory, and varnishncsa has a memleak where it uses up to 8 GB sometimes
[16:20:52] (CR) Ori.livneh: "> I'd like to make sure that this as well as the eventlogging can be replaced with the Varnishkafka solution that the analytics team is mi" [operations/puppet] - https://gerrit.wikimedia.org/r/89359 (owner: Ori.livneh)
[16:20:56] with 2 instances that can really lead to trouble
[16:21:07] well, ok, let's hold off then
[16:21:38] one thing we can do is have a single instance but use a URL path component to differentiate streams. this would require some reconfiguration of the receivers, but if that's less problematic i can certainly do that
[16:21:50] to differentiate streams on the receiving end, I mean
[16:22:39] (CR) Mark Bergsma: "Then I'm in favor of this plan. My only concern right now is adding another varnishncsa instance on the bits boxes, with the memleak this " [operations/puppet] - https://gerrit.wikimedia.org/r/89359 (owner: Ori.livneh)
[16:22:44] preferring an additional varnishncsa instance was predicated on the assumption that it has a small footprint, but if it causes trouble then i'll do that
[16:22:46] yeah
[16:22:53] yeah it doesn't
[16:23:02] alsoit does all the log processing yet again of course, so most cpu usage is duplicated
[16:23:10] it's not the best written varnish program
[16:23:27] varnishkafka is way better
[16:23:27] is it ready to use?
[16:23:27] and we'll only need one instance of it
[16:23:39] it's pretty much ready to use, but analytics isn't ready for it yet I believe
[16:23:43] !log bd808 synchronized php-1.22wmf20/maintenance/purgeChangedPages.php
[16:23:48] also they need a migration path to migrate off the udp stuff
[16:23:56] Logged the message, Master
[16:24:30] ah, ottomata, let me know if I can help that along
[16:24:30] hi reading
[16:24:57] Reedy: I got a ton of "mwXX: Permission denied (publickey)." from that sync-file run
[16:25:16] ori-l: varnishkafka is pretty good and ready, snaps is still making some changes
[16:25:32] currently, we are using it to produce to kafka in a test setup
[16:25:39] from a host in eqiad and a host in esams
[16:25:46] (Abandoned) Ori.livneh: Add varnish::logging::client_stats [operations/puppet] - https://gerrit.wikimedia.org/r/89359 (owner: Ori.livneh)
[16:25:48] we then check the produced logs for missing data
[16:25:57] we aren't confident that producing from esams is 100% reliable
[16:26:00] it seems to be most of the time
[16:26:01] bd808: ssh -A ?
[16:26:06] and i was hoping to write an email today saying that we are good to go [16:26:28] ottomata: cool, that's excellent [16:26:29] but, there are times on the esams host we are using where the buffer is filling up in varnishkafka [16:26:40] snaps thinks this is just due to cross atlantic latency [16:26:47] so we are increasing the buffer size to see if that makes a difference [16:26:56] but it takes a while for me to know for sure [16:26:56] tcp can't do miracles :) [16:26:56] ha, yeah [16:27:10] i like to collect 2 days of logs before I draw conclusions on this [16:27:23] Reedy: Yes, but… `ssh-add -l` says agent isn't connected. [16:27:48] Snaps also has a bug where sometimes the log lines are longer than what he allocates for working with them, he's fixing that and making that configurable or something [16:28:01] but, varnishkafka mostly works really really well [16:28:13] ori-l, do you need to send data somewhere via udp, or would kafka work for you? [16:28:25] heh [16:28:25] kafka would totally totally work [16:28:45] !log bd808 synchronized php-1.22wmf20/maintenance/purgeChangedPages.php [16:28:55] (03PS1) 10Manybubbles: Switch enwikisource to CirrusSearch as primary [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89848 [16:29:05] ok cool [16:29:40] Reedy: Hmmm… `ssh-add -l` shows keys before I try to run sync-file. After it does not. [16:29:45] oh ottomata... [16:29:57] as you setup those new varnish boxes yourself [16:29:58] do I still need to tell you that we're gonna use them and analytics should be aware of that? ;) [16:30:00] * bd808 blames .ssh/config [16:30:15] heheh nope, I am aware! 
of the mobiles at least [16:30:20] i haven't actually set those up yet :/ [16:30:21] good ;) [16:30:24] been doing analytics stuff [16:30:27] yeah just bits is ready [16:30:32] the rest we'll worry about later [16:30:37] but yeah, when we puppetize the mobile stuff for esams and eqiad we can do the same for ulsfo [16:30:38] I think bits will see some traffic there tomorrow [16:30:42] with varnishkafka [16:30:43] oh cool [16:30:56] bits also doesn't have analytics logging, so doesn't matter there [16:31:00] !log bd808 synchronized php-1.22wmf20/maintenance/purgeChangedPages.php [16:31:05] i guess we can change that with varnishkafka [16:31:15] bd808: \o/ [16:31:47] Reedy: All better now. I had pushed dotfiles to tin that included a .ssh/config that did bad things like dropping my agent [16:32:05] bd808: if you want a merit badge, follow the file name argument with a short explanation (sync-file foo/bar.php "updating bar.php to prepare for x") [16:32:16] (03CR) 10Chad: [C: 032] Switch enwikisource to CirrusSearch as primary [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89848 (owner: 10Manybubbles) [16:32:46] (03Merged) 10jenkins-bot: Switch enwikisource to CirrusSearch as primary [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89848 (owner: 10Manybubbles) [16:36:18] !log bd808 synchronized php-1.22wmf21/maintenance/purgeChangedPages.php 'Updating purgeChangedPages.php to prepare for HTCP purge' [16:36:30] Logged the message, Master [16:37:15] Achievement unlocked: push to all the things [16:37:25] +20G [16:37:28] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 16:37:18 UTC 2013 [16:38:34] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours [16:42:14] !log demon synchronized wmf-config/InitialiseSettings.php 'enwikisource getting cirrus as primary' [16:42:26] Logged the message, Master [16:57:56] (03PS1) 10Jgreen: remove awight@boo ssh key, RT#5965 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/89854 [16:59:09] (03CR) 10Jgreen: [C: 032 V: 031] remove awight@boo ssh key, RT#5965 [operations/puppet] - 10https://gerrit.wikimedia.org/r/89854 (owner: 10Jgreen) [17:10:13] (03PS1) 10Manybubbles: Turn debugging on for enwikisource [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89856 [17:18:06] paravoid did you remove megacli from ms-be1003/8 no longer works? only sda and sdb are showing as software raid? [17:18:43] what do you mean? [17:18:57] ^d: I believe I've discovered the issue - it looks like the last time we (probably I) rebuilt the index for enwikisource we only did the content index. I'm not sure why. the general index is way out of date [17:19:07] so queries against it are sometimes sad [17:19:17] I'm doing an in place reindex on just that index right now [17:19:22] which should unsad it [17:19:45] paravoid: MegaCli -PDList -aALL no longer works [17:20:01] oh, try "megacli", no caps [17:20:04] oh and the sw raid info was bogus ...i was still in iron :-P [17:20:05] <^d> manybubbles: Ah [17:20:19] ^d: I dont' really know how that happened [17:20:36] but it was hard to figure out because the warning logs that CirrusSearch spits out in dev are apparently bit bucketed in prod [17:20:49] weird...that must be new [17:20:55] the no CAPS [17:21:12] yep [17:21:17] I updated to a newer version [17:21:40] cool [17:21:43] good to know [17:21:55] PROBLEM - RAID on searchidx1001 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [17:22:45] RECOVERY - RAID on searchidx1001 is OK: OK: State is Optimal, checked 1 logical drive(s), 4 physical drive(s) [17:23:29] (03Abandoned) 10Manybubbles: Turn debugging on for enwikisource [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89856 (owner: 10Manybubbles) [17:23:51] paravoid: but now all our cheat sheets dont blindly copy and paste! [17:23:53] ;] [17:23:56] fix them? 
:P [17:24:13] * RobH makes alias on all systems via salt [17:24:14] fixed! [17:24:17] (kidding) [17:24:32] via salt? bad RobH, that's clearly a puppet thing [17:24:35] (also kidding) [17:24:42] if im going to do it wrong [17:24:51] im gonna do it wrong with IMMEDIATE results [17:25:06] * YuviPanda tweets root keys [17:25:08] but yes, puppet would be the correct way to enact a horrible fix [17:25:11] heh [17:33:14] akosiaris, apergos, sorry for leaving those patches unmerged; they are harmless as you guessed. [17:33:47] put it on our tab [17:34:06] add enough of em up and either there'll be a beer or they'll cancel out our tabs [17:34:43] I'd happily buy you a beer if you're ever able to leave the country again… how's that going? [17:37:25] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 17:37:21 UTC 2013 [17:37:55] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours [17:39:30] (03PS1) 10ArielGlenn: db1022 (s6) back in the pool, it's done doing its clone thang [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89864 [17:40:57] (03CR) 10ArielGlenn: [C: 032] db1022 (s6) back in the pool, it's done doing its clone thang [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89864 (owner: 10ArielGlenn) [17:42:52] !log ariel synchronized wmf-config/db-eqiad.php 'db1022 (s6) back in the pool' [17:43:07] Logged the message, Master [17:44:58] paravoid: apparently gluster has a funny way of implementing the swift API [17:48:43] as in reusing most swift code but having it write to gluster where it normally writes to local files, heh [17:49:15] !log enable icinga notifications for db1006 services [17:49:17] apergos: ^ [17:49:25] Logged the message, Master [17:51:09] thanks [17:51:38] it's hours after it went back in but I've had the logs open anyways [17:54:04] !log rebuilt search index for enwikisource's non-content documents because it had snuck out of date and was causing trouble
[17:54:16] Logged the message, Master [17:54:35] greg-g: so I'm finally done making CirrusSearch primary on enwikisource. [17:55:04] manybubbles: wow [17:55:12] greg-g: it broke [17:55:16] and I unbroke it [17:55:21] manybubbles: suck/well done [17:55:22] ;) [17:55:40] sorry for the trouble [17:57:31] congrats! [18:01:33] (03CR) 10Chad: [C: 032] All non wikipedias to 1.22wmf21 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89825 (owner: 10Reedy) [18:01:45] (03Merged) 10jenkins-bot: All non wikipedias to 1.22wmf21 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89825 (owner: 10Reedy) [18:02:18] yay! [18:03:06] !log demon rebuilt wikiversions.cdb and synchronized wikiversions files: [18:03:19] Logged the message, Master [18:20:31] (03PS1) 10Matanya: ssh: convert into a module [operations/puppet] - 10https://gerrit.wikimedia.org/r/89869 [18:20:58] (03CR) 10jenkins-bot: [V: 04-1] ssh: convert into a module [operations/puppet] - 10https://gerrit.wikimedia.org/r/89869 (owner: 10Matanya) [18:27:21] (03PS2) 10Matanya: ssh: convert into a module [operations/puppet] - 10https://gerrit.wikimedia.org/r/89869 [18:27:44] (03CR) 10jenkins-bot: [V: 04-1] ssh: convert into a module [operations/puppet] - 10https://gerrit.wikimedia.org/r/89869 (owner: 10Matanya) [18:30:07] YuviPanda: may i use your wizdom? [18:30:26] mark: sure? [18:31:04] for the change above, i fail to understand why jenkins complains about missing ssh.pp [18:31:24] sure it is missing, i removed it and converted the stuff into a module. [18:31:42] so, YuviPanda I fail to see my mistake, and hence can't fix it :/ [18:32:44] a module most not be imported [18:32:48] matanya: i think i had something similar once where it's a race condition like thing, when you delete a file and jenkins wants to check it [18:32:49] must not be imported [18:32:55] ah [18:33:00] plus it is not a module [18:33:11] just moving the file around does not make it a module... 
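(Editorial note on the point above that moving a file around does not make it a module: Puppet's autoloader finds a class or define by turning its name into a path under the module's manifests directory. The sketch below is a minimal illustration of that naming rule only, not Puppet's actual implementation. The function name manifest_path and the "modules" root directory are assumptions made for the example; the class names are the ones mentioned in this log.)

```python
from pathlib import PurePosixPath


def manifest_path(name: str, module_root: str = "modules") -> str:
    """Return the manifest file where the autoloader would expect `name`.

    `name` is a Puppet class or define name such as "ssh::daemon".
    `module_root` is a hypothetical module directory chosen for this sketch.
    """
    parts = name.split("::")
    module, rest = parts[0], parts[1:]
    base = PurePosixPath(module_root) / module / "manifests"
    if not rest:
        # The class named after the module itself lives in init.pp.
        return str(base / "init.pp")
    # Intermediate namespaces become directories, the last part the file name.
    return str(base.joinpath(*rest[:-1]) / f"{rest[-1]}.pp")


if __name__ == "__main__":
    for cls in ("ssh", "ssh::daemon", "ssh::hostkeys::collect"):
        print(cls, "->", manifest_path(cls))
```

(This is why a single ssh.pp holding several classes has to be split into one file per class or define before the autoloader can find them.)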
[18:33:31] you need to split the various classes in their respective files [18:33:38] so the autoloader can find them [18:33:56] akosiaris: each class into a separate manifest? [18:33:59] yep, one file per class [18:34:01] yes [18:34:07] and defines as well [18:34:12] * matanya hits his head against the wall [18:34:18] have a look at some other modules we have [18:34:28] yeah, somehow i forgot this [18:34:38] yes, i feel stupid now [18:34:48] dont [18:34:55] i'll re-do this the proper way [18:35:03] why? [18:35:28] you tried to do a good job, failing the first time is no reason to feel bad [18:35:37] just try again [18:35:47] this btw might be very handy http://docs.puppetlabs.com/puppet/2.7/reference/modules_fundamentals.html [18:36:01] oh, i thought you meant to convert it into a module [18:36:32] I am pretty sure that after skimming you will make ssh a nice module :-) [18:37:16] i converted miredo into a module last time, and it was only one class, so i didn't split it. turns out i got confused. thanks for the guidance. [18:37:25] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 18:37:22 UTC 2013 [18:37:30] you are welcome [18:37:46] * matanya goes back to refresh himself with the docs [18:37:55] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours [18:38:13] (03Abandoned) 10Matanya: ssh: convert into a module [operations/puppet] - 10https://gerrit.wikimedia.org/r/89869 (owner: 10Matanya) [18:39:24] (03PS1) 10Ori.livneh: Add Ganglia view for bits varnishes TTFB [operations/puppet] - 10https://gerrit.wikimedia.org/r/89877 [18:39:33] mark: ^ [18:39:45] (data is already in ganglia; this just makes it convenient to view) [18:41:07] not pmtpa? :) [18:41:14] i know it's not really active right now and going away [18:41:52] (03CR) 10Mark Bergsma: [C: 032] Add Ganglia view for bits varnishes TTFB [operations/puppet] - 10https://gerrit.wikimedia.org/r/89877 (owner: 10Ori.livneh) [18:42:19] thanks.
wait, are there actually pmtpa bits, are are you joking? [18:42:30] there are [18:42:48] we haven't been using it since the eqiad switchover, but esp now with gdnsd there's no reason we couldn't use both [18:42:56] however, since we're moving away, let's not start now ;) [18:43:18] (03CR) 10Faidon Liambotis: [C: 031] Registered eqiad swift backend configuration [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89045 (owner: 10Aaron Schulz) [18:43:21] sounds good [18:43:47] also no 10G interfaces hehe [18:43:51] would have been problematic with ULS [18:44:12] half of the ganglia views are unpuppetized btw... [18:44:32] uls fo ahem, no we didn't specifically pick that acronym for that [18:49:55] PROBLEM - Host labstore4 is DOWN: PING CRITICAL - Packet loss = 100% [18:50:05] http://ganglia.wikimedia.org/latest/?r=4hr&cs=&ce=&tab=v&vn=Bits+TTFB [18:51:15] RECOVERY - Host labstore4 is UP: PING OK - Packet loss = 0%, RTA = 26.74 ms [19:04:55] (03PS2) 10JanZerebecki: Omit URLs like www.wikivoyage.org/wiki/ from the redirect to mobile as www.m... does not exist. [operations/puppet] - 10https://gerrit.wikimedia.org/r/89879 [19:08:29] (03CR) 10Odder: [C: 031] Set up autopatroller right on eswikivoyage [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89497 (owner: 10TTO) [19:11:42] (03CR) 10Odder: [C: 031] "Just out of curiosity, why do we add multiple wikis to the exception?" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89642 (owner: 10TTO) [19:13:10] ori-l: why EQIAD twice? because it serves people who are farther on average? 
[19:20:09] (03CR) 10MaxSem: [C: 04-1] "Thanks - the regex seems to work, however Varnish VCLs also need to be updated or the fix will not live past the Squid deprecation:)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/89879 (owner: 10JanZerebecki) [19:21:29] (03CR) 10MaxSem: "It's in templates/varnish/text-frontend.inc.vcl.erb" [operations/puppet] - 10https://gerrit.wikimedia.org/r/89879 (owner: 10JanZerebecki) [19:24:01] (03CR) 10JanZerebecki: [C: 031] "No change in the patch set, just rebased accidentally." [operations/puppet] - 10https://gerrit.wikimedia.org/r/84901 (owner: 10Dzahn) [19:30:10] !log adding mw1125 back, enabling pybal [19:30:24] Logged the message, Master [19:32:28] (03PS3) 10JanZerebecki: Omit URLs like www.wikivoyage.org/wiki/ from the redirect to mobile as www.m... does not exist. [operations/puppet] - 10https://gerrit.wikimedia.org/r/89879 [19:33:41] ori-l: fastly had that problem iirc. kernel bug that only happened in chicago because that's the only place with the right latencies. only 90% sure about the chicago part [19:33:48] errr, Nemo_bis ^ [19:36:52] (03PS1) 10Andrew Bogott: Remove an obsolete role file, labsmw.pp [operations/puppet] - 10https://gerrit.wikimedia.org/r/89983 [19:37:28] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 19:37:19 UTC 2013 [19:37:58] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours [19:40:16] jeremyb: right as in too low? [19:41:05] Nemo_bis: no, too high [19:41:44] (03PS1) 10Dzahn: add uk.planet.wikimedia.org (Ukrainian language) [operations/dns] - 10https://gerrit.wikimedia.org/r/89984 [19:42:42] (03CR) 10Andrew Bogott: [C: 032] Remove an obsolete role file, labsmw.pp [operations/puppet] - 10https://gerrit.wikimedia.org/r/89983 (owner: 10Andrew Bogott) [19:42:47] (03PS4) 10MaxSem: Omit URLs like www.wikivoyage.org/wiki/ from the redirect to mobile as www.m... does not exist. 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/89879 (owner: 10JanZerebecki) [19:43:39] (03CR) 10Andrew Bogott: [C: 032] Remove unused files from files/mysql and templates/mysql. [operations/puppet] - 10https://gerrit.wikimedia.org/r/88646 (owner: 10Andrew Bogott) [19:43:49] (03CR) 10MaxSem: [C: 031] Omit URLs like www.wikivoyage.org/wiki/ from the redirect to mobile as www.m... does not exist. [operations/puppet] - 10https://gerrit.wikimedia.org/r/89879 (owner: 10JanZerebecki) [19:49:38] andrewbogott: Hi. Whatever you did to fix https://bugzilla.wikimedia.org/show_bug.cgi?id=54143 , could you do it again please? :( [19:50:56] Nemo_bis: same instance and project? [19:51:05] andrewbogott: yes [19:51:11] ok, one minute... [19:52:17] Nemo_bis: is that any better? [19:52:49] andrewbogott: yes! Thanks [19:53:00] weird, different kind of failure today [19:53:15] Ah. [19:53:24] It wasn't doing anything special when it happened [19:55:20] YuviPanda: me again :) [19:56:17] (03PS1) 10Dzahn: planet - add Ukranian language and feeds [operations/puppet] - 10https://gerrit.wikimedia.org/r/89989 [19:59:24] (03CR) 10Dzahn: [C: 032] planet - add Ukranian language and feeds [operations/puppet] - 10https://gerrit.wikimedia.org/r/89989 (owner: 10Dzahn) [20:02:48] mutante, are you the king of mailman? [20:03:25] andrewbogott: no:) [20:03:29] Is anyone? [20:03:32] what is it [20:03:41] https://rt.wikimedia.org/Ticket/Display.html?id=5974 [20:04:29] andrewbogott: ok,i'll take it [20:04:43] Is it easier for you to talk me through it or to just do it? [20:05:21] andrewbogott: oh wait, i think it's not even mailman, after reading it [20:05:34] it says "2 lists" but one of them is not @lists [20:05:43] lemme check mchenry [20:05:44] I thought it was an alias but I don't see it in puppet [20:05:50] although I guess not all aliases are puppetized. [20:06:01] Could we change the first one to just be an alias for @lists? 
[20:06:04] they should be [20:06:10] That seems better than bounce & archive [20:06:13] puppet creates the alias files [20:07:10] The string 'analytics-internal' appears 0 times in our puppet repo [20:07:58] looking [20:08:14] mutante / andrewbogott sorry to interrupt, when one of you can help me out a sec, it would be most appreciated [20:08:25] matanya, what's up? [20:08:33] hi, moduling ssh [20:09:17] so andrewbogott I ran into puppet/manifests/ssh.pp and saw a weird call to a class as a resource [20:09:22] andrewbogott: maybe typo in the ticket, there is analytics@lists and analytics-internal@lists but nothing in mchenry, ack [20:09:26] i'll comment on ticket [20:09:41] which i think is not necessary, or i didn't get the point of the call [20:09:48] mutante, ok, thank you :) [20:09:57] matanya, line #? [20:10:05] andrewbogott: or it's in Google, we'll find out:) [20:10:06] 9 [20:10:25] though in my branch it is not there atm [20:10:42] matanya: ssh::hostkeys::collect ? [20:10:47] yes [20:10:50] andrewbogott: yep, it is in Google, OIT's job:) [20:11:09] exim -bt
[20:11:15] mutante: OK, so… is there an established way to redirect an rt ticket to oit? [20:11:30] matanya: looks ok to me other than the dangling : -- is that what concerned you? [20:11:58] andrewbogott: no, i don't see the point in calling this class at all [20:12:48] matanya: OK, maybe I don't understand your question. Earlier you said 'weird call to class as resource' so I thought you were concerned with syntax... [20:12:56] You're just wondering what ssh::hostkeys::collect is for in the first place? [20:13:10] yes, sorry for not being clear andrewbogott [20:13:16] np, I'm reading... [20:13:21] andrewbogott: theoretically the office queue, but no..sigh. though we keep trying to find one. i'll pick the method of adding techsupport@ as AdminCC [20:13:34] (if zendesk is not down anymore) [20:15:52] matanya: At first guess, I'd say that this is using virtual resources -- probably ssh keys are defined virtually here and there, and that class says "On these servers, actually install those keys." [20:15:53] http://docs.puppetlabs.com/guides/virtual_resources.html [20:16:01] But, I don't actually see where those keys are defined yet... [20:16:18] that is exactly my point [20:16:34] grep didn't return any useful stuff related to this [20:16:41] git-blame says that ryan_lane made that class. Perhaps he can clarify. [20:17:05] is he anywhere around? [20:17:29] matanya: one thing to keep in mind is that there's a 'private' repo which contains secure things like keys -- so there are may be some sneaky files in play that you can't see. [20:17:35] did the git blame too :) hoped i was missing some trivial stuff [20:17:38] But in this case I don't see that that matters... [20:17:50] matanya: hey. For ssh as a module you can have a look at the very old change https://gerrit.wikimedia.org/r/#/c/15874/ [20:18:42] totally cool, we did almost the same changes at design level! [20:18:51] thanks hashar :) [20:18:56] +1 paravoid [20:19:25] why wasn't it ever taken care of/merged? 
[20:21:25] mutante: thanks again for taking that ticket [20:22:09] matanya: ops started moving to modules during summer 2013. That change is from the previous summer (2012) [20:22:15] andrewbogott: np [20:22:20] matanya: so I guess the idea got abandoned / was not a priority :-] [20:22:34] matanya: but it surely is now. We even have rspec unit tests running :-] [20:22:51] matanya, hashar, the answer is: because no one told me about it :) [20:23:00] so i'm glad i can be of help a bit [20:23:18] So it would be stupid to duplicate his work, i guess [20:23:43] matanya, interested in rebasing that change? [20:23:48] It needs it [20:23:54] i guess i can [20:24:02] my last rebse was painful [20:24:06] *rebase [20:24:21] well you could have a look at Faidon's change to get inspiration [20:24:25] then manually replicate the split [20:24:28] first try just hitting the button in Gerrit, matanya [20:24:31] have time to walk me through? i wouldn't want to waste your time [20:24:49] (03PS1) 10Chad: Remove $wgUseDumbLinkUpdate [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89998 [20:25:00] * matanya clicked and now praying  [20:25:08] RobH: another rt question… what happens to tickets like this that are tasks opened by an Op as a note-to-self? Should I just mark them as 'open' without comment? [20:25:15] Yeah, it requires a hand-rebase, gerrit can't do it. [20:25:17] no luck :/ [20:25:24] class ssh::daemon { [20:25:24] package { "openssh-server": [20:25:24] ensure => latest; [20:25:24] } [20:25:24] service { [20:25:25] ssh: [20:25:25] ensure => running, [20:25:26] subscribe => File["/etc/ssh/sshd_config"]; [20:25:26] } [20:25:29] } [20:25:32] oops [20:26:13] andrewbogott: if you have time, and not too painful for you, id like to rebase with your guidance [20:26:30] matanya: ever used "git review -d" before to amend an existing patch set?
[20:26:38] yes mutante [20:26:39] matanya: I'm happy to guide if that's educational, or I can just do it :) [20:27:01] you know how to rebase generally, right? Just wondering about the conflicts? [20:27:03] matanya: then do that, and then try like "git rebase -i origin/production" and if that works one more comit --amend and git review again [20:27:42] and if that fails you have to manually edit the file with the conflicts and look for HEAD and >>> [20:28:09] andrewbogott: ticket number? [20:29:02] andrewbogott: i always prefer a fishing rod rather then fish [20:29:15] andrewbogott: sorry, missed ping [20:29:34] if its note to self, you can mark as open if you are trying to triage new tickets, seems to make sense to me. [20:29:39] i'll do the rebase and hope it will work. thanks mutante :) [20:29:53] i tend to just ignore them, but some kind of processing seems a better idea. [20:29:58] RobH, mutante, speaking in general, but for example https://rt.wikimedia.org/Ticket/Display.html?id=5971 [20:30:33] well, its not owned by him [20:30:39] so it could be for him or leslie technically [20:30:47] i don't know if it makes a difference, status: open would look like it's being worked on already [20:30:48] so im not sure i'd mark new [20:30:55] sorry, mispoke [20:30:57] id leave new [20:30:59] RobH: Yeah, and I have the impression that future tasks don't really get assigned to people unless they're really working on them. [20:31:01] not change to open. [20:31:07] ok [20:31:24] So… do nothing, I'm up to that challenge. [20:31:26] heh [20:31:36] agrees [20:32:44] unless you'd have an actual comment or question on the ticket and it would automatically become open [20:34:38] (03CR) 10Dzahn: [C: 032] add uk.planet.wikimedia.org (Ukrainian language) [operations/dns] - 10https://gerrit.wikimedia.org/r/89984 (owner: 10Dzahn) [20:35:03] !log DNS update, add uk.planet [20:35:22] Logged the message, Master [20:35:58] cmjohnson1: this ticket seems aimed at you. 
https://rt.wikimedia.org/Ticket/Display.html?id=5969 [20:36:00] mutante: miserable fail [20:36:09] i need to do most of it by hand [20:36:32] andrewbogott it does...thx [20:37:25] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 20:37:20 UTC 2013 [20:37:29] matanya: arg..i see, so you edit the file that has conflicts and look for lines with HEAD and >>> etc, and fix it, and then git rebase --continue [20:37:55] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours [20:39:05] ACKNOWLEDGEMENT - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours daniel_zahn please see https://rt.wikimedia.org/Ticket/Display.html?id=5908 [20:40:26] andrewbogott: #5908 would be killing williams and change 88147 too, but see my comments (is it _really_ not used) [20:45:42] mutante: OK, another mailman question… how can I reset the admin password for a list? [20:46:44] (03PS1) 10Dzahn: add sidebar links for new planet languages [operations/puppet] - 10https://gerrit.wikimedia.org/r/90005 [20:46:59] andrewbogott: does it still have active admins? [20:47:10] It does but she can't log in [20:47:20] make her ask the other admins first [20:47:32] because there is just 1 pass, and if you reset it, the others won't get it [20:47:35] and make another ticket [20:48:01] if there is really no response, then use the master password to log in on the listinfo page and set a new one [20:48:38] always make people first ask the ones listed as "list run by" [20:49:23] which list is it for ? [20:50:00] mutante: https://lists.wikimedia.org/mailman/admindb/wikimania-core [20:50:04] It really looks like she's the only admin [20:50:21] what? i _just_ sent her that pass [20:50:27] oh, hm.
[20:50:38] well… that means you get another ticket :) https://rt.wikimedia.org/Ticket/Display.html?id=5964 [20:51:03] duplicate [20:51:06] already done all :p [20:51:17] hm, glad I asked [20:51:28] i even subscribed all those people [21:02:08] (03CR) 10Dzahn: [C: 032] add sidebar links for new planet languages [operations/puppet] - 10https://gerrit.wikimedia.org/r/90005 (owner: 10Dzahn) [21:03:50] mutante: seems like it would be faster to use my own branch [21:03:59] the code there is very old [21:05:44] matanya: if you want to use paravoid's patch as a model to update your work, that's fine. I'm surprised there are so many conflicts though. [21:08:09] 3 in site.pp 2 in base.pp and one in ssh.erb [21:08:21] more than i expected andrewbogott [21:08:55] Oh, the site.pp and base.pp stuff should be pretty simple... [21:09:29] let me see if I can get a look at what you're seeing... [21:09:38] now i see: deleted by us: manifests/base.pp [21:09:50] why is that? i haven't deleted it [21:10:15] It's because base is now a module. [21:10:27] So you need to make those corresponding changes in module/base/manifests/whatever [21:13:36] i see, day of merges. I have made a huge painful merge at work today :P [21:17:04] * matanya leaves this work for tomorrow, too late for normal code flow. [21:17:30] Would any Opsen like to volunteer to monitor HTCP activity in esams tomorrow from 20:00-22:00Z while I run a largish purge? [21:17:37] ok. Thanks for working on this, please make sure to add me as a reviewer.
[21:37:31] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 21:37:23 UTC 2013 [21:38:01] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours [21:41:05] (03PS1) 10Edenhill: Make scratch buffer size configurable (issue #2) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90028 [21:41:06] (03PS1) 10Edenhill: Log failed Kafka message deliveries (issue #1) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90029 [21:41:07] (03PS1) 10Edenhill: Provide some more detail when Kafka ..produce() fails. [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90030 [21:52:56] (03PS8) 10Dr0ptp4kt: Add an extra header for cache variance of W0 banners for proxies. [operations/puppet] - 10https://gerrit.wikimedia.org/r/88261 [21:56:45] (03CR) 10Dr0ptp4kt: "(1 comment)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/88261 (owner: 10Dr0ptp4kt) [22:08:13] (03CR) 10Dzahn: [C: 031] "ServerName arbcom.de.wikimedia.org was twice (insted of nl):" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/89329 (owner: 10Reedy) [22:10:49] (03PS9) 10Dereckson: Throttle now handles IP ranges. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/65644 [22:12:50] (03CR) 10Dzahn: [C: 032] "yep, and they are not in DNS anyways, arbcom is in wikipedia.org, not wikimedia" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/89329 (owner: 10Reedy) [22:13:03] (03CR) 10Dereckson: "@Adamw Can you test this code somehow somewhere?" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/65644 (owner: 10Dereckson) [22:16:26] Reedy: apparently https://deployment.wikimedia.beta.wmflabs.org/ gives No Wiki again [22:16:42] Reedy: I guess the lame router is confused by some docroot change ? (bug is https://bugzilla.wikimedia.org/show_bug.cgi?id=55762 ) [22:17:18] oh.. is that a weird exception? 
[22:17:21] TimStarling: question for you in my code-review comment on https://gerrit.wikimedia.org/r/#/c/89641/ [22:17:31] I'll be back at a computer soon so will poke [22:17:59] Reedy: haven't investigated, we can look it up together tomorrow if you want [22:18:06] is it worth restarting all apaches (real restart, not graceful) just to remove the unused virtual hosts? [22:18:15] as listed per apache2ctl -S [22:18:36] shrugs if it matters at all [22:18:45] when it's not in DNS anyways [22:19:20] (CR) Dr0ptp4kt: [C: 2] "To be clear, InitialiseSettings.php in mediawiki-config/wmf-config is where applicability for META is now set." [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/86147 (owner: Yurik) [22:19:33] (Merged) jenkins-bot: Removed Zero namespaces (480 & 481) from META [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/86147 (owner: Yurik) [22:20:16] (CR) Aaron Schulz: [C: 2] Registered eqiad swift backend configuration [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89045 (owner: Aaron Schulz) [22:20:25] (Merged) jenkins-bot: Registered eqiad swift backend configuration [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89045 (owner: Aaron Schulz) [22:22:12] !log aaron synchronized wmf-config/filebackend.php '013c27272a1adbccad11f02388e030f528e50832' [22:22:26] Logged the message, Master [22:23:16] argg. did sync-apache break? [22:23:33] nevermind [22:28:05] paravoid: hmm, so eqiad has 33923637.838612 MiB of deleted files and tampa 48102054.905341 MiB for Commons [22:28:19] at least the public originals have similar sizes [22:28:21] Reedy: ignore all that :) it's ok [22:29:39] !jenkins mediawiki-core-code-coverage [22:29:39] https://integration.wikimedia.org/ci/job/ [22:29:43] .. 
[22:29:46] !jenkins mediawiki-core-code-coverage [22:29:46] https://integration.wikimedia.org/ci/job/mediawiki-core-code-coverage [22:37:28] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 22:37:23 UTC 2013 [22:37:58] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours [22:41:22] (CR) Dzahn: [C: 2] "already decom'ed, ack." [operations/dns] - https://gerrit.wikimedia.org/r/88954 (owner: ArielGlenn) [22:41:32] (PS1) Mattflaschen: End GettingStarted A/B test and serve control [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/90045 [22:42:33] !log DNS update - removing decom'ed search servers1-12 [22:42:46] Logged the message, Master [22:53:31] !log maxsem synchronized php-1.22wmf21/extensions/MobileFrontend/ [22:53:42] Logged the message, Master [22:55:26] !log maxsem synchronized php-1.22wmf20/extensions/MobileFrontend/ [22:55:38] Logged the message, Master [22:55:40] greg-g, I'm done:) [22:55:46] MaxSem: sweet, thanks [22:56:10] alright, RoanKattouw, just you and superm401 now for LD [22:56:19] Awesome [22:57:15] so, coordinate between yourselves, I'm going to walk away from the keyboard for a bit [22:58:44] He's not in this channel but I'll ping him in -dev [22:59:35] yah [23:00:13] * greg-g goes [23:03:25] hashar: what's the dbname for deployment.wikimedia.beta.wmflabs.org ? [23:04:36] So I can add a failing test case for it ;) [23:06:15] Reedy: hits in the labs dblist [23:06:17] can't remember :( [23:06:20] heh [23:06:48] Reedy: labswiki [23:07:02] ohhh [23:07:05] you got tests :-] [23:07:20] There were some originally [23:07:36] I expanded them with all our weird odd cases [23:07:46] I guess there's no way to obviously map that to the other [23:08:21] -'labswiki' [23:08:21] +'deploymentwiki' [23:08:41] Can just be static mapped :D [23:08:42] arff [23:09:24] http://labs.wikimedia.beta.wmflabs.org/ yields domain not configured [23:10:29] Where should that go? 
[23:11:00] if ( preg_match( '/^([^.]+)\.[^.]+\.beta\.wmflabs\.org$/', $serverName, $matches ) ) { [23:11:00] // http://en.wikipedia.beta.wmflabs.org/ [23:11:00] $lang = $matches[1]; [23:11:00] } elseif ( preg_match( '/^([a-z0-9]*)\.beta\.wmflabs\.org$/', $serverName, $matches ) ) { [23:11:00] // http://wikidata.beta.wmflabs.org/ [23:11:01] $lang = $matches[1]; [23:11:03] } [23:11:04] just drop it [23:11:09] I think [23:11:36] to be honest, I am not sure what should be the landing page between deployment.wikimedia.beta.wmflabs.org and labs.wikimedia.beta.wmflabs.org [23:11:50] maybe we should just drop that wiki once its content is moved to wikitech [23:12:16] wheee [23:12:41] !log catrope synchronized php-1.22wmf21/extensions/VisualEditor/modules/ve-mw/init/targets/ve.init.mw.ViewPageTarget.js 'cherry-pick of https://gerrit.wikimedia.org/r/#/c/89773/' [23:12:56] Logged the message, Master [23:13:04] (PS1) Reedy: Make deployment.wikimedia.beta.wmflabs.org work [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/90059 [23:13:05] Reedy: bug 36412 http://labs.wikimedia.beta.wmflabs.org does not redirect --> closed WONTFIX [23:13:14] yay [23:13:16] quoting myself : We will just keep http://deployment.wikimedia.beta.wmflabs.org/ [23:14:15] our routing sucks :-( [23:14:21] multiversion/MWMultiVersion.php is horrible [23:14:56] Hey, at least it's not doing it on docroot paths now! 
:p [23:15:32] well I am falling asleep for real now [23:15:50] (CR) Reedy: [C: 2] Make deployment.wikimedia.beta.wmflabs.org work [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/90059 (owner: Reedy) [23:15:58] mergeymergemerge [23:15:59] poor zuul [23:18:58] (Merged) jenkins-bot: Make deployment.wikimedia.beta.wmflabs.org work [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/90059 (owner: Reedy) [23:19:20] !log maxsem synchronized php-1.22wmf21/extensions/MobileFrontend/ [23:21:09] !log maxsem synchronized php-1.22wmf20/extensions/MobileFrontend/ [23:34:26] (PS1) Reedy: Remove nomcomwiki and vewikimedia from tests [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/90064 [23:35:04] (CR) Reedy: [C: 2] Remove nomcomwiki and vewikimedia from tests [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/90064 (owner: Reedy) [23:35:24] (Merged) jenkins-bot: Remove nomcomwiki and vewikimedia from tests [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/90064 (owner: Reedy) [23:37:29] RECOVERY - Puppet freshness on williams is OK: puppet ran at Tue Oct 15 23:37:22 UTC 2013 [23:37:59] PROBLEM - Puppet freshness on williams is CRITICAL: No successful Puppet run in the last 10 hours [23:38:30] (CR) Adamw: [C: 1] "I'm not an authority on how this is used, so I must defer testing to someone else. Code looks good!" 
[operations/mediawiki-config] - https://gerrit.wikimedia.org/r/65644 (owner: Dereckson) [23:46:00] (PS11) Hashar: Jenkins validation (please ignore) [operations/debs/pybal] - https://gerrit.wikimedia.org/r/84932 [23:50:45] (PS3) Reedy: Remove wgSkipSkin and wgSkipSkins [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85645 (owner: TTO) [23:51:19] (CR) Reedy: [C: 2] Remove wgSkipSkin and wgSkipSkins [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85645 (owner: TTO) [23:51:30] (Merged) jenkins-bot: Remove wgSkipSkin and wgSkipSkins [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/85645 (owner: TTO) [23:53:35] (PS2) Reedy: Add Portale namespace to wgContentNamespaces for itwikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89496 (owner: TTO) [23:53:46] (CR) Reedy: [C: 2] Add Portale namespace to wgContentNamespaces for itwikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89496 (owner: TTO) [23:53:57] (Merged) jenkins-bot: Add Portale namespace to wgContentNamespaces for itwikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89496 (owner: TTO) [23:54:46] (PS2) Reedy: Remove arbcom_*wiki docroots [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89331 [23:54:58] (CR) Reedy: [C: 2] Remove arbcom_*wiki docroots [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89331 (owner: Reedy) [23:55:12] (Merged) jenkins-bot: Remove arbcom_*wiki docroots [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89331 (owner: Reedy) [23:55:32] (PS2) Reedy: Set up autopatroller right on eswikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89497 (owner: TTO) [23:56:03] (CR) Reedy: [C: 2] Set up autopatroller right on eswikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89497 (owner: TTO) [23:56:12] 
(Merged) jenkins-bot: Set up autopatroller right on eswikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89497 (owner: TTO) [23:56:49] (PS2) Reedy: Add throttling exception for Philippines editing workshop [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89642 (owner: TTO) [23:57:01] (CR) Reedy: [C: 2] Add throttling exception for Philippines editing workshop [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89642 (owner: TTO) [23:57:09] (Merged) jenkins-bot: Add throttling exception for Philippines editing workshop [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/89642 (owner: TTO) [23:58:45] (PS5) Reedy: Miscellaneous cleanup of InitialiseSettings [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/86415 (owner: TTO) [23:59:14] (CR) Reedy: [C: 2] Miscellaneous cleanup of InitialiseSettings [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/86415 (owner: TTO) [23:59:22] (Merged) jenkins-bot: Miscellaneous cleanup of InitialiseSettings [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/86415 (owner: TTO)