[01:57:33] (03CR) 1020after4: "So we probably need to package things up instead of relying on git to deploy the code because there are 3 repositories that need to be kep" (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/132505 (owner: 10Dzahn) [02:15:52] !log LocalisationUpdate completed (1.24wmf4) at 2014-05-22 02:14:49+00:00 [02:15:59] Logged the message, Master [02:25:16] (03CR) 10CSteipp: Improve nginx TLS/SSL settings. (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/132393 (https://bugzilla.wikimedia.org/53259) (owner: 10JanZerebecki) [02:30:58] PROBLEM - Puppet freshness on db1007 is CRITICAL: Last successful Puppet run was Wed 21 May 2014 11:30:29 PM UTC [02:39:21] !log LocalisationUpdate completed (1.24wmf5) at 2014-05-22 02:38:18+00:00 [02:39:25] Logged the message, Master [03:00:38] RECOVERY - Puppet freshness on db1007 is OK: puppet ran at Thu May 22 03:00:28 UTC 2014 [03:38:37] !log LocalisationUpdate ResourceLoader cache refresh completed at Thu May 22 03:37:31 UTC 2014 (duration 37m 30s) [03:38:41] Logged the message, Master [04:55:23] (03CR) 10Chad: [C: 031] "lgtm, merge when we're comfortable." [operations/software/elasticsearch/plugins] - 10https://gerrit.wikimedia.org/r/133771 (owner: 10Manybubbles) [04:56:30] (03CR) 10Chad: [C: 031] "lgtm, merge whenever." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134585 (owner: 10Manybubbles) [05:00:36] (03CR) 10Chad: "I think we should use trebuchet to deploy the repo(s) rather than relying on puppet to clone them." [operations/puppet] - 10https://gerrit.wikimedia.org/r/132505 (owner: 10Dzahn) [05:11:26] (03PS2) 10MZMcBride: Improve nginx TLS/SSL settings. [operations/puppet] - 10https://gerrit.wikimedia.org/r/132393 (https://bugzilla.wikimedia.org/53259) (owner: 10JanZerebecki) [06:04:41] (03CR) 10Nemo bis: "By the way, you can compare with http://translatewiki.net/ which uses https://git.wikimedia.org/blob/translatewiki.git/HEAD/puppet%2Fmodul" [operations/puppet] - 10https://gerrit.wikimedia.org/r/132393 (https://bugzilla.wikimedia.org/53259) (owner: 10JanZerebecki) [06:18:39] do we have a preferred service supervisor / init system? upstart? systemd? plain old init.d scripts? [06:23:58] PROBLEM - graphite.wikimedia.org on tungsten is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:25:48] RECOVERY - graphite.wikimedia.org on tungsten is OK: HTTP OK: HTTP/1.1 200 OK - 1607 bytes in 0.006 second response time [06:28:05] twentyafterfour: upstart [06:29:16] cool. So when puppetizing something I should include an upstart conf for the service and any related daemons in the puppet module? [06:30:37] yes [06:36:52] <_joe_> good morning [06:37:48] <_joe_> twentyafterfour: it strongly depends on what you are puppetizing [06:38:27] <_joe_> most external softwares should be packaged as debs and have the upstart job in the package [06:54:21] morning [06:56:52] <_joe_> good morning matanya [07:03:53] _joe_: ok that makes sense [07:04:31] <_joe_> twentyafterfour: the answer is always 'it depends on the use case :)' [07:05:08] do we have documentation on recommended/required packaging procedures? [07:09:03] <_joe_> twentyafterfour: can I ask what software/service you are talking about? [07:09:10] * _joe_ is now curious [07:09:40] _joe_: phabricator [07:10:21] thinking about the best way to package and deploy it. Phabricator doesn't follow many standard os conventions and instead reinvents a few wheels [07:10:46] <_joe_> twentyafterfour: let me check it [07:11:50] <_joe_> twentyafterfour: it's basically a LAMP software right? [07:12:17] yeah with a few daemons and dependencies [07:12:47] <_joe_> so for the lamp stack use the usual puppet setup [07:12:52] a few people have already been working on it, I'm just working on making it production ready [07:13:20] https://gerrit.wikimedia.org/r/#/c/132505/5 [07:13:22] <_joe_> I guess you ask chasemp for advice on that [07:13:32] <_joe_> when he's around [07:14:17] yeah I'm trying to finish some of it so he doesn't have to [07:14:35] <_joe_> well, the 'daemons' you referred to [07:14:46] <_joe_> what are they, separate projects? [07:14:57] <_joe_> written in what language? [07:15:02] they are php scripts that run as separate processes [07:15:05] background jobs [07:15:21] <_joe_> ok so install everything without packages [07:15:27] <_joe_> never package php, IMO [07:15:34] really? [07:15:44] <_joe_> php apps, sorry [07:15:57] <_joe_> non php the language or the execution environment [07:16:36] why not package them? deploying from git isn't ideal in this case but packaging is definitely a bit of a pain [07:16:51] <_joe_> I was looking at your comment in the patch [07:17:20] <_joe_> I guess libphutil, arcanist and phabricator do have a coordinated release schedule [07:17:48] PROBLEM - HTTP 5xx req/min on tungsten is CRITICAL: CRITICAL: 6.67% of data exceeded the critical threshold [500.0] [07:18:06] yeah it's ad-hoc but they need to be in sync when one updates to a new revision it often requires the latest revision of another... [07:18:06] <_joe_> uhm gotta look into that sorry [07:18:57] <_joe_> ok we had a BIG spike around 7:15 UTC [07:19:02] <_joe_> checking that... [07:23:54] (03CR) 1020after4: "trebuchet looks like an ok way to go, as long as we can specify dependencies among the packages or just put libphutil and phabricator into" [operations/puppet] - 10https://gerrit.wikimedia.org/r/132505 (owner: 10Dzahn) [07:24:51] <_joe_> twentyafterfour: sorry, looking into an issue [07:25:05] _joe_: no problem [07:25:14] <_joe_> It's already over, but if I don't try to understand it now I'll forget to do it later [07:25:31] indeed, take your time :) [07:30:48] RECOVERY - HTTP 5xx req/min on tungsten is OK: OK: Less than 1.00% data above the threshold [250.0] [07:33:19] <_joe_> ok I'm back [07:33:38] <_joe_> I know not enough about trebuchet to know if dependencies are supported [07:33:56] <_joe_> you could just release the three as a bundle via trebuchet I guess [07:57:53] good morning [08:00:01] <_joe_> ciao hashar [08:00:52] (03CR) 10Alexandros Kosiaris: [C: 032] "@dzahn, niah, matanya is correct. We don't even run the puppet daemon anymore (and for a long time now) and TBH I see no reason to run the" [operations/puppet] - 10https://gerrit.wikimedia.org/r/131333 (owner: 10Matanya) [08:04:44] _joe_: 503's : [08:04:51] If you report this error to the Wikimedia System Administrators, please include the details below. [08:04:51] Request: GET http://he.wikipedia.org/wiki/%D7%A2%D7%9E%D7%95%D7%A1_6, from 10.128.0.108 via cp4016 cp4016 ([10.128.0.116]:3128), Varnish XID 1201529834 [08:04:52] Forwarded for: 62.0.53.15, 10.128.0.108, 10.128.0.108 [08:04:52] Error: 503, Service Unavailable at Thu, 22 May 2014 07:15:24 GMT [08:05:18] and : [08:05:21] If you report this error to the Wikimedia System Administrators, please include the details below. [08:05:22] Request: GET http://he.wikipedia.org/wiki/%D7%92%D7%90%D7%99%D7%94_(%D7%9C%D7%95%D7%95%D7%99%D7%99%D7%9F), from 10.128.0.108 via cp4017 cp4017 ([10.128.0.117]:3128), Varnish XID 16023081 [08:05:22] Forwarded for: 62.0.53.15, 10.128.0.108, 10.128.0.108 [08:05:23] Error: 503, Service Unavailable at Thu, 22 May 2014 07:15:24 GMT [08:06:25] akosiaris you can look as well :) [08:06:57] and more: Request: GET http://he.wikipedia.org/wiki/%D7%A6%27%D7%90%D7%A0%D7%92-%D7%90%D7%94_3, from 10.128.0.109 via cp4009 cp4009 ([10.128.0.109]:3128), Varnish XID 337384636 [08:07:14] <_joe_> matanya: pastebin!!! [08:07:24] oh, right, sorry :/ [08:07:30] <_joe_> matanya: now? [08:07:40] yes. see times [08:08:04] actually for last hour, on and off [08:09:22] <_joe_> matanya: I suspect it's some connectivity issue , lemme take a further look though [08:09:58] _joe_: saw your mail, i'm wonder why i go through ulsfo [08:10:07] it makes no sense to me [08:10:15] <_joe_> matanya: what is your ip? in query [08:13:02] matanya: I must say I have to congratulate you for being able to switch from left to right to right to left writing [08:13:29] I am looking at the pages you pasted and man, my brain is having a hard time [08:13:43] poor you :) i'm so use to it [08:14:36] <_joe_> you all switch alphabets as well [08:14:45] <_joe_> it's pretty impressive for me [08:14:46] anyway, more info on this so we can debug it ? [08:14:57] and gladly, gnome support separate key map for every window, so chat is en only [08:15:01] _joe_: hey you are right! I 'd never thought of that [08:15:39] akosiaris: why i'm i routed to ulsfo ? that is my first question [08:15:55] indeed [08:15:55] no caching in esams ? [08:16:09] nothing to do with that (hopefully) [08:16:16] it should be geodns [08:17:05] right, and in dns israel goes to esams (iirc) [08:17:35] so what does host he.wikipedia.org say for you ? [08:18:10] I think you should be seeing wikipedia-lb.wikimedia.org is an alias for text-lb.esams.wikimedia.org. [08:18:13] do you ? [08:18:20] yes, if do [08:18:24] *i do [08:18:42] but when refreshing a page and looking in firebug: cp1068 miss (0), cp4010 miss (0), cp4010 frontend miss (0) [08:19:35] oh, now it changed again [08:19:42] and you have not messed up with any firefox settings, any weird extensions setting Host: headers and things like that right ? [08:19:45] text-lb.ulsfo.wikimedia.org [08:19:56] i haven't [08:19:56] ah there we are [08:19:58] weird [08:20:25] connected to 198.35.26.96 [08:20:35] which is indeed ulsfo [08:21:06] <_joe_> matanya: what is your dns? [08:21:24] <_joe_> maybe it does something funny [08:21:36] outside of our lan it is google [08:21:44] 8.8.8.8 ? [08:21:54] <_joe_> ok [08:21:58] <_joe_> so that's the reason [08:22:01] <_joe_> lemme check [08:22:06] so it does not honour EDNS client subnet then ? [08:22:17] They had fixed that [08:22:32] <_joe_> akosiaris: it does [08:22:34] yes, 8.8.8.8 [08:22:36] <_joe_> just checked [08:22:46] <_joe_> matanya: dig he.wikipedia.org @8.8.8.8 please [08:22:49] http://dpaste.com/3677CW3/ [08:23:14] vs: http://dpaste.com/049TAF6/ [08:23:27] indeed google's fault [08:23:32] <_joe_> matanya: not so sure [08:23:54] <_joe_> matanya: google points me to esams [08:24:01] see the second paste [08:24:21] <_joe_> yes, I think this is something wrong in geoip data we get [08:24:42] <_joe_> because IL is supposed to be directed to esams [08:25:43] that is relatively easy to verify [08:26:06] matanya: your IP please ? [08:26:12] or at least your subnet [08:26:15] <_joe_> akosiaris: just checked [08:54:53] akosiaris: rights question, when one has sudo rights on a machine it mean he can run stuff like otto@box: sudo command if he has root on the same machine this mean he has root@box, not the right to do any command as sudo. is this right? this is the setup ? [08:56:08] matanya: well if has root he implicitly has the right to use sudo (not that he needs to) but yes you are correct [08:58:02] <_joe_> matanya: dig +trace he.wikipedia.org [08:58:29] <_joe_> do not paste the result here pls! [08:59:36] <_joe_> akosiaris: which california ip got esams as an answer? [09:00:18] the lb answer [09:00:28] 198.35.26.0 [09:00:43] but ns0.wikimedia.org answers text-lb.eqiad for this one [09:01:07] _joe_: https://dpaste.de/JzAr and https://dpaste.de/0v2Q [09:01:15] argh this keeps on changing [09:01:18] this is erratic... [09:11:26] <_joe_> matanya: no I asked you to do [09:11:36] <_joe_> "dig he.wikipedia.org +trace" [09:12:11] https://dpaste.de/Y2w9 [09:15:21] (03PS1) 10Matanya: admins: add otto as analytics root [operations/puppet] - 10https://gerrit.wikimedia.org/r/134793 [09:19:57] (03CR) 10Filippo Giunchedi: [C: 031] remove 'maerlant' from site.pp/dsh/dhcp [operations/puppet] - 10https://gerrit.wikimedia.org/r/134745 (owner: 10Dzahn) [09:20:37] (03PS1) 10Matanya: admins: move otto to class admin rather than sudo_user [operations/puppet] - 10https://gerrit.wikimedia.org/r/134794 [09:23:26] (03CR) 10Matanya: [C: 031] remove 'maerlant' from site.pp/dsh/dhcp [operations/puppet] - 10https://gerrit.wikimedia.org/r/134745 (owner: 10Dzahn) [09:23:50] (03PS2) 10Filippo Giunchedi: update legal terms for dumps.wm.o [operations/puppet] - 10https://gerrit.wikimedia.org/r/134121 [09:24:04] (03CR) 10Filippo Giunchedi: [C: 032] update legal terms for dumps.wm.o [operations/puppet] - 10https://gerrit.wikimedia.org/r/134121 (owner: 10Filippo Giunchedi) [09:24:14] (03CR) 10Filippo Giunchedi: [V: 032] update legal terms for dumps.wm.o [operations/puppet] - 10https://gerrit.wikimedia.org/r/134121 (owner: 10Filippo Giunchedi) [09:25:03] (03PS4) 10Giuseppe Lavagetto: puppet3: fix videoscaler role [operations/puppet] - 10https://gerrit.wikimedia.org/r/134588 [09:36:43] (03CR) 10Giuseppe Lavagetto: [C: 032] puppet3: fix videoscaler role [operations/puppet] - 10https://gerrit.wikimedia.org/r/134588 (owner: 10Giuseppe Lavagetto) [09:40:19] (03PS1) 10Matanya: admins: add manybubbels and elasticsearch group [operations/puppet] - 10https://gerrit.wikimedia.org/r/134796 [09:44:41] (03CR) 10Alexandros Kosiaris: [C: 032] Fix parsoidCacheUri config in labs [operations/puppet] - 10https://gerrit.wikimedia.org/r/134643 (owner: 10Catrope) [09:45:02] (03CR) 10Alexandros Kosiaris: [C: 032] Add hewiki to Parsoid config in labs [operations/puppet] - 10https://gerrit.wikimedia.org/r/134649 (https://bugzilla.wikimedia.org/65490) (owner: 10Catrope) [09:45:04] (03PS1) 10Matanya: admins: add elasticsearch-roots to elasticsearch nodes [operations/puppet] - 10https://gerrit.wikimedia.org/r/134797 [09:46:24] (03PS2) 10Matanya: admins: add manybubbels and elasticsearch group [operations/puppet] - 10https://gerrit.wikimedia.org/r/134796 [09:46:51] (03PS3) 10Alexandros Kosiaris: let tin have a proper IPv6 address [operations/puppet] - 10https://gerrit.wikimedia.org/r/134284 (owner: 10Dzahn) [09:46:59] (03CR) 10Alexandros Kosiaris: [C: 032] let tin have a proper IPv6 address [operations/puppet] - 10https://gerrit.wikimedia.org/r/134284 (owner: 10Dzahn) [09:47:37] (03PS3) 10Alexandros Kosiaris: fix "read_only" setting in rsyncd setups [operations/puppet] - 10https://gerrit.wikimedia.org/r/133647 (owner: 10Dzahn) [09:49:03] phuedx: you are welcome :-) [09:49:11] :) [09:49:39] i was looking in to making the zuul-config change myself [09:49:47] (03CR) 10Alexandros Kosiaris: [C: 032] fix "read_only" setting in rsyncd setups [operations/puppet] - 10https://gerrit.wikimedia.org/r/133647 (owner: 10Dzahn) [09:49:53] phuedx: and welcome around :D I haven't spot you are working for Growth [09:50:02] ah sorry [09:50:03] and was trying to figure out if adding the test and lint sections would work [09:50:07] no no! [09:50:09] (03PS4) 10Alexandros Kosiaris: let tin have a proper IPv6 address [operations/puppet] - 10https://gerrit.wikimedia.org/r/134284 (owner: 10Dzahn) [09:50:10] thanks for doing it! [09:50:15] :D [09:50:18] I though you were US based so did it myself :-/ We can pair together for the next change if you want [09:50:21] (03CR) 10Alexandros Kosiaris: [C: 032 V: 032] let tin have a proper IPv6 address [operations/puppet] - 10https://gerrit.wikimedia.org/r/134284 (owner: 10Dzahn) [09:50:33] that'd be great! thanks! [09:50:42] i'm london based [09:50:48] I wasn't sure whether I could add the test/gate-and-submit pipeline after the template. Seems Zuul supports it quite well nowadays. [09:50:51] \O/ [09:50:57] one more european dev yeahhh [09:51:09] I am in France myself, and attempt to maintain the CI infrastructure [09:51:21] so I am barely involved in mediawiki dev anymore [09:51:54] directly [09:51:56] but you facilitate it daily! [09:52:09] akosiaris: would you please have some bandwidth to +2 two changes for contint in labs please? I need a few iptables rules there to reach NATed instances. [09:52:28] sure [09:52:30] akosiaris: the changes are already deployed on the local puppet master : https://gerrit.wikimedia.org/r/127213 https://gerrit.wikimedia.org/r/129687 [09:52:42] andrew +2ed them yesterday but they needed a rebase hehe [09:52:56] <_joe_> hashar: I can help with that [09:53:06] race with alexandros so! [09:53:54] I was mumbling yesterday how I got too many patch opened. Found out I haven't asked reviews for them and whenever I asked review all of them got merged under a couple hours \O/ [09:54:41] \o/ [09:54:45] <_joe_> hashar: I'm building debian packages, which is just taking time between builds [09:54:51] ahh [09:55:07] I even got a bunch of Jenkins jobs to build debian packages on patch submission [09:55:09] hashar: so including base::firewall means you will got ferm with policy DROP there [09:55:13] is that ok with you ? [09:55:20] akosiaris: let me triple check [09:55:41] Quite probably you have already done it in another class just making sure [09:55:47] akosiaris: yeah that is fine. Ferm is already applied on the targeted hosts [09:55:52] cool [09:55:54] merging then [09:55:54] Chain INPUT (policy DROP) [09:55:59] thanks for the warning [09:56:08] (03CR) 10Alexandros Kosiaris: [C: 032] contint: apply beta natfix on Jenkins slaves [operations/puppet] - 10https://gerrit.wikimedia.org/r/127213 (owner: 10Hashar) [09:56:46] so both merged, anything else ? [09:57:08] the rests of the changes are experimental [09:57:11] before I have to decide whether I want to put my head into ganglia and/or diamond or OpenLDAP [09:57:32] thanks akosiaris ! happy ganglia/diamond digging hehe [09:57:49] thanks for giving me the answer to my dilema :-) [09:58:08] hmm. now i hit esams, did you change anything _joe_ ? [09:58:18] matanya: no we did not [09:58:24] <_joe_> matanya: your dns cache expired [09:58:25] <_joe_> :) [09:58:41] it seems it was some erratic behaviour on google's part [09:58:46] or maybe not [09:58:56] the lesson is we need some better DNS monitoring [09:59:35] (03PS8) 10Hashar: contint: split Zuul server and merger (DO NOT SUBMIT) [operations/puppet] - 10https://gerrit.wikimedia.org/r/129292 [09:59:45] matanya: please ping us if you suddenly switch to another DC again [09:59:52] (03CR) 10Hashar: [C: 04-1] "rebased. Still a work in progress." [operations/puppet] - 10https://gerrit.wikimedia.org/r/129292 (owner: 10Hashar) [10:00:02] sure akosiaris [10:01:56] err: Could not retrieve catalog from remote server: Error 400 on SERVER: Duplicate definition: Package[puppet-lint] is already defined in file /etc/puppet/manifests/role/labs.pp at line 33; cannot redefine at /etc/puppet/modules/contint/manifests/packages/labs.pp:66 on node i-000001bd.eqiad.wmflabs [10:01:58] pfff [10:02:02] i hate puppet [10:02:25] don't, at least you have a clear error message :) [10:02:58] <_joe_> hashar: it's a pity my compiler does not work on labs [10:03:00] he [10:03:11] _joe_: is london configured to hit eqiad or esams ? [10:03:25] so wouldn't you love it if you could do include 'packagename' ? [10:03:36] and it would work the same way as classes ? [10:03:45] I know I would be thrilled... [10:03:47] sounds like a dream to me [10:04:40] (03PS1) 10Hashar: contint: puppet-lint is now available in labs [operations/puppet] - 10https://gerrit.wikimedia.org/r/134799 [10:05:04] _joe_: if you can https://gerrit.wikimedia.org/r/134799 in :-]  That drop my dupe puppet-lint package which is now installed on all labs instances [10:05:25] maybe I should add a require Package['puppet-lint'] or something [10:05:40] <_joe_> akosiaris: we can try to do something like a define that does not declare the package if it's already declared [10:05:49] <_joe_> it will slow puppet down a lot, though [10:06:02] there is such a define in puppet stdlib [10:06:35] and how will you name the define ? [10:07:01] let's say it is define lala::package () { blah blah} [10:07:02] akosiaris _joe_ now london is hitting eqiad instead of esams [10:07:31] <_joe_> matanya: how did you determine that? [10:07:46] from another box i have [10:08:09] _joe_ : ensure_resource( 'package', 'puppet-lint', {'ensure' => 'present' }) :-D [10:08:10] <_joe_> matanya: ip please [10:08:11] https://forge.puppetlabs.com/puppetlabs/stdlib [10:08:23] <_joe_> hashar: ugly [10:08:26] yeah [10:08:43] I am not sure how to declare the dependency though [10:09:11] (03CR) 10Hashar: [C: 04-1] "Ideally I would want to declare the dependency on puppet-lint to make sure it is around." [operations/puppet] - 10https://gerrit.wikimedia.org/r/134799 (owner: 10Hashar) [10:10:36] <_joe_> hashar: a Require [10:10:50] any clue what would be the exact syntax? [10:10:56] <_joe_> hashar: require => Package['puppet-lint'] [10:11:41] _joe_: ah but I would need to have it inside the class definition ( contint::packages::labs ) [10:11:50] _joe_: aka have contint::packages::labs to require the package [10:12:24] maybe i can just: require Package['puppet-lint'] [10:12:45] <_joe_> hashar: what needs this package? [10:12:56] modules/contint/manifests/packages/labs.pp [10:12:59] for contint purposes [10:13:10] <_joe_> no I mean [10:13:10] that class is only installed on labs and the package is nowadays provided on all instances [10:13:15] <_joe_> where do you use it? [10:13:23] <_joe_> in the contint module [10:13:52] it is included in role classes [10:14:16] (03CR) 10Nikerabbit: standardize a few things in admins.pp for conversion (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/134394 (owner: 10Rush) [10:14:35] <_joe_> no, I mean, is there some part of the contint module that uses puppet-lint? [10:14:51] <_joe_> or some script that will not run without it? [10:15:14] yeah Jenkins will have some jobs failling [10:15:34] the contint::packages::labs is to be installed on Jenkins nodes in labs and ensure we have everything required by the Jenkins jobs [10:15:51] maybe I can Class['contint::packages::labs'] => Package['puppet-lint'] [10:18:27] <_joe_> hashar: let me take a look at the contint class [10:19:28] maybe https://dpaste.de/0Akt [10:19:37] <_joe_> hashar: you just include contint::packages::labs or a parent class? [10:20:04] I just include contint::packages::labs [10:20:09] <_joe_> oh ok [10:20:25] that is done in manifests/role/ci.pp role::ci::slave::labs [10:20:48] <_joe_> you know, I'm not sure what you wrote will work. It should, not sure it will [10:21:18] <_joe_> I have some kind of memory of these things failing with a Class declaration [10:21:26] <_joe_> let me build a test case though [10:22:15] I have a puppet master, trying it :) [10:22:43] <_joe_> hashar: you will not see if that does not do what advertised, though :) [10:24:11] notice: /Stage[main]/Role::Labs::Instance/Package[puppet-lint]/ensure: ensure changed 'purged' to 'present' [10:24:11] \O/ [10:24:17] (03PS4) 10Matanya: dns recurses: add firewall [operations/puppet] - 10https://gerrit.wikimedia.org/r/133515 [10:24:27] (03PS2) 10Hashar: contint: puppet-lint is now available in labs [operations/puppet] - 10https://gerrit.wikimedia.org/r/134799 [10:24:32] <_joe_> hashar: and that is the labs class doing that [10:24:51] (03CR) 10Hashar: "Apparently declaring the dependency in the class works fine:" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134799 (owner: 10Hashar) [10:25:05] _joe_: yup wich is fine [10:26:10] if I remove the package definition from the labs class. I get a dependency error which is nice [10:26:17] SERVER: Could not find resource 'Package[puppet-lint]' for relationship on 'Class[Contint::Packages::Labs]' [10:26:46] (03CR) 10Hashar: [V: 032] "If the puppet-lint package is not defined somewhere else, puppet complains with:" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134799 (owner: 10Hashar) [10:27:37] sorry I am stealing a bunch of your brain cycles :( [10:28:25] <_joe_> hashar: yes, the only thing that is tricky is, that dependency will NOT ensure package puppet-lint is installed before class Contint::Packages::Labs is applied [10:28:40] <_joe_> which is definitely NOT what you would expect [10:29:14] <_joe_> with classes included with 'include', the -> works differently that it would do with defines [10:29:24] <_joe_> or with explicitly declared classes [10:29:26] <_joe_> FUN! [10:30:54] ohh [10:31:27] <_joe_> https://dpaste.de/mPE9 for reference [10:33:47] <_joe_> from the puppet manual: '-> (ordering arrow) Causes the resource on the left to be applied before the resource on the right' [10:34:19] so that will ensure puppet-lint is realized first isn't it ? [10:34:38] the ordering is not much of an issue anyway. I just need to make sure puppet-lint is present [10:34:41] <_joe_> it should. It will not! [10:36:41] <_joe_> it will do what you need though :) [10:37:32] \O/ [10:37:40] so I guess you can land https://gerrit.wikimedia.org/r/#/c/134799/ in :) [10:37:45] <_joe_> btw, when you start writing python in the middle of a puppet manifest, your brain is really trying to tell you something [10:38:03] ahah [10:39:40] <_joe_> oh no, I will have to give your change -1 I guess [10:39:54] <_joe_> -1 "puppet is dumb" [10:40:24] <_joe_> the dependency gets declared only when the class and/or define is applied, it seems [10:41:07] <_joe_> well, since your needs are met, who cares :) [10:41:29] yeah I guess that is good enough [10:41:47] it is not like that part of the code is processing millions of dollars :] [10:43:11] (03CR) 10Giuseppe Lavagetto: [C: 032] contint: puppet-lint is now available in labs [operations/puppet] - 10https://gerrit.wikimedia.org/r/134799 (owner: 10Hashar) [10:43:49] <_joe_> hashar: done [10:44:42] I gotta learn italian now [10:44:42] <_joe_> (btw, I hate having my RL name printed in bold here everytime I do something on gerrit, it kinda feels weird to see my RL name in IRC) [10:44:47] to figure out a way to say thanks [10:45:05] <_joe_> hashar: grazie == mercy [10:45:06] that is probably your labs account name [10:45:08] <_joe_> ehm [10:45:12] <_joe_> s/y/i/ [10:45:19] <_joe_> hashar: it is [10:45:19] mine is Hashar for whatever reason [10:45:23] instead of my real name :-( [10:45:38] <_joe_> hashar: not a real issue, it's just weird :P [10:46:04] <_joe_> well, it feels weird. [10:47:31] we [10:47:41] we can most probably have Gerrit to use the real name field in LDAP for display purposes [10:59:15] (03CR) 10Nemo bis: Gather all soft-disabled uploads wikis in one config item (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134400 (owner: 10Nemo bis) [10:59:37] seems like paravoid is signing a lot of pgp keys. He is near the center of the Greek web of trust http://www.void.gr/kargig/blog/wp-content/GreekWoT_2012_02_network_v2.png D [10:59:51] (purple disc in the middle, slightly on the left of a huge red disk) [10:59:55] disc [10:59:56] ahahaha [11:00:17] (03PS5) 10Nemo bis: Gather all soft-disabled uploads wikis in one config item [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134400 [11:04:12] the whole blog for our Greek ops http://www.void.gr/kargig/blog/2013/07/08/greek-pgp-web-of-trust-2012-edition/ :) [11:06:52] Anyone around who can look into email logs for password reset email status? [11:07:17] There's yet another user quite annoyed that they are locked out of their account [11:07:30] But supposedly-sent password reset emails do not arrive in their inbox [11:09:39] Krenair: I can try. pm me with details [11:09:55] email is probably wrong / not expected :-( [11:10:06] user.user_email is a good source :) [11:10:19] ? [11:10:24] hashar, actually I copied their address they contacted us from into the password reset form myself [11:10:35] But they reported no email arriving [11:13:15] (03CR) 10Gilles: Introduce finer-grained Media Viewer EventLogging sampling [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134343 (owner: 10Gilles) [11:46:30] The password reset form often lies, to avoid leaks [11:49:20] Are these cases documented? [11:57:56] I don't think so, it just says that an email was sent unconditionally, even when no email was sent at all or the email didn't exist etc. IIRC. [12:01:06] Easily tested [12:17:58] PROBLEM - mysqld processes on db1009 is CRITICAL: PROCS CRITICAL: 0 processes with command name mysqld [12:18:23] Reedy, perhaps you can check to see whether a given email address is confirmed against an account? [12:19:25] <_joe_> Nemo_bis: there is no easy way to monitor the delivery of an email message from an application perspective. So even if the local MTA accepted the message, you have no guarantee of the correctness of the email address and/or of delivery [12:40:22] (03PS1) 10Gilles: Launch Media Viewer for all users on Italian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134808 [12:40:24] (03PS1) 10Gilles: Launch Media Viewer for all users on Russian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134809 [12:40:26] (03PS1) 10Gilles: Launch Media Viewer for all users on all Wikisources [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134810 [12:40:28] (03PS1) 10Gilles: Launch Media Viewer for all users on German wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134811 [12:40:30] (03PS1) 10Gilles: Launch Media Viewer for all users on English wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134812 [12:41:18] Those are literally atomic commits :D [12:42:45] (03CR) 10Gilles: [C: 04-2] "Scheduled May 29th" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134810 (owner: 10Gilles) [12:43:02] (03CR) 10Gilles: [C: 04-2] "Scheduled June 3rd" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134811 (owner: 10Gilles) [12:43:09] (03CR) 10Gilles: [C: 04-2] "Scheduled June 3rd" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134812 (owner: 10Gilles) [12:45:23] (03Abandoned) 10Gilles: FUTURE: Eighth batch of pilot sites for Media Viewer [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/129828 (owner: 10MarkTraceur) [13:11:10] (03CR) 10Ottomata: [C: 032] "Shall I merge?" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134608 (owner: 10Hashar) [13:12:57] ottomata: I was wondering whether that is the good way to use rsync :) [13:13:17] (03CR) 10Hashar: "Merely wondering whether that is how to setup a rsync server :)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134608 (owner: 10Hashar) [13:13:28] I think that's fine, I use it like that for a few things [13:21:28] ottomata: I still have to confirm a few things though. Such as whether connections between labs projects are firewalls [13:21:30] firewalled [13:24:35] aye, you might have to set up a public proxy port to your hosts and use the public hostname [13:25:51] (03PS1) 10Alexandros Kosiaris: WIP: postgresql ganglia plugins [operations/puppet] - 10https://gerrit.wikimedia.org/r/134817 [13:27:29] (03CR) 10jenkins-bot: [V: 04-1] WIP: postgresql ganglia plugins [operations/puppet] - 10https://gerrit.wikimedia.org/r/134817 (owner: 10Alexandros Kosiaris) [13:28:01] ottomata: ho it is all for internal use :) [13:28:19] oh, just in labs? [13:29:02] hashar: May 6 [13:29:02] 15:43 hashar: rebooting aggregator, it is stalled somehow: There was an error collecting ganglia data (127.0.0.1:8654): fsockopen error: Connection refused [13:29:05] ottomata: I would like to remove the hardcoded hosts_allow line [13:29:10] had that fixed it ? [13:29:14] akosiaris: yup [13:29:19] cool [13:29:24] akosiaris: gotta rebuild that to a stronger instance though [13:29:25] I am doing it again then :-) [13:29:38] the gmond collectors are eating all the cpu iirc [13:30:11] aaaa nice... [13:30:22] hashar, if you remove the hosts_allow line, then anyone who can access the port will be able to sync files to that rsync module [13:30:38] well it should be puppetized so it is just a question of rsync the rrds [13:30:49] ottomata: yeah and I am not sure what is able to access it [13:30:58] PROBLEM - Puppet freshness on db1007 is CRITICAL: Last successful Puppet run was Thu 22 May 2014 10:30:34 AM UTC [13:31:16] akosiaris: and have to update the collector IP address. I would also like to split the web frontend and aggregator to different instances. But not sure how to achieve that. [13:31:47] (03PS6) 10Hashar: contint: rsync host in labs [operations/puppet] - 10https://gerrit.wikimedia.org/r/134608 [13:31:55] aren't they already split up ? I see ganglia-web and aggregator [13:32:12] and aggregator1 and aggregator02 but that is a different question [13:32:36] (03CR) 10Hashar: "rebased" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134608 (owner: 10Hashar) [13:32:57] akosiaris: there is only one instance really in use i think [13:33:05] sigh [13:33:47] if only we could resize a nova instance to give it more CPU ... but that doesn't work (andrew tried) [13:34:42] ganeti ftw :P [13:35:04] (03PS2) 10Alexandros Kosiaris: WIP: postgresql ganglia plugins [operations/puppet] - 10https://gerrit.wikimedia.org/r/134817 [13:35:26] no clue what ganeti is :) [13:36:31] (03CR) 10jenkins-bot: [V: 04-1] WIP: postgresql ganglia plugins [operations/puppet] - 10https://gerrit.wikimedia.org/r/134817 (owner: 10Alexandros Kosiaris) [13:36:42] Oh I am gonna kill jenkins .... [13:37:31] it is complaining about a file I did not even touch! [13:37:37] fun [13:37:52] https://integration.wikimedia.org/ci/job/operations-puppet-pep8/4116/violations/file/modules/deployment/files/modules/deploy.py/ [13:37:55] seems modules/postgresql/files/ganglia/postgresql.py does not pass pep8 though [13:38:01] honestly... I did not even touch that file [13:38:12] yes it does not but I excluded it for now [13:38:30] plus it is upstream's plugin, not mine [13:39:10] anyway I will test it in labs anyway... [13:39:42] will have a look at it [13:39:56] * hashar blames folks for putting code in puppet [13:40:08] ahahah [13:40:28] be thankful we don't deploy code with puppet [13:40:32] cause I have tried that [13:40:38] and it fails so so so so miserably [13:40:54] which is of course to be expected. But you know. hammer and nails [13:42:15] (03PS3) 10Alexandros Kosiaris: WIP: postgresql ganglia plugins [operations/puppet] - 10https://gerrit.wikimedia.org/r/134817 [13:43:41] (03CR) 10jenkins-bot: [V: 04-1] WIP: postgresql ganglia plugins [operations/puppet] - 10https://gerrit.wikimedia.org/r/134817 (owner: 10Alexandros Kosiaris) [13:48:38] (03PS7) 10Hashar: contint: rsync host in labs [operations/puppet] - 10https://gerrit.wikimedia.org/r/134608 [13:49:15] (03CR) 10Hashar: [V: 032] "I removed the hardcoded hosts_allow list entirely. Only labs instance part of the integration project will be able to rsync." [operations/puppet] - 10https://gerrit.wikimedia.org/r/134608 (owner: 10Hashar) [13:49:36] ottomata: https://gerrit.wikimedia.org/r/#/c/134608/ (rsync for contint) can land in :-) Others labs project can't talk with it [13:49:51] no the pep8 issue [13:50:15] mmmk [13:50:41] (03CR) 10Ottomata: [C: 032 V: 032] contint: rsync host in labs [operations/puppet] - 10https://gerrit.wikimedia.org/r/134608 (owner: 10Hashar) [13:50:59] done. [13:52:21] akosiaris: so https://gerrit.wikimedia.org/r/#/c/126846 did fail pep8 but got force merged [13:53:12] shame on ottomata :P [13:53:13] ah pfff [13:53:34] not that I wouldn't do the same [13:53:43] that E128 pep8 issue is very very useless [13:54:19] way too strict about over/under indentation [13:54:51] we can ignore it somehow [13:54:53] (03PS1) 10Ottomata: Fixing lines in deploy.py to make pep8 happy [operations/puppet] - 10https://gerrit.wikimedia.org/r/134821 [13:57:13] (03PS1) 10Hashar: pep8: ignore E128 error [operations/puppet] - 10https://gerrit.wikimedia.org/r/134822 [13:58:11] bah that makes things even worth [13:58:42] (03CR) 10jenkins-bot: [V: 04-1] pep8: ignore E128 error [operations/puppet] - 10https://gerrit.wikimedia.org/r/134822 (owner: 10Hashar) [13:59:00] (03Abandoned) 10Hashar: pep8: ignore E128 error [operations/puppet] - 10https://gerrit.wikimedia.org/r/134822 (owner: 10Hashar) [14:00:12] (03CR) 10Hashar: [C: 031] Fixing lines in deploy.py to make pep8 happy [operations/puppet] - 10https://gerrit.wikimedia.org/r/134821 (owner: 10Ottomata) [14:02:00] (03CR) 10Ottomata: [C: 032 V: 032] Fixing lines in deploy.py to make pep8 happy [operations/puppet] - 10https://gerrit.wikimedia.org/r/134821 (owner: 10Ottomata) [14:08:26] (03CR) 10Rush: "Ok so, for now, I don't want to use trebuchet and based on the conversations I have had with other operations folks that is a shared opini" [operations/puppet] - 10https://gerrit.wikimedia.org/r/132505 (owner: 10Dzahn) [14:08:52] (03PS1) 10Giuseppe Lavagetto: puppet: pin packages to the correct version [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 [14:19:44] (03CR) 10Ryan Lane: [C: 04-1] Improve nginx TLS/SSL settings. (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/132393 (https://bugzilla.wikimedia.org/53259) (owner: 10JanZerebecki) [14:26:53] ottomata: you set up the client/server aspect of role::puppet::self, right? Can you help me debug a setup? [14:27:11] sure! [14:27:34] So… in theory I should just set role::puppet::self on the server and role::puppet::self + the name of the server labs instance on the client, yes? [14:28:23] I'm going to add you to this project so you can log in… what's your wikitech username? [14:28:32] Reedy: bug 64885 -- is that something we can run before the (re)merge? [14:29:24] yes, in theory [14:29:32] if you want to do a self hosted master with clients [14:29:36] then you configure them all the same [14:29:39] with the puppetmaster's hostname [14:30:03] the master will notice that the configured puppetmaster hostname matches $::hostname (or $::fqdn, actually), and configure itself according [14:30:05] accordingly [14:30:32] oh my wikitech username [14:30:32] is ottomata [14:30:39] wait, do you mean I configure the master with its own name? [14:31:10] !log pushing new swift rings [14:31:15] Logged the message, Master [14:31:15] (03CR) 10Odder: [C: 04-1] "See comment." (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134400 (owner: 10Nemo bis) [14:31:23] yes [14:31:32] ottomata: master is 'puppet-localmaster' and client is 'trusty-puppet-client' [14:31:37] andrewbogott: all nodes within a puppet uhhh 'cluster' [14:31:42] shoudl have puppetmaster set to the same fqdn [14:32:13] ottomata: but it looks to me like the role class decides if I am a master or a client based on the presence or absence of the 'puppetmaster' setting... [14:32:23] andrewbogott: https://wikitech.wikimedia.org/wiki/Help:Self-hosted_puppetmaster#Set_up_a_multi-instance_self_hosted_puppetmaster [14:32:29] lemme check [14:33:02] andrewbogott: https://github.com/wikimedia/operations-puppet/blob/production/manifests/role/puppet.pp#L56 [14:33:31] # If localhost or if $server matches this node's [14:33:31] # $fqdn, then this is a puppetmaster. [14:33:31] if ($server == 'localhost' or $server == $::fqdn) { [14:33:31] class { 'puppet::self::master': [14:33:34] oh, you're right. [14:33:39] OK, I'll try this again :) [14:33:41] k [14:35:53] (03CR) 10Anomie: "I think we'll have to do this in two stages: stage 1 would add the new variables but not remove the definition of wgMediaViewerSamplingFac" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134343 (owner: 10Gilles) [14:36:29] Reedy: Would you mind double-checking my comment on https://gerrit.wikimedia.org/r/#/c/134343/ ? [14:50:54] gi11es: SWAT in 10 minutes. I'm not sure about 134343 as-is, or about 134804 without (the first stage of) 134343. [14:52:05] anomie: it's fine if we don't log events between the two deploy windows [14:52:17] when the config variables are absent, logging is just turned off [14:52:29] that's why I didn't make a backport for 1.24wmf4 [14:52:38] gi11es: Well, if you are fine with losing logging events then we can go for it. [14:53:10] I am, our graphs have been b0rked since last weekend: http://multimedia-metrics.wmflabs.org/dashboards/mmv [14:53:10] gi11es: (please reply to 134343 with that, just for the record) [14:53:13] will do [14:53:30] * anomie likes permanent record of stuff like this [14:54:52] (03CR) 10Gilles: "It's fine if we lose EventLogging tracking for a couple of hours between the deploy windows. That's why I didn't write a backport specific" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134343 (owner: 10Gilles) [14:55:09] Hi tgr. gi11es is the point person, but SWAT of some patches you were involved with is starting in about 5 minutes in case you wanted to follow along. [14:56:05] which ones? you mean the fix of the one tgr deployed yesterday? [14:57:25] ottomata: looks good now… apparently I just needed to rtfm [14:57:28] thanks [14:57:31] gi11es: He also merged 134064 (source of 134804) and reviewed 134343 [14:57:41] (03CR) 10Alexandros Kosiaris: [C: 032] Move torrus to netmon1001 [operations/dns] - 10https://gerrit.wikimedia.org/r/131499 (owner: 10Alexandros Kosiaris) [14:58:04] right [14:58:30] I would have liked the config change to get a clear +1 but it's tied to 134064, so I assume that tgr is fine with it [14:58:30] Always good to have more eyes if people are looking anyway (: [14:59:58] RECOVERY - Puppet freshness on db1007 is OK: puppet ran at Thu May 22 14:59:56 UTC 2014 [15:00:45] jouncebot: next [15:00:45] In 2 hour(s) and 59 minute(s): MediaWiki train (https://wikitech.wikimedia.org/wiki/Deployments#deploycal-item-20140522T1800) [15:00:56] choo choo [15:01:27] andrewbogott: yw! [15:01:41] jouncebot: last [15:01:46] * anomie wonders what jouncebot is and why it did a /notice rather than a normal ping [15:02:02] gi11es: Anyway, starting the SWAT with the quint fix [15:02:06] anomie: in fact for verification purposes it might be better to get the config change through first, that would allow me to check if things are working on beta, since the code consuming those config variables is already live (134064) [15:02:18] qlright, qunit is a no-brainer [15:02:30] gi11es: That was my plan, actually (: [15:05:07] anomie: jouncebot is mwalker's new bot that scrapes [[:wikitech::Deployments]] [15:05:30] fancy [15:05:51] Source at https://github.com/mattofak/jouncebot [15:07:17] mwalker|away: (when you get unaway) Why a /notice instead of a normal ping? It shows up in an annoying way in my IRC client. [15:10:46] chasemp: hi, are you around ? [15:13:17] !log anomie synchronized php-1.24wmf5/extensions/MultimediaViewer/tests/qunit/mmv/ui/ 'SWAT: Fix qunit tests for MultimediaViewer [[gerrit:134807]]' [15:13:21] gi11es: ^ Make sure that didn't somehow break anything, please [15:13:23] Logged the message, Master [15:14:17] anomie: MediaViewer still works fine. I don't know any production wiki where I can run qunit, though [15:14:31] gi11es: Probably isn't one. I think Jenkins runs it though [15:14:42] yeah, I'd trust jenkins on that one [15:15:01] it was mostly about making sure that the other bigger changeset passed jenkins [15:15:04] gi11es: Config change next. We'll merge, then you can test in Beta quick. [15:15:07] (03CR) 10Anomie: [C: 032] "Ok then. We'll do this first then the backport, for last-minute testing in Beta Labs." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134343 (owner: 10Gilles) [15:15:47] (03Merged) 10jenkins-bot: Introduce finer-grained Media Viewer EventLogging sampling [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134343 (owner: 10Gilles) [15:16:30] gi11es: ^ ok, test in Beta please [15:16:52] (03CR) 10Andrew Bogott: "I've tested this for changing the client version and it works great. Still testing the self-hosted master case. In the meantime, though," (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 (owner: 10Giuseppe Lavagetto) [15:17:05] (03PS2) 10Andrew Bogott: puppet: pin packages to the correct version [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 (owner: 10Giuseppe Lavagetto) [15:18:17] anomie: works, I see the events that weren't there earlier [15:18:47] gi11es: Ok. I'll deploy the config change to production now. Then we'll do the patch for the new logging. [15:18:58] sounds good [15:19:40] (03PS2) 10BBlack: fix daemonization stdio stuff [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134651 [15:19:42] (03PS1) 10BBlack: fix for faulty BGP session collisions [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134833 [15:21:19] !log anomie synchronized wmf-config/InitialiseSettings.php 'SWAT: Disable old MultimediaViewer logging and pre-enable new logging [[gerrit:134343]]' [15:21:23] Logged the message, Master [15:22:35] !log anomie synchronized wmf-config/CommonSettings.php 'SWAT: Disable old MultimediaViewer logging and pre-enable new logging [[gerrit:134343]]' [15:22:38] gi11es: ^ Ok, let's make sure we didn't break production with that [15:22:39] Logged the message, Master [15:23:09] (03PS2) 10BBlack: fix for faulty BGP session collisions [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134833 [15:24:10] anomie: mediaviewer still works fine. predictably, the events aren't hitting eventlogging [15:24:57] gi11es: Doing the last patch then [15:26:29] (03CR) 10Andrew Bogott: "well... my fix doesn't work. Something like that is still needed though." [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 (owner: 10Giuseppe Lavagetto) [15:27:11] (03PS3) 10BBlack: fix for faulty BGP session collisions [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134833 (https://bugzilla.wikimedia.org/7535) [15:28:26] (03PS4) 10BBlack: fix for faulty BGP session collisions [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134833 [15:31:20] !log anomie synchronized php-1.24wmf5/extensions/MultimediaViewer/ 'SWAT: Deploy new MultimediaViewer logging to wmf5 wikis [[gerrit:134804]]' [15:31:24] Logged the message, Master [15:31:26] gi11es: ^ Test please [15:32:44] anomie: all good. MediaViewer works fine and the events are making it all the way to the EventLogging DB [15:32:44] good! [15:32:44] * anomie is done with SWAT [15:32:52] anomie: thanks! [15:34:28] anomie: I've just spotted a typo in the config change, by looking at the EventLogging DB. It just means that the value for some events is too high. Any chance to squeeze the fix in the remaining window? [15:34:43] it's not a big deal, I can fix it tomorrow if you're our of time [15:34:46] *out [15:35:09] gi11es: There's 25 minutes left in the window, if you can get it done and merged to master in time. [15:35:21] anomie: will take 5 minutes tops [15:37:02] (03PS1) 10Dzahn: add wikimedia.community, link to wikimedia.com [operations/dns] - 10https://gerrit.wikimedia.org/r/134836 [15:37:38] Reedy, are you able to check something for me? [15:37:51] (03CR) 10Faidon Liambotis: "No objection on this but a more general comment: these daemon/pid routines could be replaced by python-daemon that would properly handle " [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134651 (owner: 10BBlack) [15:38:38] (03PS1) 10Gilles: Fix EventLogging event name in Media Viewer config [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134837 [15:38:44] anomie: https://gerrit.wikimedia.org/r/134837 [15:39:27] (03CR) 10Anomie: [C: 032] Fix EventLogging event name in Media Viewer config [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134837 (owner: 10Gilles) [15:39:35] (03Merged) 10jenkins-bot: Fix EventLogging event name in Media Viewer config [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134837 (owner: 10Gilles) [15:40:44] gi11es: Waiting on Jenkins to deploy it to Beta, then I'll ask you to test it [15:40:58] ok [15:45:27] matanya: yes [15:46:54] (03PS4) 10Alexandros Kosiaris: postgresql ganglia plugins [operations/puppet] - 10https://gerrit.wikimedia.org/r/134817 [15:47:39] (03PS3) 10Andrew Bogott: puppet: pin packages to the correct version [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 (owner: 10Giuseppe Lavagetto) [15:49:15] gi11es: Ok, test please [15:49:37] (03CR) 10Nemo bis: "Odder, as stated in the comment at the beginning of groupOverrides, and as you can see in CommonSettings.php around line 1024, groupOverri" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134400 (owner: 10Nemo bis) [15:50:05] anomie: beta still works fine [15:52:07] !log anomie synchronized wmf-config/InitialiseSettings.php 'SWAT: Fix typo in MultimediaViewer logging config [[gerrit:134837]]' [15:52:14] Logged the message, Master [15:52:16] gi11es: ^ Test please [15:53:14] (03CR) 10Alexandros Kosiaris: [C: 032] postgresql ganglia plugins [operations/puppet] - 10https://gerrit.wikimedia.org/r/134817 (owner: 10Alexandros Kosiaris) [15:55:39] anomie: all good [15:56:32] anomie: thanks again! [16:01:50] Krenair: What's that? [16:03:02] Reedy, about the password reset emails thing earlier, was just wondering if you could confirm what a user claims about their username being linked to the right address [16:03:05] (03CR) 10Anomie: "This patch seems sane, although I don't know whether the absolute path to be used here will actually be correct or not." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) (owner: 10BryanDavis) [16:03:51] (03CR) 10Faidon Liambotis: "Warning: merging this won't automatically decom the servers." [operations/puppet] - 10https://gerrit.wikimedia.org/r/133886 (owner: 10MaxSem) [16:03:51] username/wiki? [16:05:15] (03PS1) 10Alexandros Kosiaris: postgresql.pyconf is a template not a file [operations/puppet] - 10https://gerrit.wikimedia.org/r/134841 [16:06:54] (03CR) 10Alexandros Kosiaris: [C: 032] postgresql.pyconf is a template not a file [operations/puppet] - 10https://gerrit.wikimedia.org/r/134841 (owner: 10Alexandros Kosiaris) [16:07:17] Reedy, PM [16:14:49] (03CR) 10Dzahn: "not sure about this one yet. otto already is a (global) root. this might still be a good idea but we need to talk about it and include ott" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134793 (owner: 10Matanya) [16:15:44] _joe_: well, I can't make the puppetmaster case of your patch apply, but I suspect this is just because of the weird way I"m trying to bootstrap it (without the patch actually being merged) [16:17:33] <_joe_> no wait [16:17:58] <_joe_> there is something funny with the change [16:18:12] (03CR) 10Dzahn: [C: 04-1] "2 inline comments." (032 comments) [operations/puppet] - 10https://gerrit.wikimedia.org/r/134796 (owner: 10Matanya) [16:18:52] _joe_: When we're referring to the node-scoped $puppet_version should it be $::puppet_version? [16:18:59] <_joe_> andrewbogott: do you see manifests/site.pp in the list of changes? [16:19:06] <_joe_> andrewbogott: no, one second [16:19:31] My new patchset removes the change to site.pp. [16:20:05] (03CR) 10Dzahn: [C: 04-1] admins: add elasticsearch-roots to elasticsearch nodes (032 comments) [operations/puppet] - 10https://gerrit.wikimedia.org/r/134797 (owner: 10Matanya) [16:20:07] <_joe_> why> [16:20:16] <_joe_> andrewbogott: it NEEDS to be there [16:20:25] I tried to explain in gerrit comments... [16:20:29] But I can explain again :) [16:20:59] It's because defining it in site.pp causes a conflict if I define it anyplace else (specifically in the ldap node definition) [16:21:22] <_joe_> andrewbogott: why so? [16:21:29] <_joe_> andrewbogott: it's the same as $cluster [16:21:40] <_joe_> you first define it at top scope [16:21:47] <_joe_> if you re-define it at node scope [16:22:01] <_joe_> it will effectively override the default [16:22:22] (03CR) 10Dzahn: "see comments on I4c7fca2a5526dc this should only be merged after that" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134794 (owner: 10Matanya) [16:22:25] so… I don't know how to answer 'why' other than by saying, it /did/ throw an error message. [16:22:33] <_joe_> andrewbogott: what error did you see exactly? [16:22:38] And when I removed it from site.pp and defined it in the node, the error went away and it worked... [16:22:49] (03CR) 10Dzahn: [C: 04-1] "-1 just to make clear it should not be merged just yet" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134794 (owner: 10Matanya) [16:22:52] <_joe_> oh you defined it in the node in labs [16:23:06] <_joe_> I have no idea how node variables in labs work [16:23:29] <_joe_> andrewbogott: how did you define the variable? [16:23:38] (03PS3) 10Dzahn: admins: add manybubbles and elasticsearch group [operations/puppet] - 10https://gerrit.wikimedia.org/r/134796 (owner: 10Matanya) [16:23:39] <_joe_> andrewbogott: $puppet_version or $::puppet_version? [16:23:49] $puppet_version [16:24:07] <_joe_> and the error message was... [16:24:35] would you like me to reconstruct the original test case? That'll only take me 20 minutes or so :( [16:24:50] The error message was that $puppet_version was a duplicate definition, as defined in site.pp [16:24:55] <_joe_> andrewbogott: you didn't save the error message? [16:24:57] but I don't have the original error to cut and paste [16:25:03] (03PS2) 10Dzahn: remove 'maerlant' from site.pp/dsh/dhcp [operations/puppet] - 10https://gerrit.wikimedia.org/r/134745 [16:25:03] <_joe_> ok I needed the line and everything [16:25:06] <_joe_> :) [16:25:13] the line was the line in site.pp where the var is defined. [16:25:16] <_joe_> don't worry, though [16:25:35] what's the problem with having a good default behavior if it's undefined? [16:25:50] <_joe_> ok, we can try to set $cluster in labs? [16:25:59] <_joe_> I can do that [16:26:05] <_joe_> I bet that doesn't work either [16:26:31] (03CR) 10Dzahn: [C: 032] remove 'maerlant' from site.pp/dsh/dhcp [operations/puppet] - 10https://gerrit.wikimedia.org/r/134745 (owner: 10Dzahn) [16:26:56] sure, I'll set $cluster, just a second. [16:28:00] err: Could not retrieve catalog from remote server: Error 400 on SERVER: Cannot reassign variable cluster at /etc/puppet/manifests/site.pp:55 on node i-000003a1.eqiad.wmflabs [16:28:06] <_joe_> if this pattern does not work in labs... [16:28:09] <_joe_> ok [16:28:22] <_joe_> god, they're defined as top-scope variables in labs [16:28:28] <_joe_> and not node-scope variables [16:28:34] <_joe_> how lame :( [16:28:41] <_joe_> we should fix _that_ [16:28:57] <_joe_> it's already breaking things, it will break more with puppet 3 [16:29:15] (03CR) 10Reedy: [C: 04-1] "puppet repo needs checking for usages. Symlinks might need leaving till puppet migratd" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134117 (owner: 10Reedy) [16:29:45] <_joe_> andrewbogott: can you point me to the piece of code that does the magic in labs of defining node variables? [16:29:55] It's a puppet feature, ldap node definitions. [16:29:57] lemme google [16:30:04] <_joe_> oh, god. [16:30:14] !log maerlant - it was done for ~8d, old test host that didn't really do anything, revoked salt/pupppet certs, removing from Icinga.. [16:30:14] http://docs.puppetlabs.com/guides/ldap_nodes.html [16:30:19] Logged the message, Master [16:30:25] (03CR) 10Reedy: "It's always been relative. I get the feeling changing it is likely to break something unsuspecting along the way..." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) (owner: 10BryanDavis) [16:30:31] Might be that we have it misgconfigured… I haven't ever looked at it. [16:30:34] <_joe_> of course, the lame part of the implementation is not ours [16:30:43] <_joe_> thanks again puppetlabs [16:31:01] Reedy: man, that one sentence sure does instill confidence (your last comment on that patch) [16:31:04] <_joe_> andrewbogott: which is the labs puppetmaster? [16:31:07] Hm, actually it looks like there isn't much configuration, you just point it at ldap and let fly [16:31:07] Reedy: Do you want to take a shot at making 134641 use a relative path? [16:31:16] _joe_: virt1000 [16:31:49] Reedy: I can work on it if you're busy. Seems like we should fix this before tomorrow though. [16:31:53] feeling + likely + unsuspecting + along the way :)) [16:32:02] <_joe_> "All attributes on the LDAP nodes are assigned as variables in the Puppet configuration, just like Facts" [16:32:12] (03PS1) 10Reedy: Add symlinks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134843 [16:32:14] (03PS1) 10Reedy: testwiki to 1.24wmf6 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134844 [16:32:16] <_joe_> shit they never get one thing right, don't they? [16:32:16] (03PS1) 10Reedy: Wikipedias to 1.24wmf5 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134845 [16:32:18] (03PS1) 10Reedy: group0 to 1.24wmf6 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134846 [16:32:27] bd808: Why tomorrow? [16:32:32] <_joe_> andrewbogott: gimme BLOOD [16:32:43] _joe_: is low fat canned blood ok? [16:32:46] Reedy: Because I can't tell what day it is :) [16:32:47] <_joe_> andrewbogott: I'll try to find a way around this [16:32:56] _joe_: ok, thanks. [16:33:12] For my education, can you tell me why you think my suggested approach (checking for undef etc) won't work? [16:33:22] Is that var used in places that I missed? Or is there something fundamentally wrong with the approach? [16:33:26] <_joe_> Reedy: well, it's bay-area developer blood I seek, so it's probably low-fat and low-protein [16:33:26] bd808: I've manually fixed the symlink before committing 134845 [16:33:36] * bd808 nods [16:34:11] <_joe_> andrewbogott: well the wrong things are basically that undefs in puppet are a terrible thing and they tend to bite you sooner than later [16:34:14] (03CR) 10Reedy: [C: 032] Add symlinks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134843 (owner: 10Reedy) [16:34:22] (03Merged) 10jenkins-bot: Add symlinks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134843 (owner: 10Reedy) [16:34:38] Is there more too it than just that undef == "" [16:34:41] <_joe_> andrewbogott: also, undefs are different in templates and in puppet definitions, and inconsistent across puppet versions [16:34:48] Because in this case that seems safe [16:34:58] <_joe_> andrewbogott: yes we may do that [16:34:59] <_joe_> but [16:35:13] <_joe_> we already have things that do depend on that behavior [16:35:25] (03CR) 10Reedy: [C: 032] testwiki to 1.24wmf6 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134844 (owner: 10Reedy) [16:35:34] (03Merged) 10jenkins-bot: testwiki to 1.24wmf6 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134844 (owner: 10Reedy) [16:35:35] <_joe_> we need to fix labs [16:35:37] <_joe_> :) [16:36:25] <_joe_> andrewbogott: my change can wait a few days [16:37:17] !log reedy Started scap: testwiki to 1.24wmf6 and build l10n cache [16:37:22] Logged the message, Master [16:37:30] _joe_: We can't really fix puppetlab's implementation of ldap nodes can we? Short of sending a patch upstream? [16:37:41] <_joe_> andrewbogott: let me check [16:37:47] <_joe_> I want to be sure :) [16:38:38] (03CR) 10BryanDavis: [C: 04-2] "Keeping it relative seems safer as Sam points out. I'll amend soon." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) (owner: 10BryanDavis) [16:38:53] mwalker|away: btw, the anchor tag per deploy event is great, gennerally [16:39:20] tgr: let me know if any questions regarding the access [16:39:26] should all work [16:39:57] _joe_: I don't see why not. could fork the module and rename it to wmldap, patching the behavior you want to change in the forked module [16:40:08] <_joe_> andrewbogott: I do have a woraround, maybe. [16:40:18] er that was reply to andrewbogott sorry [16:40:38] <_joe_> andrewbogott: could you please revert your changes? [16:41:03] _joe_: I can, but, can't you just cherry-pick ps1 and modify that? [16:41:06] Or is gerrit fighting with you? [16:41:15] I've got a bit of experience with puppet internals...fwiw [16:41:23] <_joe_> andrewbogott: yeah, I can do that. [16:41:42] <_joe_> andrewbogott: we do have a default node in the labs ldap? [16:42:01] Sort of... [16:42:13] <_joe_> I mean, we should have a node default {} def in site.pp [16:42:16] <_joe_> lemme see [16:42:26] We have defaults that are set for new nodes. There isn't a good way to change the node across all existing vms. [16:42:40] But there's a base role that's included on all nodes. [16:42:46] it is... [16:42:59] role::labs::instance [16:43:08] <_joe_> ok [16:43:08] I'm pretty sure that's everywhere [16:43:25] (03CR) 10Mark Bergsma: [C: 031] "Looks good to me." [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134651 (owner: 10BBlack) [16:43:48] <_joe_> andrewbogott: ok this is a start :) [16:44:10] ottomata: fyi, analytics1019 starting to run low on diskspace (/var/lib/hadoop/data/).. just cause Icinga is warning [16:44:14] <_joe_> andrewbogott: do you still have a puppet master in labs for testing? [16:44:22] <_joe_> I had another idea [16:44:31] andrew-local-puppetmaster [16:44:53] <_joe_> the duplicate warning was on site.pp for $cluster, right? [16:45:03] err: Could not retrieve catalog from remote server: Error 400 on SERVER: Cannot reassign variable cluster at /etc/puppet/manifests/site.pp:55 on node i-000003a1.eqiad.wmflabs [16:45:39] _joe_: for your client use andrew-trusty-puppetclient4 [16:45:47] It's not currently pointed at the puppetmaster though, lemme do that now [16:46:03] <_joe_> andrewbogott: ok I think we have a solution here. let me try [16:46:08] (03CR) 10Mark Bergsma: "Those methods are actually meant to be inherited (hence the noop). I'd need to review better to see if that's still giving the desired beh" [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134833 (owner: 10BBlack) [16:47:06] thanks mutante [16:47:12] hadn't checked icinga today [16:47:42] ottomata: yw, it's just a warn so far [16:47:51] paravoid: ms-be1012 is down - powercycle or ..? [16:49:03] _joe_: I have to run in a few minutes (a friend is presenting her PhD dissertation) but will be back in a a couple hours. Leave me a hand-off message in gerrit or by email and I'll run some more tests when I return. [16:49:16] <_joe_> andrewbogott: ok [16:50:08] alright changing venues [16:50:28] sorry, wrong channel [16:50:30] _joe_: Applying role::puppet::self to an instance that is already using a labs puppetmaster is ugly because the first thing it does is git clone puppet and then use /that/ rather than the puppetmaster that it was previously pointed at… etc. etc. [16:50:54] changing venues due to lag [16:51:19] yup, its kinda weird andrewbogott, for sure [16:52:00] ottomata: What we're doing is even weirder, trying to serve a /new/ version of role::puppet::self onto an instance which exists only on the local puppetmaster [16:52:12] At some point we'll probably have to just merge and test w/production puppet [16:52:43] (03CR) 10Gergő Tisza: [C: 031] Launch Media Viewer for all users on Italian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134808 (owner: 10Gilles) [16:53:10] (03CR) 10Gergő Tisza: [C: 031] Launch Media Viewer for all users on Russian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134809 (owner: 10Gilles) [16:53:39] oh a new version of it [16:53:42] (03CR) 10Gergő Tisza: [C: 031] Launch Media Viewer for all users on all Wikisources [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134810 (owner: 10Gilles) [16:53:42] like,a new change? [16:53:48] haha [16:53:50] yeah that is weird [16:53:50] hm [16:54:12] (03CR) 10Gergő Tisza: [C: 031] Launch Media Viewer for all users on German wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134811 (owner: 10Gilles) [16:54:20] andrewbogott: i think when i was developing the multi client self hosted puppet stuff [16:54:26] i created and destroyed a lot of nodes :p [16:54:30] (03PS1) 10Jforrester: Enable VisualEditor by default on outreachwiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134849 [16:54:31] in the process [16:54:32] (03CR) 10Gergő Tisza: [C: 031] Launch Media Viewer for all users on English wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134812 (owner: 10Gilles) [17:00:56] <_joe_> andrewbogott: btw, it's clearly documented, d'oh [17:01:24] <_joe_> "Variables provided by an ENC are set at top scope. However, all of the classes assigned by an ENC are declared at node scope. This gives approximately the best and most-expected behavior — variables from an ENC are available everywhere, and classes may use node-specific variables." [17:01:54] <_joe_> the N in there is for NODE [17:02:07] <_joe_> talk about 'most-expected' [17:05:49] !log reedy Finished scap: testwiki to 1.24wmf6 and build l10n cache (duration: 28m 31s) [17:05:53] Logged the message, Master [17:09:20] _joe_: So far my approach has been to willfully deny the distinction between node and top scopes :) [17:10:18] <_joe_> andrewbogott: he! with puppet 3, you can't [17:10:37] alas [17:12:45] (03PS4) 10Giuseppe Lavagetto: puppet: pin packages to the correct version [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 [17:13:36] <_joe_> andrewbogott: inelegant but should do the trick [17:13:45] seems like [17:14:11] <_joe_> tomorrow (it's 7 PM here) i'll find a more elegant solution [17:15:20] <_joe_> see you later or tomorrow [17:15:32] 'k [17:15:40] mutante: powercycle yes [17:16:54] (03PS2) 10Matanya: admins: add elasticsearch-roots to elasticsearch nodes [operations/puppet] - 10https://gerrit.wikimedia.org/r/134797 [17:17:18] (03CR) 10Reedy: [C: 031] "lgtm. Will merge and deploy during 1.24wmf6 window in an hour or so" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134384 (owner: 10Rush) [17:17:30] paravoid: ok, now i had something different already.. it's coming back [17:17:46] !log powercycling ms-be1012 [17:17:50] Logged the message, Master [17:19:43] RECOVERY - Host ms-be1012 is UP: PING OK - Packet loss = 0%, RTA = 0.71 ms [17:22:35] (03PS2) 10BryanDavis: Fix /a/common/php symlink creation [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) [17:23:34] (03CR) 10BryanDavis: Fix /a/common/php symlink creation [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) (owner: 10BryanDavis) [17:24:47] (03PS4) 10Matanya: admins: add manybubbels and elasticsearch group [operations/puppet] - 10https://gerrit.wikimedia.org/r/134796 [17:29:30] (03CR) 10Anomie: [C: 031] "Seems sane, one style nitpick. Not tested, besides making sure that glob() wouldn't canonicalize redirects in MULTIVER_COMMON_APACHE in ca" (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) (owner: 10BryanDavis) [17:29:43] (03PS2) 10Matanya: admins: add otto as analytics root [operations/puppet] - 10https://gerrit.wikimedia.org/r/134793 [17:30:24] mutante: i spoke with akosiaris regarding otto [17:31:02] from log: [08:54:53] akosiaris: rights question, when one has sudo rights on a machine it mean he can run stuff like otto@box: sudo command if he has root on the same machine this mean he has root@box, not the right to do any command as sudo. is this right? this is the setup ? [17:31:02] [08:56:08] matanya: well if has root he implicitly has the right to use sudo (not that he needs to) but yes you are correct [17:31:35] ?? [17:31:45] context ? 134793 ? [17:32:13] matanya: sudo ALL is identical to root [17:32:14] sudo -s [17:32:20] and then you become root@ [17:32:35] or "sudo su" even [17:32:55] mutante: but the user invoking sudo remains the user vs running stuff as root [17:33:10] and seen as such in logs erc [17:33:13] *etc [17:33:20] we gotta talk about this in a few, in an IRC meeting. [17:33:28] agreed [17:33:29] but yea, the context is the "convert to admin yaml [17:33:33] @akosiaris [17:34:46] paravoid, in labs / the main cluster; is there a puppet variable for the hostname to use for the redis servers? [17:35:39] mutante: i wanted to add more patches to the admin yaml, but didn't want to do duplicate work or step on toes, any syncing page? [17:36:34] (03CR) 10BryanDavis: "Just to beat a dead horse: This is what trebuchet is for. You are going to be deploying specific revisions of multiple upstream git reposi" [operations/puppet] - 10https://gerrit.wikimedia.org/r/132505 (owner: 10Dzahn) [17:45:45] (03PS3) 10BryanDavis: Fix /a/common/php symlink creation [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) [17:46:44] (03CR) 10BryanDavis: Fix /a/common/php symlink creation (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) (owner: 10BryanDavis) [17:50:25] matanya: if you are root you can also su to any other user [17:50:43] right, so ? :) [17:50:55] matanya: ask otto when he uses root and when the otto user? [17:51:01] on analytics [17:51:08] makes sense [17:52:21] well, if I may... I kind of like matanya's approach. I would like to believe that at some point we won't be using root but personal accounts with sudo rights [17:53:21] i would like that too! [17:53:28] that's why i said it might still be a good idea but otto should be added [17:53:31] there you go [17:53:44] i usually ssh as root when i want to do root things, but only because that's what we seem to do around here [17:53:46] not because I like it :) [17:53:47] i just wonder why did you have sudo ALL in the past [17:53:58] me? [17:54:01] did you actually work as otto? [17:54:02] yea [17:54:09] that may have been before I had root? [17:54:15] that explains it! [17:54:35] all i said was "eh, why sudo ALL when he already is root anyways" [17:54:52] i agree not working as root will be nicer [17:56:14] so time to merge, thanks for all the input [17:56:16] so re: otto in analystics [17:56:48] no ops person should need a second designation, i.e. there is no need for otto to be root on analytics hosts in that way [17:56:52] as part of ops he is root on all hosts [17:57:25] assuming: https://gerrit.wikimedia.org/r/#/c/134793/2/modules/admin/data/data.yaml [17:57:28] chasemp, that is the state of things now, yes, but I think ideally all those with root would also have sudo ALL [17:57:48] yes that is the plan? [17:57:53] maybe? dunno [17:57:57] if you look in admins there is an ops group [17:57:58] that's in my ideal world :p [17:58:03] those people will login as themselves [17:58:05] and sudo [17:58:12] but they will have sudo all everywhere as they are ops [17:58:42] your account in an analytics group specifically should not be required and would in fact be duplicating logic [17:58:42] in the 'new world' [18:00:41] (03CR) 10Rush: [C: 04-2] "should not be needed" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134793 (owner: 10Matanya) [18:00:43] PROBLEM - Disk space on analytics1019 is CRITICAL: DISK CRITICAL - free space: /var/lib/hadoop/data/i 74803 MB (3% inode=99%): [18:00:48] :) :) [18:01:43] i'm on it! [18:02:15] ah i see what you are saying chasemp, ja [18:02:16] tru [18:02:26] i shoudln't need sudo because of anything analytics grouping related [18:02:32] (03CR) 10Reedy: [C: 032] Wikipedias to 1.24wmf5 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134845 (owner: 10Reedy) [18:02:34] agreed [18:02:41] (03Merged) 10jenkins-bot: Wikipedias to 1.24wmf5 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134845 (owner: 10Reedy) [18:02:51] thanks for clarifying chasemp [18:03:17] chasemp: should i do the rest of sudoers ? [18:03:21] sorry for the delay in it, you guys are moving faster than I can keep up [18:03:42] (03Abandoned) 10Matanya: admins: add otto as analytics root [operations/puppet] - 10https://gerrit.wikimedia.org/r/134793 (owner: 10Matanya) [18:04:19] (03Abandoned) 10Matanya: admins: move otto to class admin rather than sudo_user [operations/puppet] - 10https://gerrit.wikimedia.org/r/134794 (owner: 10Matanya) [18:04:19] matanya: not sure what that means, if you give me a bit I will clarify where we are at. analytics specifically will be last at this point, as I want to coordinate with otto. I am working on bastion host stuff, and I haven't looked at what's left entirely yet. [18:04:52] ok, chasemp poke me whenever [18:04:58] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Wikipedias to 1.24wmf5 [18:05:03] Logged the message, Master [18:06:32] greg-g: was beta cluster next branch cut already? need to test a few fixes. [18:07:07] matanya: beta cluster doesn't do 'branches' just auto updates from master every 5 ish minutes [18:07:31] or did I misunderstand? [18:08:17] greg-g: e.g: this bug https://bugzilla.wikimedia.org/show_bug.cgi?id=65490 was fixed, i wanted to test in beta, but it is still broken there [18:08:41] so, my question is, if it updates every few minutes, why isn't the fix in ? [18:08:49] (03CR) 10Rush: "I think we all want the simplest and most robust solution. The split is in the determining of which solution that is :) I have no proble" [operations/puppet] - 10https://gerrit.wikimedia.org/r/132505 (owner: 10Dzahn) [18:09:12] ah, puppet runs are different, I forget how often those are [18:09:14] * greg-g looks [18:09:34] 30 minutes in prod [18:09:36] yeah [18:09:39] not sure on beta [18:09:57] PROBLEM - Apache HTTP on mw1150 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [18:10:29] https://wikitech.wikimedia.org/wiki/Nova_Resource:Deployment-prep/How_code_is_updated#Puppet_and_Salt [18:10:35] looks like manually :/ [18:10:55] i need to attend to the "how code is deployed" lecture [18:11:11] :) [18:11:25] is there a problem with bits? [18:11:46] (03CR) 10Reedy: [C: 032] group0 to 1.24wmf6 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134846 (owner: 10Reedy) [18:11:49] huh: Seems fine for me. [18:11:55] lets ask the question differently : can i get access to Deployment-prep ? [18:12:00] :) [18:12:01] yes [18:12:09] huh: you tell us [18:12:17] (03Merged) 10jenkins-bot: group0 to 1.24wmf6 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134846 (owner: 10Reedy) [18:12:35] Reedy: if there was, it seems to be resolved now [18:12:51] It's possible it blipped switching wikipedia versions [18:12:57] wait no, just got no CSS again [18:13:16] huh: https://ganglia.wikimedia.org/latest/graph_all_periods.php?h=mw1150.eqiad.wmnet&m=cpu_report&r=hour&s=by%20name&hc=4&mc=2&st=1400782375&g=cpu_report&z=large&c=Bits%20application%20servers%20eqiad [18:13:52] jquery changes and such pushed to all wikipedias [18:14:05] yeah [18:14:17] huh: to see all the bits: https://ganglia.wikimedia.org/latest/?c=Bits%20application%20servers%20eqiad&m=cpu_report&r=hour&s=by%20name&hc=4&mc=2 [18:14:28] bits app server and cache load and network bumps are highly expected [18:14:41] yes, version change [18:15:37] matanya: what's your wikitech username? [18:15:37] slow to load -> https://meta.wikimedia.org/wiki/Special:RecentChanges [18:16:05] I see 503s from bits there [18:16:16] yeah, spike in load time:http://gdash.wikimedia.org/dashboards/frontend/ [18:16:28] well, spike as in, upward trend since the deploy [18:16:43] RECOVERY - Disk space on analytics1019 is OK: DISK OK [18:16:53] RECOVERY - Apache HTTP on mw1150 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 400 bytes in 8.715 second response time [18:17:14] what was that about^ [18:18:20] greg-g: matanya [18:22:30] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.24wmf6 [18:22:35] Logged the message, Master [18:24:58] not getting it anymore [18:25:20] good [18:25:31] the "spike" of load time went down: http://gdash.wikimedia.org/dashboards/frontend/ [18:26:18] Anything else weird you're seeing Reedy ? [18:28:27] (03PS2) 10Reedy: wgRCFeedschanges/updates via udp to second destination [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134384 (owner: 10Rush) [18:28:34] (03CR) 10Reedy: [C: 032] wgRCFeedschanges/updates via udp to second destination [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134384 (owner: 10Rush) [18:28:43] (03Merged) 10jenkins-bot: wgRCFeedschanges/updates via udp to second destination [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134384 (owner: 10Rush) [18:29:07] (here comes the Reedy spam) [18:29:23] i wonder how many people are currently using mysql/sql from a bastion [18:29:35] sql doesn't work from bast1001 does it? [18:29:42] wants to take the client away [18:29:52] and make people jump to something different if they need it [18:30:09] Reedy: mysql does [18:30:17] right [18:30:22] !log reedy synchronized wmf-config/ 'I7a02f2615d98428b6f27514e75d935d36e44fcb1' [18:30:25] well [18:30:26] Logged the message, Master [18:30:27] root@bast1001:~# sql [18:30:27] /usr/local/bin/sql: line 4: /etc/cluster: No such file or directory [18:30:29] so people using it would likely be for non directly mediaiwki stuff [18:30:40] actually "sql" fails because of /etc/cluster missing [18:30:50] that would be easy to add. but i want the opposite:) [18:32:41] ah.. hmm .. "people with access to research data should have to go to a special host " [18:32:44] that [18:32:56] just needs to define the special host [18:33:19] where is research data? [18:33:50] mutante: stat1001/2/3 [18:34:22] matanya: then.. i'll remove it from bastion.. and if anyone asks we'll send them there [18:34:47] ottomata: can you please confirm ? [18:35:20] argues that bastion should really be just ssh [18:35:39] that's the point of bastion (and not what fenari was) [18:35:41] mutante: actually i had previously advised people to use bastions for this [18:35:43] not sure if anyone is [18:36:14] mainly because: why give them access to a big server somewhere (like stat1003) if all they are doing is using it as a mysql bastion [18:36:44] (03PS2) 10Reedy: Launch Media Viewer for all users on Italian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134808 (owner: 10Gilles) [18:37:08] so bastion is just a ssh jump server? that seems to me like a waste of resources. [18:37:24] that is the point of calling it a bastion [18:37:32] oh, that's for later [18:37:33] http://en.wikipedia.org/wiki/Bastion_host [18:37:44] a special purpose computer on a network specifically designed and configured to withstand attacks. The computer generally hosts a single application [18:37:56] all other services are removed or limited to reduce the threat to the computer [18:38:18] (03PS4) 10Reedy: Disable query pages for closed wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130609 (https://bugzilla.wikimedia.org/42436) (owner: 10Withoutaname) [18:38:19] (03CR) 10Reedy: [C: 032] Disable query pages for closed wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130609 (https://bugzilla.wikimedia.org/42436) (owner: 10Withoutaname) [18:38:21] (03Merged) 10jenkins-bot: Disable query pages for closed wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130609 (https://bugzilla.wikimedia.org/42436) (owner: 10Withoutaname) [18:38:52] ottomata: https://gerrit.wikimedia.org/r/#/c/126027/3 [18:39:13] (03CR) 10Reedy: [C: 04-1] Enable VisualEditor by default on outreachwiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134849 (owner: 10Jforrester) [18:39:30] yeah, well. I mean adding just sql seems to fall under bastion in this sense, but whatever :) [18:40:14] hey, are we in shell requests merge time? :) [18:40:28] well, no [18:40:29] that means that the mysql servers have GRANTS to allow connections to private data [18:40:34] sql requires startup of a mediawiki instance [18:40:37] from a host, that is supposed to be locked down [18:40:47] Nemo_bis: no :) [18:40:55] :< [18:41:05] see SWAT [18:41:46] ok, please vote down on gerrit [18:41:53] i'll continue to next patch [18:42:04] mutante: i'm not opposed to that patch [18:42:09] i *think* it won't affect anyone right now [18:42:13] i see your reasoning, need to think of it a bit more [18:42:35] bastion seemed appropriate for a mysql client to me, since it then is just acting as a 'mysql bastion' [18:45:04] ottomata: if we'd send them to stat1003, is there more data that they'd get that isn't in mysql but in the filesystem? [18:46:13] yes, but not anything sensitive [18:46:21] other folks' working data, probably [18:46:23] and do they need another user/password for the mysql connection [18:46:24] wikistats stuff [18:46:29] or does it just allow based on the IP [18:46:36] that i don't know, springle would [18:46:52] they all use the same mysql user/pw right now, not sure if the mysql grant specifies the host [18:47:10] i doubt it does, as we didn't specifically ask sean to fix the grants when we set up stat1003 (and other servers) [18:47:11] (03PS2) 10Reedy: ZeroWiki: Add extra page to whitelist [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/125064 (owner: 10Yurik) [18:47:14] so its probably a wildcard [18:47:15] (03CR) 10Reedy: [C: 032] ZeroWiki: Add extra page to whitelist [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/125064 (owner: 10Yurik) [18:47:28] hmm.. thinking [18:48:00] i'd like to merge that above.. because we think it should not influence anyone [18:48:08] then i'd talk to springle about the grants [18:48:26] (03Merged) 10jenkins-bot: ZeroWiki: Add extra page to whitelist [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/125064 (owner: 10Yurik) [18:50:32] (03PS2) 10Reedy: Change "Open Directory" to DMOZ [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127627 (https://bugzilla.wikimedia.org/64027) (owner: 10Gerrit Patch Uploader) [18:51:05] (03CR) 10Reedy: [C: 032] Change "Open Directory" to DMOZ [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127627 (https://bugzilla.wikimedia.org/64027) (owner: 10Gerrit Patch Uploader) [18:51:13] (03Merged) 10jenkins-bot: Change "Open Directory" to DMOZ [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127627 (https://bugzilla.wikimedia.org/64027) (owner: 10Gerrit Patch Uploader) [18:51:34] (03PS2) 10MarkTraceur: Launch Media Viewer for all users on Russian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134809 (owner: 10Gilles) [18:51:54] (03CR) 10JanZerebecki: "Thank you for the translatewiki link." [operations/puppet] - 10https://gerrit.wikimedia.org/r/132393 (https://bugzilla.wikimedia.org/53259) (owner: 10JanZerebecki) [18:52:29] (03CR) 10Dzahn: "used to be directory.mozilla.org and now owned by AOL ? hrmm" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/127627 (https://bugzilla.wikimedia.org/64027) (owner: 10Gerrit Patch Uploader) [18:52:58] (03CR) 10Reedy: [C: 04-1] "Needs rebasing" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/129464 (owner: 10Ricordisamoa) [18:53:23] (03CR) 10JanZerebecki: Improve nginx TLS/SSL settings. (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/132393 (https://bugzilla.wikimedia.org/53259) (owner: 10JanZerebecki) [18:53:41] (03PS2) 10Reedy: Use an actually generic address as $wmgNotificationSender default [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130037 (https://bugzilla.wikimedia.org/58261) (owner: 10Nemo bis) [18:53:46] (03CR) 10Reedy: [C: 032] Use an actually generic address as $wmgNotificationSender default [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130037 (https://bugzilla.wikimedia.org/58261) (owner: 10Nemo bis) [18:53:55] (03Merged) 10jenkins-bot: Use an actually generic address as $wmgNotificationSender default [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130037 (https://bugzilla.wikimedia.org/58261) (owner: 10Nemo bis) [18:55:17] mutante: that's cool [18:55:26] if anyone has trouble they'll let us know and we can thinkg about what to do then [18:55:34] i don't htink anyone uses it that way, at least not that I know of [18:55:39] ottomata: ok, thank you [18:56:01] (03Abandoned) 10Reedy: Update flow cache versioning prefix [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/131084 (owner: 10EBernhardson) [18:57:26] (03CR) 10Reedy: [C: 04-1] "Needs rebasing" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130809 (https://bugzilla.wikimedia.org/57819) (owner: 10Withoutaname) [18:57:45] (03PS2) 10Reedy: Enable Echo for en_rtl test wiki and enable flow for a test page [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/133969 (owner: 10Ladsgroup) [18:57:50] (03CR) 10Reedy: [C: 032] Enable Echo for en_rtl test wiki and enable flow for a test page [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/133969 (owner: 10Ladsgroup) [18:57:58] (03Merged) 10jenkins-bot: Enable Echo for en_rtl test wiki and enable flow for a test page [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/133969 (owner: 10Ladsgroup) [18:58:37] (03PS3) 10Reedy: Enable FlaggedRevs for Persian Wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/133970 (https://bugzilla.wikimedia.org/65452) (owner: 10Reza) [18:59:11] (03CR) 10Reedy: [C: 032] Enable FlaggedRevs for Persian Wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/133970 (https://bugzilla.wikimedia.org/65452) (owner: 10Reza) [18:59:18] (03Merged) 10jenkins-bot: Enable FlaggedRevs for Persian Wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/133970 (https://bugzilla.wikimedia.org/65452) (owner: 10Reza) [18:59:45] (03PS2) 10Reedy: Set $wgCategoryCollation to 'uca-cs' on cswiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134103 (owner: 10Manybubbles) [19:00:06] (03PS2) 10Reedy: Remove unused AFT config for AbuseFilter [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134482 (owner: 10Nemo bis) [19:00:10] (03CR) 10Reedy: [C: 032] Remove unused AFT config for AbuseFilter [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134482 (owner: 10Nemo bis) [19:00:20] (03Merged) 10jenkins-bot: Remove unused AFT config for AbuseFilter [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134482 (owner: 10Nemo bis) [19:00:52] (03PS4) 10Reedy: Remove Nearby BF from whitelist [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130718 (owner: 10Jforrester) [19:00:56] (03CR) 10Reedy: [C: 032] Remove Nearby BF from whitelist [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130718 (owner: 10Jforrester) [19:01:05] (03Merged) 10jenkins-bot: Remove Nearby BF from whitelist [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130718 (owner: 10Jforrester) [19:01:34] (03PS3) 10Reedy: Remove VE formula editor from BF whitelist (graduated) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130719 (owner: 10Jforrester) [19:01:39] (03CR) 10Reedy: [C: 032] Remove VE formula editor from BF whitelist (graduated) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130719 (owner: 10Jforrester) [19:01:48] (03Merged) 10jenkins-bot: Remove VE formula editor from BF whitelist (graduated) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130719 (owner: 10Jforrester) [19:04:24] !log reedy synchronized wmf-config/ [19:04:29] Logged the message, Master [19:13:10] (03PS3) 10Withoutaname: Create 'noratelimit' user group on dewiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130809 (https://bugzilla.wikimedia.org/57819) [19:15:21] (03CR) 10Dzahn: "talked about this again on IRC today, i would have merged, but can somebody fix the path conflict? It's one of those old patches that exis" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [19:16:07] i would have to upgrade to sid to merge that, heh [19:16:20] but i probably will [19:17:28] mutante: can i ask you to look at ? i verified it using _joe_'s tool http://puppet-compiler.wmflabs.org/change/134642/html/ [19:19:23] hmm, the puppet-compiler link seems broken, fwiw [19:19:28] 504 [19:21:28] ori: i'm a bit confused by role::mediawiki include role::mediawiki [19:23:13] so there is role::mediawiki which has class webserver, which then includes role::mediawiki [19:23:24] mutante: it's role::mediawiki::webserver. that manifest uses nested classes which we don't use elsewhere because they're wtf [19:23:40] i'm cleaning it up gradually [19:24:34] but in principle it's no more bizarre than any other case of a class including its parent, which happens everywhere [19:24:59] is this database error known: https://fa.wikipedia.org/wiki/%D9%88%DB%8C%DA%98%D9%87:%D8%A2%D9%85%D8%A7%D8%B1_%D8%AA%D8%A7%DB%8C%DB%8C%D8%AF%D9%87%D8%A7 ? [19:25:20] srsly [19:25:31] ori: jobrunner will run on everything after this as opposed to just "tmh"? [19:25:35] Some flagged revs tables existed, but others don't? [19:26:24] (03PS4) 10Ori.livneh: Remove mysql client from bastionhost [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [19:26:35] ori: eh, _also_ on tmh, got it [19:26:39] mutante: it was excluding tmh before to work around a parser bug [19:26:40] yeah [19:26:44] Raymond|afk: Fixed, thanks [19:26:51] Reedy: thanks :) [19:26:54] Reedy: editing in fa.wp returns databse error for me too: [19:26:57] تابع: FlaggedRevision::newFromStable [19:26:58] (03CR) 10Ori.livneh: "fixed the path conflict" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [19:26:59] خطا: 1054 Unknown column 'fr_rev_timestamp' in 'field list' (10.64.16.22) [19:27:06] !log Created flaggedrevs_statistics table on fawiki [19:27:11] Logged the message, Master [19:28:34] !log Ran patch-fr_page_rev-index.sql patch on fawiki [19:28:39] Logged the message, Master [19:29:45] the edit saves but we see a message of database error [19:30:13] Amir1: Noted and Reedy is resolving it. [19:30:26] It should be fixed [19:30:30] JohnLewis: thank you [19:30:34] let me check again [19:30:39] Last dberror was 19:27:03 [19:32:22] it's okay now [19:32:24] thank you [19:33:26] Reedy: another 'minor' problem is that even though I'm admin I can't change stability of any page [19:33:46] That's a config/rights problem then [19:34:38] ori: i'm not sure i understand the consequences of this good enough. parser bug.. temp. renaming to something with _ , running on tmh .. and the compiler link i cant see it.. [19:38:42] !log cr1/2-ulsfo: BGP peering with AS11820 (WMF Corp HQ) [19:38:46] Logged the message, Master [19:39:35] (03CR) 10Dzahn: [C: 032] Remove mysql client from bastionhost [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [19:39:53] (03CR) 10Dzahn: "thanks Hoo and Ori" [operations/puppet] - 10https://gerrit.wikimedia.org/r/126027 (owner: 10Hoo man) [19:42:40] (03PS1) 10Faidon Liambotis: Explicitly point WMF OIT address space to ulsfo [operations/dns] - 10https://gerrit.wikimedia.org/r/134873 [19:44:48] akosiaris: around? [19:46:08] (03CR) 10Dzahn: "bd808, did this change your -1 to +1?" [operations/puppet] - 10https://gerrit.wikimedia.org/r/134282 (owner: 10Hoo man) [19:46:38] (03CR) 10Withoutaname: [C: 031] Give testwiki some custom namespaces [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/78016 (https://bugzilla.wikimedia.org/52528) (owner: 10TTO) [19:47:15] Reedy: I found it [19:47:22] (03CR) 10BryanDavis: [C: 031] Remove misc::deployment::scap_scripts from terbium [operations/puppet] - 10https://gerrit.wikimedia.org/r/134282 (owner: 10Hoo man) [19:47:23] it's in Special:Protect [19:47:28] thank you [19:47:32] (03CR) 10Dzahn: [C: 032] "Alex, gotcha, it wasn't merged yet (because of jenkins i suppose), doing that now." [operations/puppet] - 10https://gerrit.wikimedia.org/r/131333 (owner: 10Matanya) [19:48:57] (03CR) 10Faidon Liambotis: [C: 032] Explicitly point WMF OIT address space to ulsfo [operations/dns] - 10https://gerrit.wikimedia.org/r/134873 (owner: 10Faidon Liambotis) [19:50:57] (03CR) 10Dzahn: [C: 032] timeout submit_check_result, see rt #5311 [operations/puppet] - 10https://gerrit.wikimedia.org/r/126209 (owner: 10ArielGlenn) [19:52:51] !log reedy Purged l10n cache for 1.23wmf21 [19:52:55] Logged the message, Master [19:53:05] !log reedy Purged l10n cache for 1.23wmf22 [19:53:11] Logged the message, Master [19:53:42] !log reedy Purged l10n cache for 1.24wmf1 [19:53:47] Logged the message, Master [19:54:03] !log reedy Purged l10n cache for 1.24wmf2 [19:54:07] Logged the message, Master [19:54:22] !log reedy Purged l10n cache for 1.24wmf3 [19:54:26] Logged the message, Master [19:54:31] does anyone know about how big the "Recombine all pages, current versions only." dump from en.wikipedia is when it's uncompressed? [19:54:38] big [19:54:39] :D [19:54:46] !log reedy Purged l10n cache for 1.24wmf4 [19:54:50] Logged the message, Master [19:54:56] i'm putting it on an HDD with 139gb left. is there a risk that will fill up? [19:55:04] Yes [19:55:45] There used to be a figure on meta IIRC [19:56:24] mutante: [19:56:30] speaking of use stuff [19:56:34] dario is pinging me about this: [19:56:35] https://gerrit.wikimedia.org/r/#/c/122401/1 [19:56:40] any objections? [19:56:43] should I get someone else to review? [19:56:49] user stuff* [19:57:13] ottomata: yes, but for other reasons than you might expect [19:57:26] chase would like to "freeze" admins.pp for a little while [19:57:35] because we are converting it to the yaml stuff [19:57:44] can we maybe do it in that right away? how urgent? [19:58:15] * marktraceur waves [19:58:20] jackmcbarn: https://superuser.com/questions/53984/is-there-a-way-to-determine-the-decompressed-size-of-a-bz2-file [19:58:26] who created git repos? a new one is broken/incomplete and l10n-bot freaks out: [19:58:27] hey mutante, ottomata [19:58:27] mutante: dunno, asking DarTar [19:58:28] SpecialSimple [19:58:29] tgr and I are going to push MMV to (it|ru)wikis [19:58:29] fatal: Not a git repository (or any of the parent directories): .git [19:58:38] ottomata: my existing comment in there.. that's not an issue.. if it works [19:59:03] Reedy: so i won't know until i download it. i'll just get the broken up xml files and process them one by one in that case [19:59:05] DarTar: mutante says that chasemp wants to freeze admins.pp for a bit... [19:59:10] how urgent is the research group thing? [19:59:35] DarTar: reading that ticket.. sec [19:59:37] jackmcbarn: "After downloading, the archive needs to be uncompressed; this will turn it from a ~9.1 GB bz2 archive into an XML database dump around 42 GB." [19:59:41] not very urgent, it’s just that we are stuck in this limbo where I share SQL creds by via files hosted on fenari or stat1003 [19:59:58] jackmcbarn: Maybe 50GB then [20:00:59] (03CR) 10MarkTraceur: [C: 032] Launch Media Viewer for all users on Italian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134808 (owner: 10Gilles) [20:01:05] aggregating all creds on stat1003 and removing the dependency on fenari could be a good first step (as many people who need SQL access do not necessarily have shell access to fenari) [20:01:10] (03Merged) 10jenkins-bot: Launch Media Viewer for all users on Italian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134808 (owner: 10Gilles) [20:01:22] mind you, they may not even have access to stat1003 FWIW [20:01:30] (03CR) 10MarkTraceur: [C: 032] Launch Media Viewer for all users on Russian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134809 (owner: 10Gilles) [20:01:31] sorry, my own fault :-( [20:01:38] (03Merged) 10jenkins-bot: Launch Media Viewer for all users on Russian wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134809 (owner: 10Gilles) [20:01:59] but I’d love to find a way to make myself unnecessary in the access request process [20:02:03] DarTar: it seems to be more than one problem (gpg password sharing ticket not resolved yet, mysql permissions, adding new researcher group) [20:02:05] so any suggestion is welcome [20:02:43] the minimal problem we’re trying to solve really is sharing MySQL creds [20:03:12] DarTar: how about this.. i'll take the ticket and make _something_ happen on it during this week [20:03:14] now, if springle thinks we can revert to individual SQL credentials, I’m all for closing this ticket [20:03:18] can also ping springle [20:03:27] sounds goodf [20:03:47] and about adding the researchers group [20:03:57] i'll do it in cooperation with chase.. in the yaml data [20:04:04] !log marktraceur updated /a/common to {{Gerrit|I1c658bf65}}: Remove VE formula editor from BF whitelist (graduated) [20:04:06] kk [20:04:08] Logged the message, Master [20:04:51] thanks guys! [20:05:16] (03PS1) 10Rush: bast1001 to admin yaml [operations/puppet] - 10https://gerrit.wikimedia.org/r/134921 [20:05:18] (03PS1) 10Rush: data.yaml sanity testing [operations/puppet] - 10https://gerrit.wikimedia.org/r/134922 [20:05:20] (03PS1) 10Rush: data.yaml sorted [operations/puppet] - 10https://gerrit.wikimedia.org/r/134923 [20:06:14] !log marktraceur synchronized mediaviewer.dblist 'Enabling Media Viewer on itwiki and ruwiki by default' [20:06:17] (03CR) 10Dzahn: [C: 04-1] "please wait a little bit longer and we'll add this via the yaml data" [operations/puppet] - 10https://gerrit.wikimedia.org/r/122401 (owner: 10Ottomata) [20:06:18] Logged the message, Master [20:07:27] my goal is to run "SELECT old_id FROM text WHERE old_text LIKE ''" against all of the current revisions on enwiki. is there a faster way to do that than downloading all of the dumps? [20:07:27] Hrm [20:07:41] ottomata: and yea, totally related, thanks for bringing it up [20:07:45] (03CR) 10jenkins-bot: [V: 04-1] data.yaml sanity testing [operations/puppet] - 10https://gerrit.wikimedia.org/r/134922 (owner: 10Rush) [20:08:23] I changed a dblist and it didn't seem to have any effect - do I need to touch InitialiseSettings.php or something? [20:08:48] Reedy: ^ [20:09:08] marktraceur: yup, touch and sync it [20:09:31] jackmcbarn: tool labs? [20:09:47] Reedy: will they not mind a query like that? [20:10:00] It's like toolserver in that regard... [20:10:27] i've never really done much with toolserver or labs [20:10:51] !log marktraceur synchronized wmf-config/InitialiseSettings.php 'Sync for mediaviewer.dblist change' [20:10:55] Logged the message, Master [20:12:44] All right I think that's it... [20:12:56] I've used 15 minutes of a 2-hour window; anyone else need something deployed? :P [20:13:04] marktraceur: :P [20:13:08] marktraceur: alls good? [20:13:11] Seems to be [20:13:14] how's those dashboards ya'll created? [20:13:21] * marktraceur shall look [20:13:27] * bd808|LUNCH thinks marktraceur's question we related to https://bugzilla.wikimedia.org/show_bug.cgi?id=58618 [20:13:35] greg-g: Well, the dashboards *we* created only update once a day [20:13:35] *was [20:13:59] marktraceur: Do this more often and then ask people to pay you in return for using your deploy window :D [20:14:21] Call it the 'Marktraceur Deploy Rental Window' :D [20:14:29] JohnLewis: I'll deploy your code if you review five UploadWizard patches... >.> [20:15:00] marktraceur: My patches are creating a new wiki which no one can agree on the domain name with :p [20:15:07] can't help ya there [20:15:23] Which do we use? us-ne, or ne-us or neus or usne >.> [20:15:32] It's neus to me [20:15:41] :D [20:15:41] * greg-g sighs [20:19:19] JohnLewis: us-ne. [20:19:37] JohnLewis: But us_newikimedia for dbname. [20:20:10] James_F: Tell Daniel :p I've patched that, he says 'use ne-us' [20:20:43] JohnLewis: Hmm; Apparently yes, we went for pa-us. Oh well. [20:21:16] James_F: That is probably because the wiki was at pa.us.wikimedia.org. and the db was most likely pa_uswikimedia. [20:21:46] JohnLewis: Ick. [20:21:59] However the chapter is recognised as US-NE so... :p best to have it as the domain :p [20:22:08] http://pa-us.wikimedia.org/wiki/Main_Page [20:22:18] tell me how it makes sense to have pa-us but us-ne [20:22:29] James_F: [20:22:34] mutante: It doesn't make sense to have pa-us. :-) [20:22:46] legacy sense [20:22:47] mutante: Foo.bar.baz -> bar-foo.baz, not foo-bar.baz. [20:22:49] Indeed. [20:22:54] would i usually base my reviews on expecting all exising stuff is wrong:) [20:23:02] well, ok [20:23:21] Reedy: they are asking us to rename pa-us again [20:23:25] naming is hard [20:23:25] * James_F laughs. [20:23:27] from pa.us to us-pa this time [20:23:50] eh, it was already renamed once [20:23:52] it's a fscking closed/locked wiki [20:23:53] pa-us [20:23:54] Also, we should fix the HTTP spec to be org.wikipedia.en, obviously. [20:23:57] Reedy: I suppose Jeremy's idea, let's us 0.wikimedia.org for enwiki, 1.wikimedia.org for the next wiki etc :p [20:24:10] * James_F stops trolling. [20:24:24] you are all wrong. it's .wiki [20:24:26] Eww. [20:24:31] ugh [20:24:43] * JohnLewis imagines pedia.wiki :p [20:24:45] pa.us.wiki [20:25:07] media.wiki :D [20:25:09] I had forgotten that was a thing [20:25:12] http://nic.wiki [20:27:23] JohnLewis: we're not getting it,, tried [20:27:34] but we'll eventually have w. [20:27:57] we had it for like a day [20:28:29] mutante: so no media.wiki to cause confusion for everyone? :( [20:28:41] mutante: Clearly we really want the entire `wiki.` TLD, so we can have en.wiki/pedia/Foo, right? [20:29:02] James_F: +1 :D [20:29:06] (03PS2) 10MarkTraceur: Launch Media Viewer for all users on all Wikisources [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134810 (owner: 10Gilles) [20:29:11] James_F: i tried exactly that [20:29:23] (03PS2) 10MarkTraceur: Launch Media Viewer for all users on German wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134811 (owner: 10Gilles) [20:29:31] (03PS2) 10MarkTraceur: Launch Media Viewer for all users on English wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134812 (owner: 10Gilles) [20:29:31] we were too slow, not enough support before Raymond got it [20:29:34] mutante: Asking for a * cert for `wiki.` might be fun for our supplier. :-) [20:30:05] haha. [20:30:15] Wildcard SSL cert for an entire TLD. Fun. [20:30:29] you know what's even worse [20:30:35] wg.en.wikipedia.org [20:30:42] mutante: (My last employer decided to turn gov.uk into an actual domain, rather than just a cc2LD; then we got a cert for *.gov.uk, even though there are 1000s of other websites in .gov.uk…) [20:30:50] yes, SSL warning, yes we can't get *.* [20:31:09] wg.en.wikipedia.org should have never existed in wikipedia.org [20:31:15] should be wikimedia [20:31:26] wg? [20:31:29] Speaking of .uk domains, we're supposed to be able to register our own domains directly under .uk soon [20:31:33] working group or something [20:31:37] imho wikipedia should be language versions of the encyclopedia [20:31:40] and nothing else [20:31:40] Oh. Just burn it. [20:31:43] well gimpy [20:32:11] I remember asking for them to have a wiki, but not at a silly name. [20:32:36] _joe_, still working? [20:32:38] so, if you say we're renaming or deleting pa-us [20:32:57] then ne-us seems ok'ish [20:33:43] what the hell is wg.en.wikipedia.org [20:33:49] * us-ne [20:33:54] twkozlowski: Indeed. [20:33:59] We just had this discussion, twkozlowski [20:34:05] mutante: us-ne would follow MSB-first… [20:34:05] http://wg.en.wikipedia.org/wiki/Main_Page [20:34:19] James_F, (http://www.nominet.org.uk/uk-domain-names/introducing-uk) [20:34:41] Krenair: Yeah, I know. :-( Money-grabbing scum. [20:34:44] Krenair: Yes, that's what prompted me to have a look at it. [20:35:08] James_F: JohnLewis , eh yes, us-ne [20:35:28] but i don't think we need to follow what templates say [20:35:44] they are easier to fix than dbnames :p [20:36:14] silly question, may I? [20:36:24] is us-ne the newly created Nebraska group? [20:36:35] hehe, nice one [20:36:40] New England. but good point [20:37:26] actually, i have made a suggestion about this before [20:37:36] when we talked about wikimedia.us [20:37:38] Why is New England blocking the ISO code for Nebraska? [20:37:49] i once said why don't we put all US chapters into wikimedia.us [20:38:03] but not much support for that [20:38:06] twkozlowski: AffCom have assigned it to Wikimedia New England. [20:38:19] AffCom have no authority to assign ISO codes [20:38:45] can AffCom make a decision about pa-us :) [20:38:49] JohnLewis: Hmm, twkozlowski has a point. [20:39:06] AffCom assign what they want to Wikimedia Chapters :p [20:39:32] They're reserving US-NB for Wikimedia Nebraska, [20:39:42] us-hurrdurr.wikipedia.org [20:40:05] JohnLewis: AffCom don't dictate our URL naming conventions, though. [20:40:13] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: CRITICAL - Socket timeout after 10 seconds [20:40:13] Perhaps the system admins refusing to grant a domain might make them reconsider. [20:40:20] Indeed. [20:40:32] And yeah, why is the US having multiple chapters anyway? [20:40:32] James_F: They don't but fuck it :p [20:40:33] I suggest you check what is the middle point of New England, and pick the ISO code for the state it's located in. [20:40:42] i still think that wikimedia.us is a natural for wikimedia chapters in the US.. [20:40:52] mutante: Agreed; do we own it? [20:40:58] WMDC does [20:41:26] we do [20:41:31] Administrative Contact Name: Michelle Paulson [20:41:44] oh, true. [20:41:54] it got transferred [20:42:04] that's when we talked about what to do with it.. but it's been awhile [20:43:10] mutante: JFDI? :-) [20:43:12] Wikipedia says that the geographic centre of New England is near Dunbarton, New Hampshire [20:43:17] Pick us-nh then? [20:45:20] James_F: Then we need yet another * SSL cert... [20:45:23] :D [20:45:44] Reedy: Well, true. [20:45:55] But that's relatively easy [20:46:01] We just need to sweet talk RobH [20:46:03] (Why doesn't the US have state-grouping codes?) [20:46:07] think about the time we've already spent on this , multiplied by our hourly rate [20:46:10] then compare to cert price [20:46:38] Does WMF have to pay for adding extra domains to the existing cert? [20:46:44] even though i hate to say it.. this sounds like we'd want some kind of meeting / poll [20:46:46] oh yea [20:46:48] its not cheap [20:47:12] well, its not that expensive i guess [20:47:25] i just hate that ssl certs are paying for a line of code that doesnt do anything but sit in browser CA store [20:48:03] we dont purchase country specific SSL certs though [20:48:16] so pushing that one would make a precedent case so expect to have to justify why [20:48:55] am i missing something? why don't i see a text table in enwiki_p? [20:48:59] root cause: US needs more than 1 chapter [20:49:11] all other countries have just one [20:50:00] mutante: Hey, other countries only have 1 chapter proposed :p [20:50:19] If I was to promote a WMUK-North, they'll approve it :p [20:50:30] James_F: JohnLewis to clarify, i just personally gave a -1 because of the concerns with pa-us, that is not equal to "ops is blocking" [20:50:44] other root cause: there ain't no ISO code for a pan-state region [20:50:53] indeed, the us domain versus state chapters is an issue [20:50:58] that i didnt think of, glad mutante did [20:51:02] (or mentioned it at least) [20:51:17] (03PS1) 10Reedy: Redirect arbcom.*.wikipedia.org -> arbcom-*.wikipedia.org [operations/apache-config] - 10https://gerrit.wikimedia.org/r/134932 [20:51:20] mutante: now we can have a state versus federal in the US argument? ;] [20:51:35] RobH: it all started with the request for ne-us.wm.org while we already have pa-us.wm.org [20:51:50] RobH: heh, yes:) [20:51:56] * twkozlowski notes with some amusement that there is an ISO organization for New England that uses the name ISO-NE [20:52:21] (03CR) 10Andrew Bogott: "This turns out to be impossible to test w/out merging into prod. And I'm convinced that it doesn't break anything, at least. so..." [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 (owner: 10Giuseppe Lavagetto) [20:52:24] RobH: it's a group of states though [20:52:30] true [20:52:43] (03PS5) 10Andrew Bogott: puppet: pin packages to the correct version [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 (owner: 10Giuseppe Lavagetto) [20:52:43] (03PS2) 10Reedy: Redirect arbcom.*.wikipedia.org -> arbcom-*.wikipedia.org [operations/apache-config] - 10https://gerrit.wikimedia.org/r/134932 [20:52:43] which makes it even more complicated [20:52:43] plus many of those states are geographically closer [20:52:44] because now "NE" is Nebraska too [20:52:44] so it makes sense for them [20:52:49] oh, yes... [20:53:02] eww [20:53:16] The chapters folks need to determine how they are going to logically determine that [20:53:28] but for subdomain, eww. [20:53:28] newengland.wikimedia.us [20:53:48] RobH: they say pa-us was wrong in the first place :) [20:54:07] well, us.wm.org for everyone! [20:54:13] and remove ny's special thing [20:54:22] us.wm.org/statechapterwhatervername [20:54:33] * RobH waits for the hate from NY to roll in [20:54:37] (03CR) 10Andrew Bogott: [C: 032] puppet: pin packages to the correct version [operations/puppet] - 10https://gerrit.wikimedia.org/r/134823 (owner: 10Giuseppe Lavagetto) [20:54:40] who wants subdomain? [20:54:44] * aude rage ;) [20:54:51] RobH: or; chapter.wikimedia.org with the WMF logo going 'Pay for your own domain and server space' :p [20:54:54] we should have a talk about this at Wikimania [20:54:59] with all the chapters [20:55:00] heh [20:55:07] .... [20:55:12] better just to have wikimediadc.org [20:55:18] imho [20:55:21] mutante: you just made it so i cannot go to like a third of the talks i go to now without agreeing to that conversation! [20:55:26] heh [20:55:28] RobH: hahaa [20:55:38] nah, i dont mind discussion about it [20:55:48] i just dont want to be giving any one chapter special urls that others cannot get [20:55:54] seems shitty [20:56:03] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 54262 bytes in 0.324 second response time [20:56:05] (so would be nice to have a standard on subdomains) [20:56:14] i don't want them to get their own domains like wikimediadc.org either [20:56:20] hrmmm [20:56:22] well, i dont want them to HAVE to get [20:56:29] but they should be allowed to if they so choose of course [20:56:53] but we need a set url scheme for chapters so if they dont want to get and pay for a localized domain name they can use the generic one [20:56:56] yankee.wikimedia.org [20:56:59] it really is better... then we manage our subdomains [20:56:59] and apply that standard to everyone [20:57:03] that will just turn into discussion who gets to own them and the trademark talk with legal [20:57:12] and indeed, autonomy of chapters is a good thing [20:57:20] blog.* wiki.* [20:57:21] <^d> twkozlowski: s/yankee/damnyankees/ [20:57:22] whatever [20:57:22] <^d> Fixed. [20:57:40] mutante: did we start pulling domains from chapters without their permission? [20:57:50] cuz usually they keep domains unless they dont wanna deal with them [20:57:57] RobH: i don't know anymore since we gave up on the domains queue [20:57:59] not sure how its a legal discussion unless they are [20:58:07] yea i dont think they are, aude would have heard about it [20:58:21] (well, any chapters person would have heard) [20:58:21] like wikimania.org being owned by WMCH [20:58:36] ask Yana [20:58:38] we just (as the foundation) dont publish materials using domains we dont own [20:58:46] she sill say something about trademark protection [20:58:49] ive been involved in the wikimania domain shit since day 1 =P [20:59:22] thats slightly different as its not a specific chapter, but not so much that I would advise legal trying to simply take it without permission [20:59:27] or WMF can just buy the domains for the chapters [20:59:39] meh [20:59:44] (03PS3) 10Reedy: Redirect arbcom.*.wikipedia.org -> arbcom-*.wikipedia.org [operations/apache-config] - 10https://gerrit.wikimedia.org/r/134932 [20:59:46] That's just asking for trouble. [20:59:54] i dont like that idea [21:00:05] i like that chapters can buy their own shit and not depend on foundation [21:00:20] mutante: see, this has turned in to a state versus federal type discussion! [21:00:23] \o/ [21:00:25] IF they host their own wiki [21:00:29] but not if we host the wiki [21:00:34] if they buy the domain [21:00:43] and redirect it on their own to the full wmf url [21:00:46] i dont see an issue [21:00:55] i thought we never wanted to host wikis under domain names we don't own [21:00:58] RobH: As I said, make chapter.wikimedia.org a blank site with 'Buy your own shit instead of asking us' :p [21:01:01] but usually if they buy the domain they also host it themselves [21:01:16] <^d> I have my own domain but I don't host a wiki on it. [21:01:16] mutante: we wouldnt host the wiki at their domain [21:01:18] <^d> wikis are silly. [21:01:21] but @ our own [21:01:45] (03CR) 10Dzahn: "taking back my -1, +/- 0" [operations/dns] - 10https://gerrit.wikimedia.org/r/133980 (https://bugzilla.wikimedia.org/64557) (owner: 10John F. Lewis) [21:02:13] mutante: so +1 in theory? :p [21:02:21] no, 0 [21:02:44] JohnLewis: I dont have an issue wiwth them asking us [21:02:48] mutante: You know I meant -1+1 = 0 [21:02:55] as long as we have a set policy that we offer the exact same thing to every chapter who wants it [21:03:17] I don't want any one chapter getting favoritism because we happen to know them better or whatever and can convince someone to do it [21:03:35] (03CR) 10Dzahn: "i'm neutral now" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/133991 (https://bugzilla.wikimedia.org/64557) (owner: 10John F. Lewis) [21:03:38] in the past i was constantly afraid of doing that with non english speaking projects/chapters [21:04:00] Anyone here able to grant editbugs on BZ? Someone wants it in #wikimedia-tech [21:04:15] Not I, I gave up my rights on BZ [21:04:28] Andre is not here so idk [21:04:40] i think someone else in ops kept their bz rights though, but not sure [21:04:53] Krenair: Look at the sysadmin list on meta? I think it lists BZ admins or so [21:04:53] rephrase, i'd think , not i think, i dont know. [21:04:56] i did not [21:05:11] JohnLewis, used to before I removed that [21:05:26] Krenair: k :p [21:06:02] (I think it listed Andre, Siebrand and Thehelpfulone. And was not really any more related to system administration than wiki sysops) [21:06:19] heh, that page is outdated too [21:06:21] i think [21:06:26] didnt ryan give up his root? [21:06:39] lemme see [21:06:39] RobH: I think he did. [21:06:53] I remember seeing a patch anyway [21:07:24] nope, its still live [21:07:27] i stand corrected! [21:07:29] (03PS1) 10Reedy: Add noboard-chapters to DNS to allow rename from noboard.chapters [operations/dns] - 10https://gerrit.wikimedia.org/r/134934 [21:07:42] ryan still does a lot of stuff so makes sense [21:08:35] (03CR) 10RobH: "and what if a chapter in the actual state abbreviation wants the subdomain?" [operations/dns] - 10https://gerrit.wikimedia.org/r/133980 (https://bugzilla.wikimedia.org/64557) (owner: 10John F. Lewis) [21:09:16] notice the 0 cuz i dont actually have a personal stake ;] [21:10:21] (03PS1) 10Chad: Remove FlaggedRevs from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134935 [21:10:23] (03CR) 10John F. Lewis: "It's what AffCom gave the chapter. They said the only conflict would be Wikimedia Nebraska but they're using US-NB or US-NEB for it. So US" [operations/dns] - 10https://gerrit.wikimedia.org/r/133980 (https://bugzilla.wikimedia.org/64557) (owner: 10John F. Lewis) [21:10:30] RobH ^ [21:10:40] ahh, then good enough for me. [21:11:31] (03CR) 10Brian Wolff: [C: 031] "https://www.mediawiki.org/wiki/Special:PendingChanges covers it pretty well" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134935 (owner: 10Chad) [21:11:54] (03CR) 10Reedy: [C: 04-1] "Wrong file" [operations/dns] - 10https://gerrit.wikimedia.org/r/134934 (owner: 10Reedy) [21:12:43] (03PS1) 10Reedy: Change URLs for noboard.chapters and wg.en to fix SSL cert issues [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134936 [21:14:49] (03Abandoned) 10Reedy: Add noboard-chapters to DNS to allow rename from noboard.chapters [operations/dns] - 10https://gerrit.wikimedia.org/r/134934 (owner: 10Reedy) [21:15:27] (03CR) 10Withoutaname: [C: 031] Remove FlaggedRevs from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134935 (owner: 10Chad) [21:17:49] (03CR) 10Chad: [C: 032] Remove FlaggedRevs from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134935 (owner: 10Chad) [21:17:56] (03Merged) 10jenkins-bot: Remove FlaggedRevs from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134935 (owner: 10Chad) [21:19:37] !log demon synchronized flaggedrevs.dblist 'Removing FR from mw.org' [21:19:42] Logged the message, Master [21:21:03] !log demon synchronized wmf-config/flaggedrevs.php 'Removing FR from mw.org' [21:21:08] Logged the message, Master [21:22:18] !log demon synchronized wmf-config/InitialiseSettings.php 'touching' [21:22:23] Logged the message, Master [21:26:19] (03PS1) 10Reedy: wg.en -> wg-en and noboard.chapters -> noboard-chapters [operations/apache-config] - 10https://gerrit.wikimedia.org/r/134939 [21:26:58] (03PS2) 10Reedy: Change URLs for noboard.chapters and wg.en to fix SSL cert issues [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134936 [21:27:11] (03CR) 10Reedy: [C: 032] Change URLs for noboard.chapters and wg.en to fix SSL cert issues [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134936 (owner: 10Reedy) [21:27:20] (03Merged) 10jenkins-bot: Change URLs for noboard.chapters and wg.en to fix SSL cert issues [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134936 (owner: 10Reedy) [21:29:05] (03CR) 10Nemo bis: "Now we have 3(+1) user groups with the same permission:" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134935 (owner: 10Chad) [21:29:07] !log reedy synchronized wmf-config/InitialiseSettings.php 'Iecdd8c5e60a142363b40e34d4fe2f27f0e5feef5' [21:29:11] Logged the message, Master [21:31:44] (03PS1) 10Withoutaname: Remove flaggedrevs-specific user groups from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134940 [21:33:19] !log reedy synchronized multiversion/MWMultiVersion.php [21:33:24] Logged the message, Master [21:33:38] (03CR) 10Nemo bis: [C: 04-1] "Not before they're emptied. https://gerrit.wikimedia.org/r/134935" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134940 (owner: 10Withoutaname) [21:34:56] <^d> Nemo_bis: I guess I should clean up after myself. [21:35:18] (03CR) 10Ori.livneh: [C: 032] Fix /a/common/php symlink creation [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) (owner: 10BryanDavis) [21:35:39] (03Merged) 10jenkins-bot: Fix /a/common/php symlink creation [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134641 (https://bugzilla.wikimedia.org/64748) (owner: 10BryanDavis) [21:36:07] (03CR) 10Nemo bis: Remove flaggedrevs-specific user groups from mediawiki.org (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134940 (owner: 10Withoutaname) [21:36:25] (03PS2) 10Reedy: wg.en -> wg-en and noboard.chapters -> noboard-chapters [operations/apache-config] - 10https://gerrit.wikimedia.org/r/134939 [21:38:06] (03PS1) 10Reedy: Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 [21:38:29] ^d: and I need to protect a couple pages I had unprotected [21:38:41] (03PS1) 10Reedy: Revert "Change URLs for noboard.chapters and wg.en to fix SSL cert issues" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134949 [21:39:03] (03PS2) 10Reedy: Revert "Change URLs for noboard.chapters and wg.en to fix SSL cert issues" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134949 [21:39:17] (03CR) 10jenkins-bot: [V: 04-1] Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 (owner: 10Reedy) [21:39:21] (03PS3) 10Reedy: Revert "Change URLs for noboard.chapters to fix SSL cert issues" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134949 [21:39:37] logs don't look particularly well https://www.mediawiki.org/w/index.php?title=Special:Log&page=Extension%3ATimedMediaHandler%2FClient+download [21:40:17] Can't we keep the extension enabled just revoking all user permissions and all effects? [21:40:51] (03PS4) 10Reedy: Revert "Change URLs for noboard.chapters to fix SSL cert issues" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134949 [21:41:17] (03CR) 10Reedy: [C: 032] Revert "Change URLs for noboard.chapters to fix SSL cert issues" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134949 (owner: 10Reedy) [21:41:25] (03Merged) 10jenkins-bot: Revert "Change URLs for noboard.chapters to fix SSL cert issues" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134949 (owner: 10Reedy) [21:41:31] :( those domains are really damned [21:42:01] i think Reedy fixed https://bugzilla.wikimedia.org/show_bug.cgi?id=64977 though [21:42:16] http://wg-en.wikipedia.org/wiki/Main_Page [21:42:41] I'm wondering if I should close the bug before the apache redirect is live [21:42:59] !log reedy synchronized wmf-config/InitialiseSettings.php 'Ifd048bafe0eb4af8765cee20a3d93d7663b1bcdf' [21:43:03] Logged the message, Master [21:43:22] Reedy: oh.. yea.. just say that you renamed it but leave open i suppose [21:43:34] (03PS2) 10Reedy: Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 [21:43:35] (03CR) 10jenkins-bot: [V: 04-1] Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 (owner: 10Reedy) [21:45:18] (03PS3) 10Reedy: Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 [21:45:27] (03CR) 10jenkins-bot: [V: 04-1] Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 (owner: 10Reedy) [21:47:43] (03PS4) 10Reedy: Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 [21:48:17] (03CR) 10jenkins-bot: [V: 04-1] Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 (owner: 10Reedy) [21:51:16] (03PS5) 10Reedy: Remove remnants of . replaced with _ in "lang" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134948 [21:54:54] (03PS3) 10Reedy: wg.en -> wg-en and noboard.chapters -> noboard-chapters [operations/apache-config] - 10https://gerrit.wikimedia.org/r/134939 (https://bugzilla.wikimedia.org/64977) [21:55:16] (03PS4) 10Reedy: Redirect arbcom.*.wikipedia.org -> arbcom-*.wikipedia.org [operations/apache-config] - 10https://gerrit.wikimedia.org/r/134932 (https://bugzilla.wikimedia.org/31335) [21:56:48] (03PS2) 10Reedy: Simplify wmf-config listings in createTxtFileSymlinks.sh [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134118 [21:56:52] (03CR) 10Reedy: [C: 032] Simplify wmf-config listings in createTxtFileSymlinks.sh [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134118 (owner: 10Reedy) [21:57:01] (03Merged) 10jenkins-bot: Simplify wmf-config listings in createTxtFileSymlinks.sh [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134118 (owner: 10Reedy) [22:08:13] (03PS1) 10Mwalker: Allow override of redis host [operations/puppet] - 10https://gerrit.wikimedia.org/r/134958 [22:08:37] anyone want to review ^? [22:08:37] :D [22:09:15] (03CR) 10jenkins-bot: [V: 04-1] Allow override of redis host [operations/puppet] - 10https://gerrit.wikimedia.org/r/134958 (owner: 10Mwalker) [22:10:23] (03PS3) 10Reedy: Remove wiktionary.wikipedia.org from rewrites as it is not in DNS. [operations/apache-config] - 10https://gerrit.wikimedia.org/r/92799 [22:10:28] (03PS2) 10Mwalker: Allow override of redis host [operations/puppet] - 10https://gerrit.wikimedia.org/r/134958 [22:10:34] (03CR) 10FunPika: [C: 04-1] Remove flaggedrevs-specific user groups from mediawiki.org (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134940 (owner: 10Withoutaname) [22:11:45] Reedy: lol @ wiktionary.wikipedia.org .. yea [22:12:00] (03CR) 10Reedy: "Is usne a plausible redirect?" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/133991 (https://bugzilla.wikimedia.org/64557) (owner: 10John F. Lewis) [22:12:40] (03CR) 10Reedy: [C: 04-1] "Needs changing over to the new system if this is ever going to go forward" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/24407 (owner: 10Jeremyb) [22:13:07] (03Abandoned) 10Reedy: Make votewiki use http:// in wgCanonicalServer [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/83566 (owner: 10Reedy) [22:13:10] (03Abandoned) 10Reedy: Remove redirect from http to https from votewiki [operations/apache-config] - 10https://gerrit.wikimedia.org/r/83565 (owner: 10Reedy) [22:13:35] Reedy: what do you mean by 'a plausible redirect' in this case? [22:13:48] Reedy: https://gerrit.wikimedia.org/r/#/c/53922/ [22:14:11] JohnLewis: Why would we need/want usne? [22:14:20] It's a new wiki [22:14:25] You tell people to use us-ne [22:14:59] mutante: ? [22:15:05] Reedy: The chapter wanted usne while we wanted us-ne. All I'm going to say is - tell them and -2 it. I'm not bothered about it in all honesty. [22:16:20] Reedy: nothing besides that it's about renaming wikis and old .. so kind of asking for status [22:16:26] us-ne I can cope with ;) [22:16:34] found it while looking for the other renaming bugs you linked me to [22:16:48] looked at open Apache changes [22:17:10] Reedy: I'll abandon the patches if you want :P idc about those 2 patches in all honesty [22:17:32] The other is fine [22:17:38] That redirect... [22:18:12] you can say dashes in domains are bad for SEO /me hides [22:18:22] haha [22:18:47] :p [22:19:28] I do wonder if we could do faux renaming for some of the wikis [22:19:37] what's faux renaming? [22:19:39] ie the chapcom affcom [22:20:01] keep the dbname the same, but have some sort of mapping in code to point the new url back to the old dbname [22:20:20] But weird edge cases [22:20:31] Hm. [22:20:47] it should just be multiversion setting dbnames [22:20:48] mwalker: have you seen https://news.ycombinator.com/item?id=7781878 ? [22:20:56] springle: hello :) [22:20:59] https://github.com/wikimedia/operations-mediawiki-config/blob/master/multiversion/MWMultiVersion.php#L225 [22:21:05] ori, iiiiinnnnnteerrrresting [22:21:12] Well, I'm gonna head off. Reedy: Send all complaints to Bugzilla, Gerrit or RT, there's something open for it is everything or my email. If you don't have it - use your access to find it ;) [22:21:24] *everywhere [22:21:26] oh, PDF rendering [22:21:32] if lang == chapcom lang = affcom [22:21:34] ori, I was just wondering if you'd like to be my favouritist person with puppet +2 for PDF: https://gerrit.wikimedia.org/r/#/c/134958/ [22:21:57] ^d: Thanks for the right change :p [22:22:13] <^d> JohnLewis: You're welcome :p [22:22:44] ^d: If I may offer advice, tick all the checkboxes next time for me :D [22:23:28] mwalker: can't at the moment, sorry [22:23:57] *sniff* ok! thanks for the lead on pdfium [22:24:05] sorry :/ [22:24:29] (03PS1) 10Reedy: Add docblock parameter type hints [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134961 [22:24:49] (03CR) 10Reedy: [C: 032] Add docblock parameter type hints [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134961 (owner: 10Reedy) [22:24:57] (03Merged) 10jenkins-bot: Add docblock parameter type hints [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134961 (owner: 10Reedy) [22:26:36] (03CR) 10Dzahn: [C: 032] Allow override of redis host [operations/puppet] - 10https://gerrit.wikimedia.org/r/134958 (owner: 10Mwalker) [22:28:04] mutante! thanks :D [22:28:25] (03PS1) 10Reedy: Allow faux-renaming/database remapping [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134962 [22:28:40] (03CR) 10jenkins-bot: [V: 04-1] Allow faux-renaming/database remapping [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134962 (owner: 10Reedy) [22:29:16] (03CR) 10BryanDavis: "Drive by comment inline." (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/134958 (owner: 10Mwalker) [22:29:50] (03PS2) 10Reedy: Allow faux-renaming/database remapping [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134962 [22:30:54] I'm really not sure what I think of that [22:32:37] (03CR) 10Reedy: Allow faux-renaming/database remapping (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134962 (owner: 10Reedy) [22:32:55] (03CR) 10Dzahn: "cool and evil at the same time. avoids database renaming but kind of evil hack" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134962 (owner: 10Reedy) [22:34:17] Reedy: i'd say let's hear the dba one more time how complicated the renaming is [22:35:03] heh [22:35:11] you're reviewing yourself with inline comments:) [22:35:17] I wasn't going to rush in and start using that [22:35:31] It's ES that's the problem (as always) [22:35:46] arr, yea [22:35:52] <^d> external store or elasticsearch? [22:35:56] <^d> ES is VAGUE! [22:36:05] in this case storage:) [22:36:15] they make it hard to rename db's [22:36:28] hence the work-around Reedy is suggesting [22:36:36] aka "faux renaming" :) [22:38:04] It could be possible to just do something like that for ES... but yeah [22:38:52] Ask le expert [22:51:10] (03PS2) 10Withoutaname: Remove flaggedrevs-specific user groups from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134940 [22:51:10] (03PS3) 10Withoutaname: Remove flaggedrevs-specific user groups from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134940 [22:53:49] (03PS1) 10Dzahn: some small lint/style fixes on 'ocg' [operations/puppet] - 10https://gerrit.wikimedia.org/r/134966 [22:53:56] bd808|LUNCH, do you know anything about puppet in betalabs not allowing me to `sudo puppetd --test --verbose`, returning "Skipping run of Puppet configuration client; administratively disabled" [22:54:31] mwalker: Somebody ran `pupptd --disable` on that node. [22:54:52] Do `puppetd --enable` to turn the block off [22:54:55] hmm; nothing in the SAL [22:55:06] Which host? [22:55:11] deployments-pdf01 [22:55:24] Huh. Wasn't me [22:55:35] mwalker: there is also a bug [22:55:37] that does this [22:55:43] what's interesting is that all the vms in the deployment-prep group have stale puppet status [22:56:04] we've had this happen before in prod [22:56:11] I think that is realated to our self-hosted puppetmaster [22:56:14] puppet agents got into admin disabled state [22:56:30] (the stale status) [22:56:51] nooo! :'( "err: /Stage[main]/Role::Labs::Instance/Mount[/public/keys]: Could not evaluate: undefined method `[]=' for #" [22:57:06] ? [22:57:22] that was on the puppet run [22:57:29] not related to any change I've made [22:59:07] mutante, speaking of which; did you sync the pdf commit you +2'd? [22:59:37] mwalker: yes [22:59:51] mwalker: https://gerrit.wikimedia.org/r/#/c/134966/1 [23:00:08] couldn't resist [23:01:03] that error looks like a general labs issue? ehmm, no idea yet [23:01:08] * greg-g still smiles every time he sees jouncebot announce the deploy window [23:01:15] but not mounting public keys sounds bad [23:01:24] (03CR) 10Mwalker: [C: 031] "I like it! Thanks." [operations/puppet] - 10https://gerrit.wikimedia.org/r/134966 (owner: 10Dzahn) [23:02:30] (03CR) 10Dzahn: [C: 032] some small lint/style fixes on 'ocg' [operations/puppet] - 10https://gerrit.wikimedia.org/r/134966 (owner: 10Dzahn) [23:02:36] (03PS1) 10Withoutaname: Remove flaggedrevs-specific user groups from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134969 [23:03:32] (03CR) 10Steinsplitter: [C: 031] Create 'noratelimit' user group on dewiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/130809 (https://bugzilla.wikimedia.org/57819) (owner: 10Withoutaname) [23:03:42] mwalker: is it an option to use the actual puppetmaster? [23:03:59] add the role class as puppet group in labs [23:04:06] configure instance, select role class... i mean [23:04:21] I thought I was doing that already? [23:04:57] (03CR) 10Rillke: "The comments on-wiki could have had been a bit more explanative. If I would do like that on commonswiki people would cry." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134935 (owner: 10Chad) [23:05:28] mwalker: oh, i'm confused then by the comment above < bd808> I think that is realated to our self-hosted puppetmaster [23:05:43] was wondering if everybody gets that error now or not [23:06:16] I was referring to wikitech showing all hosts in beta as having stale puppet status [23:06:42] ahh.ok [23:06:53] I *think* that data comes from the labs puppet master somehow [23:08:32] !log mwalker Started scap: SWAT Update to VisualEditor {{gerrit|134941}} [23:08:36] Logged the message, Master [23:09:59] James_F, fyi ^ [23:10:11] Ta. [23:10:21] bd808, new scap failure for me... [23:10:22] 23:09:52 ['/usr/local/bin/sync-common', 'mw1010.eqiad.wmnet', 'mw1070.eqiad.wmnet', 'mw1161.eqiad.wmnet', 'mw1201.eqiad.wmnet'] on osmium returned [255]: Received disconnect from 10.64.32.146: 2: Too many authentication failures for mwalker [23:10:22] 23:09:54 ['/usr/local/bin/sync-common', 'mw1010.eqiad.wmnet', 'mw1070.eqiad.wmnet', 'mw1161.eqiad.wmnet', 'mw1201.eqiad.wmnet'] on mw1151 returned [255]: Received disconnect from 10.64.16.131: 2: Too many authentication failures for mwalker [23:10:27] (03Abandoned) 10Withoutaname: Remove flaggedrevs-specific user groups from mediawiki.org [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134940 (owner: 10Withoutaname) [23:10:41] mutante: I just synced the deployment-prep (beta) puppetmaster with ops/puppet, forced a run and things look ok there. Not sure what mwalker's root problem is. [23:11:07] bd808: sounds good, cool [23:11:15] mwalker: Ouch. New to me too. [23:11:31] osmium is the 14.04 hhvm testing box [23:11:36] !log mwalker Finished scap: SWAT Update to VisualEditor {{gerrit|134941}} (duration: 03m 04s) [23:11:40] Logged the message, Master [23:11:54] Yay. [23:11:56] I suspect that just left us in an inconsistent state; doing a sync-dif [23:12:02] *sync-dir [23:13:04] mwalker: Seems to work here… [23:13:10] Failed publickey for mwalker [23:13:14] mwalker: I'll logged into osmuim and can run a sync-common there if needed. [23:13:15] there was something with your key [23:13:44] mutante, aye; but why only a couple servers on the cluster? [23:14:10] !log mwalker synchronized php-1.24wmf6/extensions/VisualEditor 'Syncing the extension manually because of scap failures on osium, mw1010, mw1070, mw1161, mw1201, and mw1151' [23:14:14] Logged the message, Master [23:14:34] !log sync-dir failed for osmium and mw1151 [23:14:39] Logged the message, Master [23:14:56] bd808, can you sync-common those two hosts? [23:15:10] mwalker: I'll try.... [23:15:14] heh [23:16:02] (03CR) 10BBlack: "If it helps at all in understanding the patch, the methodology of my change basically went like this:" [operations/debs/pybal] - 10https://gerrit.wikimedia.org/r/134833 (owner: 10BBlack) [23:16:16] mwalker: on osmium you don't own your home [23:16:20] !log Ran sync-common manually on osmium and mw1151 [23:16:22] did you recently get renamed or so? [23:16:25] Logged the message, Master [23:16:34] ah... andrewbogott_afk redid my user id [23:16:41] to match that of what I had in labs [23:16:49] mwalker: i'm pretty sure something went wrong during that [23:16:55] on osmium: [23:17:00] 4.0K -rw------- 1 2454 wikidev 581 Apr 29 23:51 authorized_keys [23:17:17] mutante: same on mw1151 [23:17:19] uid=605(mwalker) gid=500(wikidev) groups=500(wikidev) [23:17:48] Those hosts were probably down or not in the dhs group that andrew used to fix the perms [23:17:56] *dsh [23:18:06] the user still has the old UID [23:18:12] but the files are owned by the new UID [23:18:21] 605 vs 2454 [23:18:33] lemme check what you have in LDAP [23:18:49] yea, 2454 [23:19:38] mwalker: can you try to just login via ssh on osmium [23:20:03] ok, that should be fixed [23:20:07] mutante, I can; it let me in [23:20:39] ok, also fixing mw1151, more? [23:20:59] those two are the only ones that failed [23:21:34] my stuff on the analytics machines might be borked; but I think they removed all my accounts there when they did their most recent cleanup [23:22:17] James_F, in any case; your stuff should now be deployted [23:22:36] !log osmium,mw1151 fixed UID of mwalker (605->2454) [23:22:41] Logged the message, Master [23:23:27] mwalker: so yea, i don't know if you want to repeat it ..but should be fiex [23:23:31] fixed [23:25:24] mutante, given that I can log in to both hosts now; it should be OK [23:26:29] mwalker: Yeah, works for me. Thanks! [23:26:46] shiney [23:26:55] mwalker: yep, agreed. cya later then [23:28:15] (03CR) 10FunPika: Remove flaggedrevs-specific user groups from mediawiki.org (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/134969 (owner: 10Withoutaname) [23:29:24] hey, ops: where can i find real-time load graphs for the image scalers? [23:29:51] cscott: https://ganglia.wikimedia.org/latest/?r=hour&cs=&ce=&s=by+name&c=Image%2520scalers%2520eqiad&tab=m&vn=&hide-hf=false [23:29:53] <^d> ganglia! [23:30:08] my change to use square bounding boxes for default-sized thumbnails went live, like to see if it had an image on the scalers [23:30:16] oh, it did? [23:30:22] gwicke thought it probably wouldn't, since the parsercache still has the old thumbnail sizes [23:30:34] so the imagescalers would only get invoked on page edit/reparse. [23:30:36] They look really bored at the moment [23:30:47] https://ganglia.wikimedia.org/latest/graph_all_periods.php?c=Image%20scalers%20eqiad&m=cpu_report&r=hour&s=by%20name&hc=4&mc=2&st=1400801370&g=cpu_report&z=large [23:31:12] <^d> swift's a little red but I dunno how much of that is normal. [23:32:56] greg-g: oh, maybe it's not. https://en.wikipedia.org/wiki/Special:Version says 1.24wmf5 / 0320cdd is running, but the 0320cdd link takes me to origin/master, not 1.24wmf5 [23:33:33] cscott: if your stuff is anywhere, it's on mw.org [23:33:36] which is wmf6 [23:33:38] as of today [23:34:02] ah, yes, wmf6 has it. [23:34:19] ok. so i can expect my code to go live on the big wikis next wednesday, right? [23:34:26] cscott, greg-g I'm happy to deploy the backport if we want [23:34:47] cscott: tuesday to commons, which I assume will be "the" big one for this? [23:34:52] i've started pre-generating the new thumbnail sizes, although gwicke did convince me that even if i don't it shouldn't really kill the scalers [23:34:53] mwalker: not right now, please [23:34:58] greg-g, heh; k [23:35:00] :) [23:35:02] cscott: everywhere but wikipedias on Tuesday, everywhere on thursday [23:35:08] ok. [23:35:26] So ti will hit commons on Tuesday [23:35:42] cscott: Tuesday at 11am Pacific will be 'go time' for it on commons, will you be around? [23:36:10] that's 2pm eastern. yeah, i'll be around. [23:36:12] but i expect that it will be enwiki, not commons, which will be the big load [23:36:17] <^d> I was going to say. [23:36:24] ah, wasn't sure, honestly [23:36:24] <^d> Commons will probably be pretty boring. They don't actually make that many thumbnails for their own images. [23:36:29] so, yeah, same time on Thursday then :) [23:36:37] since it is wikitext with [[File:Foo.jpg|thumb]] where Foo.jpg is a portrait aspect ratio, which would be regenerated. [23:36:41] * greg-g nods [23:36:44] <^d> Similar to elasticsearch. you *use* commons a lot but it's not commons itself generating the load. [23:36:56] * greg-g nods [23:37:07] cscott: there's prep you want to do before enwiki, right? pregen'ing? [23:37:31] yes, i'm starting that now and it will run over the next week-ish [23:37:41] you have 6.5 days :) [23:38:12] so the question will be -- (a) how far through all the images have we got by then, and (b) does it really matter, since it's only edited pages which will immediately see the new sizes. [23:38:14] or we can back it out of wmf6 and do an explicit backport for it when it's ready [23:38:39] yep. check in with me early next week after the weekend and i should have a good idea about progress. [23:38:52] cscott: I was going to tell you to do the check in with me :) [23:39:03] but ok, I'll add to my list ;) [23:39:34] greg@x200s:~$ task add "check in with cscott re status of thumbnail pregening" proj:wmf.deploy pri:H due:tuesday [23:40:31] i'm disappointed that the 'task' command appears not to use GNU getopt [23:42:45] taskwarrior is the shiz [23:42:56] you should /join #taskwarrior [23:44:07] <^d> siri...remind me to talk to greg-g about task management. [23:44:08] <^d> [23:46:02] * bd808 stick a post-it on his desk [23:46:06] Time to get Siri-ous about task management [23:46:17] (the marketing PRACTICALLY WRITES ITSELF) [23:46:42] siri: slap marktraceur with a trout [23:47:07] !trout [23:47:25] Aw man, all the cool channels have a !trout command. [23:47:44] marktraceur: see the topic :) [23:47:54] Oh right. [23:50:03] cscott@x201s:~$ echo "echo talk to greg-g | wall" | at teatime next tue [23:50:55] Haha [23:56:34] (03PS1) 10Mwalker: Move OCG default port to 8000 [operations/puppet] - 10https://gerrit.wikimedia.org/r/134975 [23:59:06] bd808, I don't know if you have further time you dont know what to do with; but if you to use it on +2 and syncing ^ that would be useful. It's not urgent though; I can override the variable in a local config file [23:59:30] it's just unpleasant