[00:04:24] PROBLEM - Check status of defined EventLogging jobs on vanadium is CRITICAL: CRITICAL: Stopped EventLogging jobs: consumer/mysql-db1047
[00:06:41] Is there an issue with lists.wikimedia.org? Trying to change a list's 'owner' field and mailman is timing out on the request…
[00:07:08] James_F: try Firefox instead of Chrome
[00:07:23] mutante: Really? Wow.
[00:07:38] yes, Chrome timeout
[00:07:41] black magic
[00:07:53] i dont even know what to blame, but it only happens with that :p
[00:07:56] Yeah, it works.
[00:07:58] * James_F sighs.
[00:10:12] mutante: Thanks hugely, BTW. :-)
[00:13:06] James_F: np! i guess you recently created a bunch of lists since THO was gone?
[00:13:47] mutante: No, Jamesofur did, but I was asked in a personal capacity to wield a ban-hammer.
[00:14:08] got it!
[00:14:42] (yes, sadly chrome and mailman do not get along)
[00:14:45] James_F: when they ask about spam https://wikitech.wikimedia.org/wiki/Mailman#Fighting_spam_in_mailman
[00:14:54] mutante: yeah, they were getting backed up and so I went 'PSH'
[00:15:17] thank you, i actually had a message from THO about them but couldnt find the tickets
[00:15:22] mutante: Nice.
[00:20:03] * AaronSchulz strangles http://osdir.com/ml/repo-discuss/2013-07/msg00088.html
[00:30:34] PROBLEM - Puppet freshness on labstore1001 is CRITICAL: Last successful Puppet run was Tue 11 Mar 2014 08:47:37 PM UTC
[00:32:44] springle, looks like s1-analytics is down now. "ERROR 2003 (HY000): Can't connect to MySQL server on 's1-analytics-slave.eqiad.wmnet' (111)"
[00:32:58] should I just keep waiting?
[00:33:03] halfak: yep :)
[00:33:08] OK.
[00:33:13] trying something
[00:35:28] halfak: should be back. the txn and locks are gone, but undo list must still be processed. would be good not to hammer the box for a day or so
[00:35:51] so at least event logging can continue and replication start to catch up
[00:36:26] Thanks. I'll spread the word.
[00:39:04] springle, once lag is caught up, should it be good to take the regular abuse?
[00:42:27] halfak: yep. worth rethinking those leizi_temp* tables though; at least drop the ORDER BY, and consider breaking it up into batches
[00:43:36] Oh yeah. Not sure what was up with those. Will have a look. :)
[00:44:34] maybe even use ENGINE=ARIA (like MyISAM) for the temp tables. would make it easier to recover if a transaction wasn't logging masses of history for rollback which could be as simple as dropping a few temp tables
[02:13:12] !log LocalisationUpdate completed (1.23wmf17) at 2014-03-18 02:13:11+00:00
[02:16:40] morebots is dead again
[02:17:09] Likely the Labs maintenance.
[02:17:47] andrewbogott: morebots? :-)
[02:17:56] I see grrrit-wm is back, yay.
[02:18:48] Gloria: I think all bets are off while the bots migrate to ashburn
[02:19:16] andrewbogott: Just as long as we eventually log the entries in the server admin log.
[02:19:29] Can always do it by hand while morebots is down. :-)
[02:19:38] grrrit-wm seems connected, but quiet.
[02:24:11] !log LocalisationUpdate completed (1.23wmf18) at 2014-03-18 02:24:11+00:00
[02:56:18] !log LocalisationUpdate ResourceLoader cache refresh completed at Tue Mar 18 02:56:15 UTC 2014 (duration 56m 14s)
[03:16:24] bd808|BUFFER: ori yay, the directory permissions fixed the l10n update :)
[03:31:34] PROBLEM - Puppet freshness on labstore1001 is CRITICAL: Last successful Puppet run was Tue 11 Mar 2014 08:47:37 PM UTC
[03:38:53] greg-g: :)
[04:00:05] ori: Do you know anything about the 'eventlogging' labs project?
[04:00:12] Specifically, whether it's good for anything anymore?
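Editor's note: a minimal sketch of the batching springle suggests at 00:42:27, assuming the temp-table build can be split on an integer key. The table, source, and key names are hypothetical and do not come from the log; the statements are only built as strings here, not executed.

```python
def id_batches(min_id, max_id, batch_size):
    """Yield inclusive (start, end) ID ranges covering [min_id, max_id]."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    start = min_id
    while start <= max_id:
        end = min(start + batch_size - 1, max_id)
        yield start, end
        start = end + 1


def batch_statements(table, source, key, min_id, max_id, batch_size):
    """Build one INSERT ... SELECT per ID range, deliberately without ORDER BY."""
    template = (
        "INSERT INTO {table} SELECT * FROM {source} "
        "WHERE {key} BETWEEN {lo} AND {hi}"
    )
    return [
        template.format(table=table, source=source, key=key, lo=lo, hi=hi)
        for lo, hi in id_batches(min_id, max_id, batch_size)
    ]
```

Running each statement as its own short transaction keeps the undo history per transaction small, which is the recovery concern raised at 00:44:34.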
[04:01:28] there's an eventlogging setup on the beta cluster that is the canonical place for working on eventlogging in labs, and i'm almost certain that the dedicated eventlogging project is therefore a dupe
[04:01:30] but let me make sure
[04:02:13] thanks
[04:39:58] andrewbogott_afk: I don't think I see an eventlogging labs project
[06:18:14] PROBLEM - HTTP 5xx req/min on tungsten is CRITICAL: CRITICAL: reqstats.5xx [crit=500.000000
[06:32:34] PROBLEM - Puppet freshness on labstore1001 is CRITICAL: Last successful Puppet run was Tue 11 Mar 2014 08:47:37 PM UTC
[06:48:14] PROBLEM - SSH on lvs1001 is CRITICAL: Server answer:
[06:51:14] RECOVERY - SSH on lvs1001 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0)
[06:55:14] PROBLEM - SSH on lvs1001 is CRITICAL: Server answer:
[06:56:14] RECOVERY - SSH on lvs1001 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0)
[07:32:14] RECOVERY - HTTP 5xx req/min on tungsten is OK: OK: reqstats.5xx [warn=250.000
[08:33:23] !log 5xx resps spiked between 6:15 and 6:35 UTC; lvs1001 SSH check flapped between 6:48 and 6:56 UTC.
[08:33:29] Logged the message, Master
[08:47:00] aude, Tobi_WMDE: 2014-03-18 08:44:30 mw1196 wikidatawiki: [588a2739] /w/api.php?action=wbgetentities&props=sitelinks&format=json&ids=Q18559 Exception from line 98 of /usr/local/apache/common-local/php-1.23wmf17/extensions/Wikidata/extensions/Wikibase/lib/includes/store/sql/WikiPageEntityLookup.php: No such revision found for Q18559: 116886079
[08:47:38] ori: so that revision has just vanished?
[08:48:03] maybe it was oversighted?
[08:48:23] dunno, I'm on my way to bed, not brain power left to debug :)
[08:48:32] *no
[08:48:42] * Jasper_Deng wants to high-five ori w/ respect to that but can't atm :/
[08:49:10] can't high-five?
[08:49:14] ori: thx! will tell it to the team once they show up..
[08:49:40] Tobi_WMDE: np, thank you
[08:50:28] good night / day! *waves*
[08:57:06] ah ori heading to bed
[08:57:07] that is good
[08:58:59] ori: b/c I can't go to bed atm
[08:59:01] >.<
[09:02:59] ori: https://bugzilla.wikimedia.org/show_bug.cgi?id=62547
[09:11:18] aude: any idea how I can pull from the puppet-jenkins repo on the new jenkins instance?
[09:12:08] sorry, wrong channel
[09:33:34] PROBLEM - Puppet freshness on labstore1001 is CRITICAL: Last successful Puppet run was Tue 11 Mar 2014 08:47:37 PM UTC
[09:57:55] apergos: please https://gerrit.wikimedia.org/r/#/c/119212/1
[10:27:10] matanya: sorry, doing cleanup from yesterday, it's going to be awhile...
[10:37:34] hashar: i don't understand what you are trying to do
[10:38:46] matanya: on the beta cluster, I need the syslog-ng central server to write to /data/project/syslog instead of /home/wikipedia/syslog
[10:38:46] :D
[10:39:01] that part i got
[10:39:10] you can just add a simple variable
[10:39:31] and I need a role as well
[10:39:53] if ::realm = production write syslog to foo, else to bar
[10:40:27] why do you need a role?
[10:41:49] cause I had previous complaints about adding if realm / else
[10:41:59] so I add some new roles
[10:42:16] though that probably depends on the ops ultimately merging the change
[10:47:55] i see
[11:29:04] akosiaris: quick question: in modules/authdns/manifests/init.pp there is a variable ipaddress, is it equal to ::ipaddress ?
[11:29:51] no, you can clearly see that the class has a parameter named ipaddress at the top
[11:30:28] ah how nice. A local scope variable shadowing a global one ?
[11:31:11] it's not shadowing it, you're not supposed to use the global one unprefixed anyway :)
[11:31:24] unprefixed ?
[11:31:34] unscoped
[11:32:02] aah. Yeah in puppet3, but in puppet2 it is shadowing it
[11:32:38] ?
[11:32:45] since when puppet sees the $ipaddress it will start searching every scope up to global
[11:32:56] and throwing errors
[11:32:58] which was bad bad bad and is no longer happening
[11:33:01] er, warnings
[11:33:07] yeah yeah sure
[11:33:21] but technically it is shadowing, that is what i wanted to point out
[11:33:25] you're not supposed to use $ipaddress to fetch the global variable even with puppet 2.7
[11:47:24] Puppet's starting to show bad heritage, just like PHP:P
[11:48:31] anyway akosiaris and paravoid i guess the best way would be to rename that variable, correct?
[11:48:56] * matanya got into every edge case of puppet in this puppet 3 migration project :)
[11:49:13] why?
[11:49:17] it is not technically needed. Scoping and all seems to be correct
[11:49:18] it works fine
[11:49:20] it will work fine
[11:49:34] Dynamic lookup of $ipaddress is deprecated. Support will be removed in Puppet 2.8. Use a fully-qualified variable name (e.g., $classname::variable) or parameterized classes.
[11:49:47] i would rename it simply for git grep to have better output
[11:49:50] but it is not needed
[11:50:27] matanya: is that warning for authdns?
[11:50:48] that is the only left call to ipaddress i see when grepping
[11:51:07] except site.pp
[11:51:17] which i changes locally to test
[11:51:20] *d
[11:52:05] I see a bunch more
[11:52:22] like manifests/realm.pp
[11:52:28] also site.pp would trigger this error
[11:52:46] realm is also fixed locally
[11:53:03] have you reproduced this warning locally?
[11:53:33] i have, but not sure it is related to authdns
[11:53:48] since puppet doesn't bother to tell me *where* is the problem
[11:53:53] if you have reproduced it, try modifying authdns and then retrying?
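Editor's note: since the warning does not say where the dynamic lookup happens, one way to narrow it down is to scan manifests for unqualified uses of `$ipaddress`. The sketch below is a rough, regex-based stand-in for the `git grep` mentioned at 11:49:47; it is not a Puppet parser. It also flags legitimate class parameters named `$ipaddress` (as in authdns), so every hit still needs a manual look.

```python
import re

# Matches $ipaddress or ${ipaddress}, but not $::ipaddress,
# $authdns::ipaddress, or longer names such as $ipaddress6.
UNSCOPED = re.compile(r"\$\{?ipaddress\b")


def find_unscoped(text):
    """Return (line_number, line) pairs that use an unqualified $ipaddress."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        # Crude comment handling: drop everything after the first '#'.
        code = line.split("#", 1)[0]
        if UNSCOPED.search(code):
            hits.append((number, line.strip()))
    return hits
```

To use it, read each `.pp` file's contents and pass the text to `find_unscoped`; a hit on a class parameter declaration is the kind of false positive discussed in the channel.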
[11:53:57] yeah, it's annoying
[11:54:01] good point
[11:54:05] i'll try that
[11:54:06] it could be a false positive too
[11:56:18] and now i can't reproduce
[11:56:23] sigh
[12:00:33] matanya: thanks for all the reviews :-]
[12:01:21] my pleasure
[12:03:14] PROBLEM - HTTP 5xx req/min on tungsten is CRITICAL: CRITICAL: reqstats.5xx [crit=500.000000
[12:05:22] lunch
[12:05:38] the old tools login is throwing me out on login... brilliant
[12:05:49] bon appetite
[12:32:27] matanya: I'd also use suggestions :) https://gerrit.wikimedia.org/r/119216
[12:33:40] it doesn't even build Nemo_bis
[12:34:16] Nemo_bis: you are missing } in line 26
[12:34:34] PROBLEM - Puppet freshness on labstore1001 is CRITICAL: Last successful Puppet run was Tue 11 Mar 2014 08:47:37 PM UTC
[12:35:49] MZMcBride: Is there a way to get a statistic how many users use the math rendering option source png mathjax and default
[12:36:08] matanya: I know but that's not the main problem
[12:36:12] The historic values would be interesting as well
[12:36:18] physikerwelt: Is that a user preference?
[12:36:23] it is hoo
[12:36:23] yes
[12:36:35] Nemo_bis: what is your main problem ?
[12:37:09] matanya: see inline comments
[12:37:13] ok
[12:37:20] I found a script https://en.wikipedia.org/wiki/Wikipedia:Database_reports/User_preferences/Configuration that generates user preference statistics but does not include the math rendering option
[12:37:58] contact the author physikerwelt
[12:38:40] physikerwelt: that user preference uses numerical values? Yikes...
[12:38:53] yes 1-7
[12:38:56] I can probably get you numbers if you tell me which wiki you're interested i
[12:38:57] n
[12:39:57] physikerwelt: ^
[12:40:03] I would be interested in the distribution of the numbers
[12:40:45] hoo: I would be interested in the distribution of the numbers
[12:40:59] physikerwelt: On which wiki?
[12:41:18] enwiki would be sufficient
[12:41:33] can you differentiate between 0 and NULL
[12:42:04] 0 means PNG
[12:44:24] physikerwelt: Ok, back with irssi now... my pidgin went crazy
[12:45:30] physikerwelt: http://fpaste.org/86302/39514671/
[12:46:11] thank you. Is SPACE = null
[12:47:07] no, empty string
[12:49:00] caffeineexcess: ok thank you. Thus if users don't change the default setting there will be no entry
[12:49:28] That's how the preferences are supposed to work :)
[12:49:59] Nemo_bis: not very familiar with this stuff, what is it all about?
[12:50:25] caffeineexcess: I'd like to mention you if I publish these numbers unfortunately I don't remember your name. Should I refer to user hoo man?
[12:50:37] physikerwelt: Yeah :)
[12:50:47] Pidgin seems to also work again...
[12:50:49] ok thank you
[12:50:55] wonder where these malloc failures came form
[12:50:57] * from
[12:54:19] matanya: it's about what the commit message says :)
[12:54:43] Do you know what mediawiki-vagrant is? It's used by MediaWiki devs for local machines for testing, and also for quick virtual server setups
[12:57:55] i do know that Nemo_bis
[12:58:08] i'm not familiar with the setting and configs
[13:18:30] matanya: those are not important, they can be argued over later by MediaWiki devs :)
[13:19:04] ok
[13:19:14] RECOVERY - HTTP 5xx req/min on tungsten is OK: OK: reqstats.5xx [warn=250.000
[13:26:45] hoo: another question. Can I determine how many users did not set anything. Is that the rest to 20 954 494
[13:27:18] or are just the active users taken into account 129,542
[13:27:56] I'm wondering that most common setting is MODERN which has been marked as deprecated in 2011
[13:28:12] and is no longer selectable from the ui
[13:29:02] physikerwelt: All other users don't have that setting set, yes
[13:29:07] What?? https://integration.wikimedia.org/ci/job/operations-puppet-validate/12493/console
[13:29:28] so that would be most of the almost 21 million
[13:29:51] I guess so, yes
[13:30:39] ugh, spurious self-pasting things
[13:31:27] !log hoo synchronized wmf-config/InitialiseSettings.php 'Enable GuidedTours on Wikidata'
[13:31:33] Logged the message, Master
[13:35:16] ottomata: http://www.debian.org/doc/debian-policy/ch-docs.html#s-copyrightfile
[13:36:01] akosiaris: because it's your week :) I think https://gerrit.wikimedia.org/r/117250 is ready
[13:36:12] ottomata: http://dep.debian.net/deps/dep5/#license-specification (see syntax for example)
[13:37:09] that's what I meant by "use common licenses"
[13:37:42] hoo|irssi: Thank you
[13:38:17] You're welcome :)
[13:38:34] RECOVERY - Squid on carbon is OK: TCP OK - 0.000 second response time on port 8080
[13:45:46] AzaToth: ok cool, I was able to shorten the copyright file by referencing BSD and LGPL-2.1
[13:46:22] yea, but you should include the "header" though
[13:46:30] for the LGPL one
[13:47:19] ha, it was not clear what the header is
[13:47:23] that one is really verbose
[13:47:39] is the header the preamble?
[13:47:52] I usually copy /usr/share/debhelper/dh_make/licenses/lgpl2
[13:47:54] yea
[13:48:06] oh great
[13:48:07] thanks
[13:49:21] of course if it's a common license + exception, you can combine it as it was shown in the example
[14:10:22] hashar: I think your proxy issue is resolved :-)
[14:10:50] akosiaris: carbon is back up ? :-]
[14:10:57] yes
[14:11:01] that was fast!
[14:11:34] carbon is easy to find ;)
[14:14:43] squid3 eh?
[14:14:56] (I was looking at it when I saw someone removing the package... I figured it was you)
[14:15:24] akosiaris: you can close https://rt.wikimedia.org/Ticket/Display.html?id=7069 :]
[14:15:29] ah it is closed already
[14:16:58] :-)
[14:29:44] !log Create EducationProgram and Translate tables on legalteamwiki
[14:29:49] akosiaris: regarding the syslog-ng puppet changes, do you want to schedule that ahead of time?
[14:29:49] Logged the message, Master
[14:29:58] akosiaris: since it might cause issues on nfs1/nfs2 servers
[14:30:31] I haven't had the time to look at the change yet. I 'll go through it and let you know
[14:31:47] great :
[14:32:37] """'Calling package builder 'cowbuilder' on distributions 'lucidprecise'""" *sigh*
[14:32:44] poor Lucid
[14:34:10] cmjohnson1: ok i guess I should take an18 offline?
[14:34:34] ottomata: no don't, I am not there at that moment.
[14:34:47] got hung up with getting inventory from Tampa
[14:35:01] ok cool
[14:35:12] didn't put it on my calendar
[14:35:36] ottomata: wanna do after lunch?
[14:36:30] err, today is a bad day for that, there are a bunch of analytics showcase meetings and I leave early on tuesdays to go teach
[14:36:37] tomorrow?
[14:37:18] !log reedy synchronized wmf-config/
[14:37:23] Logged the message, Master
[14:38:41] ottomata: tomorrow is better
[14:38:50] k cool
[14:44:04] PROBLEM - Varnishkafka Delivery Errors on cp3020 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 382.833344
[14:46:24] heyyYYYyyyy shh
[14:48:34] PROBLEM - Varnishkafka Delivery Errors on cp3019 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 983.700012
[14:49:03] :-@
[14:51:04] RECOVERY - Varnishkafka Delivery Errors on cp3020 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[14:51:14] manybubbles, MaxSem: Looks like no changes for this morning's SWAT window.
[14:51:45] there were some cirrus changes
[14:51:50] but it looks like someone moved them?
[14:52:23] https://wikitech.wikimedia.org/w/index.php?title=Deployments&diff=104947&oldid=104772
[14:52:52] hey
[14:52:53] yep
[14:53:25] I don't think those cirrus changes were deployed yet
[14:53:29] I mean, no one asked me about them
[14:56:10] manybubbles: oh, I thought you did those yesterday morning
[14:56:19] manybubbles: I thought you just put them on the wrong day
[14:56:28] my bad
[14:56:45] manybubbles: the email thread confused me.
[14:57:02] ah, sorry! yeah, I scheduled them for this morning
[14:57:25] no worries, happens, thank you, go forth!
[14:57:27] paravoid, any word on that unbalanced networking stuff to/from esams/eqiad?
[14:58:04] PROBLEM - Varnishkafka Delivery Errors on cp3020 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 85.73333
[14:58:34] RECOVERY - Varnishkafka Delivery Errors on cp3019 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[15:01:13] who today?
[15:01:41] ottomata: no, maybe mark wants to have a look instead? :)
[15:03:07] hi
[15:03:21] greg-g: manybubbles we are deploying a config change
[15:03:33] uh?
[15:03:57] looks like guided tour was already done
[15:04:01] !log reedy updated /a/common to {{Gerrit|I72a7751f8}}: Let AbuseFilter block users on Spanish Wikivoyage
[15:04:06] Logged the message, Master
[15:04:18] manybubbles: really?
[15:04:28] that is what https://wikitech.wikimedia.org/wiki/Deployments says
[15:05:09] Yeah, already did that earlier
[15:05:17] oh
[15:05:30] Didn't see that one... mh
[15:05:45] what's SWAT anyway... did we move over to killing people? :P
[15:06:08] probably good to have these on the calendar, but ok this time
[15:06:32] nobody told me that we scheduled that, so I wasn't aware and just used a silent spot
[15:06:39] that's fine
[15:07:42] hoo: aude , just fyi (you did nothing wrong), but when asked if guidedtours on wikidata was ok for a SWAT deploy, I responded thusly:
[15:07:45] 10:55 < greg-g> anomie: yeah, good question. That one is a blurry line (since it was enabled other places, and was on test wikidata already). I almost blocked it last night when I saw it, but... didn't. I think it's fine.
[15:07:59] aude: hoo just so you get a feel of what the line is :)
[15:08:17] it is blurry
[15:08:23] * greg-g nods
[15:08:49] anyway, look forward to the tours on wikidata :)
[15:08:58] well tested on testwikidata
[15:09:00] Yep, that's the main thing :)
[15:09:12] we tested on both testwikidata and beta labs
[15:09:13] and overwhelming consensus on project chat
[15:10:03] yeah, all fine, but if it was a new extension, it'd need a separate window so it gets announced correctly
[15:10:06] :)
[15:11:18] Oh sure
[15:11:59] anywho, that's all!
[15:14:06] greg-g: anyhoo
[15:15:04] Reedy: :P
[15:17:46] hoo: oh, wait, when did you do this? I thought during a SWAT window? Next time SWAT window please :)
[15:20:11] greg-g: what's that SWAT thing after all? It says I'm not even allowed to deploy there as I'm not on that list, right?
[15:20:22] we can deploy
[15:20:23] list of people
[15:20:32] it's just there are more experienced people around at the time
[15:20:43] i would just list our thing and the person who is deploying
[15:20:55] * aude should have poked you that i put it on there
[15:21:03] bah, I guess I just emailed engineering@, you're on ops@ list right, hoo?
[15:21:10] https://wikitech.wikimedia.org/wiki/SWAT_deploys
[15:21:12] greg-g: Yep, only on ops
[15:21:12] greg-g: you emailed us
[15:21:17] * aude got email
[15:21:48] ah, it was in "Roadmap and deployment highlights - week of March 17th" on wikitech
[15:22:19] yeah, I'll be sure to cc ops@ about it in the future, too.
[15:22:34] PROBLEM - Varnishkafka Delivery Errors on cp3019 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 787.799988
[15:22:45] when I first started, I assumed/was told ops was a subset of engineering@, so I didn't always email both
[15:22:59] but, since we have more non-WMF deployers now who are on ops@... no longer true!
[15:23:01] mh, maybe I'm supposed to be on engineering as well
[15:23:24] * aude thinks it's a staff list
[15:23:36] ops@ is the requirement for all deployers, WMF or not, so the original assumption was probably that they already were on engineering@, since they were a WMF engineer
[15:23:50] yeah, it's pretty quiet
[15:24:09] purpose is staff only, I believe /me looks at archives
[15:24:31] so, who is on point for swat today?
[15:24:35] greg-g: ^?
[15:24:48] i think no one
[15:24:59] nothing today since ours is done
[15:25:02] manybubbles: who ever in the list of names steps up :)
[15:25:12] so, me then?
[15:25:18] guess so!
[15:26:21] hoo: aude yeah, engineering is mostly "here's this meeting on TDD on this date" and "don't forget to fill out the form to see who goes to the hackathon" and such
[15:27:04] RECOVERY - Varnishkafka Delivery Errors on cp3020 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[15:27:37] * manybubbles has the conch
[15:29:04] James_F|Away: btw https://bugzilla.wikimedia.org/show_bug.cgi?id=56842
[15:33:04] PROBLEM - Varnishkafka Delivery Errors on cp3020 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 125.133331
[15:33:34] RECOVERY - Varnishkafka Delivery Errors on cp3019 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[15:35:14] !log manybubbles synchronized php-1.23wmf18/extensions/CirrusSearch/
[15:35:21] Logged the message, Master
[15:35:34] PROBLEM - Puppet freshness on labstore1001 is CRITICAL: Last successful Puppet run was Tue 11 Mar 2014 08:47:37 PM UTC
[15:36:19] mutante|away: Thanks.
[15:36:25] * manybubbles puts the conch down
[15:37:14] * greg-g wipes the spit off the conch
[15:46:49] ah puppet killed me :-(((
[15:47:51] * ^d revives hashar
[15:52:09] hiiiii
[15:52:25] ^d: just in time, I got a question for you. If I am to report hhvm issue should I do them upstream ?
[15:52:37] or would it make sense to create a hhvm component in our bugzilla?
[15:52:45] <^d> hashar: We've got a keyword!
[15:53:08] ^d: are you actually triaging it ? :-]
[15:53:36] will fill in bugs under Wikimedia > continuous integration and mark them
[15:53:51] <^d> I'll at least look at it :)
[15:53:58] <^d> And see if an upstream bug needs filing.
[15:55:30] ^d: also you might want to enroll ebernhardson on the hhvm front. He proposed at least one hhvm configuration change for mw/core :]
[15:58:34] PROBLEM - Varnishkafka Delivery Errors on cp3019 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 855.56665
[16:03:04] RECOVERY - Varnishkafka Delivery Errors on cp3020 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[16:03:22] paravoid: (and Jeff_Green's ghost): after discussion with the apparent sender I looked more closely at the headers and http://lists.wikimedia.org/pipermail/wikimania-l/2014-March/005536.html looks spoofed. any thoughts about that? idk what yahoo supports but are we doing any checks on the way to mailman? (dkim/spf/etc.)
[16:04:04] we can contact the guilty relay to fix the compromised account. but then presumably they'll find a new relay
[16:04:54] ^d: https://bugzilla.wikimedia.org/show_bug.cgi?id=62787 and https://bugzilla.wikimedia.org/show_bug.cgi?id=62788 . You are on CC and I pushed you back an email by Timo from Sunday :-]
[16:05:06] i was considering telling him to try to use a different address for posting. (there are other addresses he's already regularly using. idk how he chooses which to use for a given mail)
[16:09:34] RECOVERY - Varnishkafka Delivery Errors on cp3019 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[16:10:04] PROBLEM - Varnishkafka Delivery Errors on cp3020 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 136.666672
[16:13:42] basically, https://xkcd.com/1254/
[16:21:10] grrrit-wm is broken
[16:21:11] fwiw
[16:24:34] paravoid: ahhhh tabs :)
[16:25:40] it's amazing how useful that (grrrit-wm) was for me
[16:26:07] greg-g: he will be back
[16:26:15] and i agree it is useful
[16:32:34] PROBLEM - Varnishkafka Delivery Errors on cp3019 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 714.466675
[16:37:06] matanya: tabs because the file had tabs already and my changeset wasn't whitespace-related
[16:38:04] i disagree, since it is a add and not rename, and i would like if you can fix it but i'm out. bye
[16:39:03] it is a rename
[16:39:04] RECOVERY - Varnishkafka Delivery Errors on cp3020 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[16:40:05] akosiaris: https://gerrit.wikimedia.org/r/#/c/118794/4/manifests/admins.pp
[16:42:34] RECOVERY - Varnishkafka Delivery Errors on cp3019 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[16:45:04] PROBLEM - Varnishkafka Delivery Errors on cp3020 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 139.899994
[16:47:52] akosiaris, thanks for rt 6961!
[16:48:13] gwicke: you are welcome :-)
[16:54:33] Someone broke the sudoers.
[16:54:47] Is anyone on it?
[16:54:55] more info ?
[16:55:24] i merged a sudo change not so long ago
[16:55:34] but only on wtp*
[16:57:15] its what is emailing
[16:57:27] all the wtps are sending security warning emails, but only those
[16:57:33] oh, and tin.
[16:57:42] (but thats different thing)
[16:58:00] akosiaris: "parse error near line 5"
[16:58:01] i don't think so
[16:58:07] I just saw it
[16:58:11] damn cron emails..
[16:58:34] PROBLEM - Varnishkafka Delivery Errors on cp3019 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 3952.133301
[16:58:52] Gah, there's a tab at the beginning of lines.
[16:59:02] yeah
[16:59:06] it does not matter
[16:59:18] other files have it too
[16:59:26] look into it since I broke it
[17:00:04] RECOVERY - Varnishkafka Delivery Errors on cp3020 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[17:00:34] RECOVERY - Varnishkafka Delivery Errors on cp3019 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0
[17:03:12] akosiaris: space after NOPASSWD: ?
[17:03:22] fully path to command
[17:03:24] and space
[17:03:30] full*
[17:06:45] Yeah, I just committed.
[17:07:12] Coren: you beat me to it
[17:07:14] thanks
[17:07:41] https://gerrit.wikimedia.org/r/#/c/119308/ if you want to quick +2
[17:08:27] merged and puppet-merged
[17:09:46] also triggered a puppet run through salt
[17:09:58] * Coren hopes google won't throttle his inbox again. :-)
[17:10:02] ahahahaha
[17:10:10] I had forgotten about that
[17:10:41] I didn't. It took /days/ to recover.
[17:11:11] arg, rebase number 1000
[17:12:07] amending, git review, and having a different commit message :p
[17:14:46] ! [remote rejected] HEAD -> refs/publish/production (change 119308 closed)
[17:14:54] haha..ok
[17:15:46] so how do i fix this now...arg
[17:16:53] !log Rebuilding GeoData index
[17:16:59] Logged the message, Master
[17:21:45] 'Cannot assign to variables in other namespaces
[17:22:01] ooooh man, this is so annoying, trying to fix one error and you get the other ones
[17:33:58] YuviPanda: +1 for OTRS instead of Google
[17:34:12] mutante|away: +1 on the thread? :)
[17:34:14] PROBLEM - MySQL Processlist on db1019 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 0 copy to table, 70 statistics
[17:35:14] RECOVERY - MySQL Processlist on db1019 is OK: OK 0 unauthenticated, 0 locked, 0 copy to table, 4 statistics
[17:37:29] YuviPanda: i thought just replying with a +1 and no other content is discouraged?:)
[17:39:26] mutante|away: really ? can I bash people with that ?
[17:39:41] akosiaris: yes, but you can't just say -1 either:)
[17:40:01] i replied anyways saying "i like how we use OTRS instead of Google" heh
[17:40:36] otrs instead of google ?
[17:41:08] [WikimediaMobile] Apps Feedback Email Addresses
[17:41:36] LeslieCarr: noc@wm didn't work out that well i think
[17:42:02] haha
[17:43:10] mark, instances running on virt1007 are suffering some kind of drastic network failure. Can you help me debug?
[17:43:34] * andrewbogott waves to LeslieCarr
[17:44:40] hi andrewbogott :)
[17:53:00] cmjohnson1: Are you in the DC right now?
[17:53:10] I am not...what's up?
[17:53:46] ah, eth1 is totally dead on one of the labs hosts. Don't know yet if it's hardware or software.
[17:53:55] Would ask you to re-plug in the network cable if you were there
[17:54:05] But, I should learn more before you make a special trip
[17:54:21] which labs host?
[17:56:03] virt1007
[17:59:57] ori: <<[enwiki] Sample size 1000: 8611129/8904041 96.710347582631% (igbinary/php)>>
[18:00:01] * AaronSchulz chuckles
[18:00:45] * AaronSchulz tries 5000
[18:01:05] greg-g: I'm ready to prep tin for group1 to 1.23wmf18 if the flight deck is clear
[18:01:16] <^d> AaronSchulz: What's giving you that?
[18:01:17] that is gzipping both of course
[18:01:19] * ^d wants to see :)
[18:01:30] where's the deploy bot
[18:01:36] .clear
[18:01:41] mutante|away: :P
[18:01:44] bd808|deploy: all clear
[18:01:53] * bd808|deploy starts doing stuff
[18:02:07] (good job lining up your nick character numbers, mutante|away and bd808|deploy )
[18:02:49] .summon SWAT
[18:04:23] * bd808|deploy created https://gerrit.wikimedia.org/r/119328
[18:05:47] wowiki no, wowikiquote yes ?
[18:06:37] mutante|away: group1 is "everything except 'pedias"
[18:06:51] aha
[18:07:44] https://gerrit.wikimedia.org/r/#/c/119294/
[18:09:32] Reedy: your patch is better than mine.
[18:10:01] How did you get the docroots to switch?
[18:11:19] ./multiversion/updateWikiversions all.dblist php-1.23wmf18
[18:11:19] ./multiversion/updateWikiversions wikipedia.dblist php-1.23wmf17
[18:11:19] ./multiversion/updateWikiversions group0.dblist php-1.23wmf18
[18:11:20] paravoid: If you're up, can you help me double-check a problem before cmjohnson1 drives to the datacenter?
[18:11:30] or mark if you're not at dinner
[18:11:31] i got it
[18:11:40] we literally both got out of a meeting, so shoot :)
[18:11:45] *just
[18:11:56] yes
[18:12:00] what's up?
[18:12:08] i have it...checking a port on a switch
[18:12:11] Um… no carrier for eth1 on virt1007.
[18:12:19] But sounds like cmjohnson1 is on top of it. So, nm!
[18:12:53] Reedy: weird those are basically the same commands I used
[18:13:10] then just a commit -a
[18:13:23] But I didn't get a change for the bits… Or did I just not look for them :/
[18:13:46] I bet I didn't look for anything other than the wikiversions.json changes
[18:13:50] I did it directly on tin
[18:13:54] Not sure if that actually makes any difference
[18:14:40] Reedy: Would you sync-dir the bits changes before or after the version bump?
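Editor's note: the three `updateWikiversions` commands at 18:11:19 rely on ordering, with each later command overriding the earlier ones for the wikis it lists. The sketch below only models that ordering; it is not the real script. The dblist contents are hypothetical and heavily trimmed, including the assumption that a group0 wiki also appears in `wikipedia.dblist`.

```python
def apply_updates(updates, dblists):
    """Apply (dblist, version) updates in order; later ones override earlier ones."""
    versions = {}
    for dblist, version in updates:
        for wiki in dblists[dblist]:
            versions[wiki] = version
    return versions


# Hypothetical dblist contents, for illustration only.
DBLISTS = {
    "all.dblist": ["enwiki", "wowiki", "wowikiquote", "testwiki"],
    "wikipedia.dblist": ["enwiki", "wowiki", "testwiki"],
    "group0.dblist": ["testwiki"],
}

# Same order as the commands pasted in the channel.
UPDATES = [
    ("all.dblist", "php-1.23wmf18"),
    ("wikipedia.dblist", "php-1.23wmf17"),
    ("group0.dblist", "php-1.23wmf18"),
]

RESULT = apply_updates(UPDATES, DBLISTS)
```

Under these assumptions the Wikipedias stay on 1.23wmf17 while everything else, plus group0, ends up on 1.23wmf18, which matches the "wowiki no, wowikiquote yes" exchange.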
[18:14:55] after I guess
[18:15:28] 5000, still 96%
[18:15:47] andrewbogott, cmjohnson1: confirmed, no cable (or faulty cable) plugged between virt1007 & asw-b-eqiad
[18:16:26] paravoid: yep..thx
[18:16:29] paravoid: ok, thanks
[18:16:48] andrewbogott. will head out in a few
[18:17:08] mark if you were disappointed at not having a networking thing to look into, I've got one for youuuuu :p
[18:17:08] ^d: http://pastebin.com/uB2D8pPU
[18:17:10] thanks
[18:17:20] * andrewbogott has guilt about making cmjohnson1 drive in DC traffic
[18:17:21] !log bd808 rebuilt wikiversions.cdb and synchronized wikiversions files: group1 to 1.23wmf18
[18:17:22] ottomata: yeah, the speed issue :)
[18:17:26] Logged the message, Master
[18:17:28] (i've been looking at network things all day though ;p)
[18:17:33] ^d: eval.php on enwiki ;)
[18:17:36] andrwbogott: don't feel guilt i live close
[18:17:45] hah, yup, ok cool
[18:17:46] :)
[18:17:48] ottomata: first thing to try though, it so compare ipv4 and ipv6 on that
[18:17:59] <^d> AaronSchulz: best tool ever.
[18:18:05] is to*
[18:18:08] ok, i had compared it for esams -> eqiad in analytics cluster
[18:18:11] and compare the routing both ways
[18:18:19] any differences there?
[18:18:20] ipv6 was slower than ipv4, but i'll do the exact same comparison for that too [18:18:20] * ebernhardson gets constantly annoyed at eval.php puking on syntax errors and typos :P [18:18:21] ok cool [18:18:29] lemme compare apples to apples and run it again [18:18:44] ebernhardson: it does suck that loops have to squeeze into one line :/ [18:18:49] these are different links and routes [18:18:53] so it would be weird if they're both equally slow [18:18:57] or both slow, really [18:19:02] aye [18:19:03] so I used closures to make it more readable ;) [18:19:18] either should be able to reach at least 1 Gbps, up to 10Gbps [18:21:03] !log bd808 synchronized docroot/bits 'bits/static-current to 1.23wmf18; static-stable to 1.23wmf17' [18:21:08] Logged the message, Master [18:22:36] ottomata: which varnish servers are you testing on btw? [18:22:49] i'm curious to see if any of the amssq* ones are doing better [18:23:01] but those are just text varnish [18:24:29] * greg-g wonders what that spike at 16:00 was: https://gdash.wikimedia.org/dashboards/reqerror/ [18:24:33] cp3014 [18:24:42] i'm using it because it is depooled [18:25:26] * bd808|deploy wonders when checking "show code deploys" will be fixed in gdash [18:26:59] mark [18:27:01] cp3014 -> stat1002 IPv4: 19.3 Mbits/sec [18:27:01] cp3014 -> stat1002 IPv6: 89.3 Mbits/sec [18:27:01] stat1002 -> cp3014 IPv4: 584 Mbits/sec [18:27:01] stat1002 -> cp3104 IPv6: 553 Mbits/sec [18:27:20] (the IPv4 numbers are the ones from yesterday, IPv6 just collected) [18:27:22] interesting [18:27:31] try amssq* and ssl300* too [18:27:43] akosiaris: hey, RT duty person, mind looking at https://rt.wikimedia.org/Ticket/Display.html?id=6970 ? 
[18:27:46] just in case there's a difference [18:27:51] bd808|deploy: ^ :) [18:28:33] !log bd808 Purged l10n cache for 1.23wmf16 [18:28:37] Logged the message, Master [18:29:36] greg-g: {{done}} [18:30:33] bd808: rock [18:30:36] * greg-g looks at graphs [18:33:12] nothing obvious [18:33:14] * greg-g moves along [18:36:34] PROBLEM - Puppet freshness on labstore1001 is CRITICAL: Last successful Puppet run was Tue 11 Mar 2014 08:47:37 PM UTC [18:43:54] hmm, mark, this is weird, it might be inconsistent [18:44:02] i just ran these [18:44:03] amssq47 -> stat1002 IPv4: 62.8 Mbits/sec [18:44:03] amssq47 -> stat1002 IPv6: 49.6 Mbits/sec [18:44:03] ssl3001 -> stat1002 IPv4: 503 Mbits/sec [18:44:03] ssl3001 -> stat1002 IPv6: 48.5 Mbits/sec [18:44:19] when i got 503 form ssl3001 -> stat1002, i didnt' believe it [18:44:21] so i'm rerunning that now [18:44:24] and it is taking much longer this time [18:44:31] should have a number in a minute [18:46:28] yeah, mark, and this time i got [18:46:28] ssl3001 -> stat1002 IPv4: 23.1 Mbits/sec [18:47:17] mark, i gotta run [18:48:55] k, laters [18:49:36] andrewbogott: eth1 was disconnected [18:49:38] fixed [18:50:07] cmjohnson1: ok! [18:50:11] And, instances seem happier now... [18:50:48] sorry about that..i prolly did it yesterday swapping cables [18:51:07] np, glad it wasn't a software failure :) [18:54:17] cmjohnson1: MaxSem reports fixedness. thanks! [18:54:57] cough cough monitoring:) [19:00:31] greg-g: did that deploy2graphite ever work ? [19:00:45] cause I must be looking in the wrong direction otherwise [19:01:09] iirc it didn't [19:27:39] akosiaris: I *think* deploy2graphite worked before graphite was moved to eqiad. [19:28:29] akosiaris: ori may be able to give you troubleshooting tips and/or point you to wiki docs [19:30:33] bd808: well I am looking at graphite graphs for any deploy. 
whatever metrics and I find zero data up to two years before which is troublesome [19:40:04] akosiaris: The python scap scripts have been sending metrics named 'scap.*' to statsd.eqiad.wmnet:2003 for over a month and I've never seen them show up in graphite. Something on tungsten is listening for traffic on port 2003 but I can't dig any deeper since I don't have shell access to that host. [19:40:37] My guess is either that the statsd server there is borked or that it can't connect to the graphite host to record the metrics [19:41:56] carbon cache is definitely listening on 2003 tcp and udp on tungsten [19:42:26] so it is either network issue (easy to check). sending from tin I suppose ? [19:42:32] akosiaris: The listener on tungsten is carbon? That would be the problem. [19:42:43] and not statsd [19:42:45] that is correct [19:42:53] The scripts assume they are talking to a statsd server [19:43:04] The protocols are different [19:43:27] So I guess the question now is where is statsd running (host + port) [19:43:34] port 8125 udp perhaps ? not familiar with statsd tbh [19:43:49] that is where the twisted framework is listening [19:45:02] akosiaris: That sounds promising. 8125 is the default statsd port that etsy picked and I think I recall that the version we are using is a twisted app [19:45:18] give it a shot then :-) [19:45:28] will do :) [19:47:54] PROBLEM - DPKG on dbstore1001 is CRITICAL: DPKG CRITICAL dpkg reports broken packages [19:49:10] akosiaris: That's it. The port is wrong in the config files. https://graphite.wikimedia.org/render?from=-24months&until=now&width=400&height=250&target=drawAsInfinite(deploy.all.count)&_uniq=0.9361904815305024&title=drawAsInfinite(deploy.all.count) [19:49:34] :-) :-) :-) [19:49:41] * bd808 will make a patch [19:49:59] akosiaris: Thanks much for looking into that [19:50:16] you are most welcome. So I suppose I can mark as resolved https://rt.wikimedia.org/Ticket/Display.html?id=6970? [19:50:51] repasting to avoid that ? 
in the end https://rt.wikimedia.org/Ticket/Display.html?id=6970 [19:50:59] Yes. You did your part. I'll submit a puppet patch to fix the config and add you as a reviewer [20:12:25] akosiaris: it did [20:12:27] matanya: it did [20:12:46] you did what greg-g ? [20:13:21] * matanya lost the ability to read [20:13:35] greg-g: deploy2graphite ? [20:14:02] greg-g: It was configured with the wrong port! See patches attached to https://bugzilla.wikimedia.org/show_bug.cgi?id=62667 [20:15:14] https://bugzilla.wikimedia.org/show_bug.cgi?id=52359 [20:15:24] when I previously complained, but realized I was just doing it wrong [20:15:40] yay for communication! [20:15:48] mw:Beta Features/Hovercards enabled on mw.org - bugzilla:61167 [20:15:56] greg-g: 2003 is graphite port [20:15:57] I asked last week what wikis it will be turned on, no info [20:16:05] 8125 is statsd [20:16:54] week of March 24(?): Typograph Refresh beta feature (aka VectorBeta) graduated in to Vector for all users [20:17:01] greg-g: what does this mean? [20:17:07] matanya: /me don't know. All I know is previously, in July of '13, gdash's "show code deploys" worked. [20:17:08] how do I opt out if I like my links blue? [20:17:39] twkozlowski: sorry for this answer, but best to ask in -design [20:19:13] Then I will re-phrase my question [20:19:32] Is it confirmed that this will be pushed down our throats next week? [20:19:48] Because I'd like to prepare my own vector.css if it is. [20:21:08] ok, greg-g i think bd808's fix should fix this. [20:21:11] bd808: https://gerrit.wikimedia.org/r/#/c/116455/ ? [20:21:29] was the wrong port set in this ^ [20:21:39] or copied over? [20:21:50] matanya: copied over [20:22:09] from the old scap? [20:22:21] matanya: Yeah. See https://gerrit.wikimedia.org/r/#/c/119339/ [20:22:29] twkozlowski: for certain less graphic values of your question? yes, there should be an FAQ on the Typography Refresh page. 
If it doesn't answer your questions, it's a failing in the FAQ, so please ask it and have the WMF Design team answer it. [20:23:21] bd808: yes, saw the fix, i'm wondering about the source of the mistake, we might learn something from this [20:23:42] greg-g: I took the quote from your page [20:23:48] https://wikitech.wikimedia.org/wiki/Deployments [20:23:51] * bd808 assumes we'll learn that someone confused carbon and statsd [20:23:59] https://wikitech.wikimedia.org/wiki/Deployments#Next_month to be precise [20:24:13] twkozlowski: yes, that is the current schedule, details of what is and isn't included in "Typography Refresh" and what it means to users should be in their FAQ [20:24:13] do you know know where are the old scap git sources ? [20:24:30] greg-g: oh cool, I'll add a link then [20:25:50] https://www.mediawiki.org/wiki/Typography_refresh#Can_I_opt_out_of_changes_to_the_default_fonts.3F [20:25:57] matanya: In operations/puppet.git history. The files were in files/misc/scap I think [20:25:59] * twkozlowski swears loudly [20:26:09] oh, right [20:28:34] greg-g: Can you please clarify that, please? [20:28:46] This really /needs/ to be mentioned in next issue of Tech News [20:29:54] twkozlowski: I agree, and no, I don't have the domain knowledge to answser that, please ask the design mailing list [20:30:13] (I agree it should be clarified, that is) [20:31:01] oh no, yet another mailing list to subscribe to [20:31:03] * twkozlowski sighs [20:31:20] * greg-g looks at his IRC channel list and agrees [20:34:04] twkozlowski: there's also #wikimedia-design :) [20:34:24] There's also #wikimedia-antarctica [20:34:27] doesn't mean much. [20:40:39] Reedy, we got a go-ahead from ops with zero.wikimedia.org private wiki. 
Whenever you have time :) [20:40:54] PROBLEM - MySQL Slave Delay on db1042 is CRITICAL: CRIT replication delay 340 seconds [20:40:54] PROBLEM - MySQL Replication Heartbeat on db1042 is CRITICAL: CRIT replication delay 346 seconds [20:44:58] twkozlowski: i saw design people on trello [20:45:15] no idea what this is. [20:45:30] I saw one member of the team in Haifa [20:45:36] twkozlowski: heh. there is at least 3 people active on -design, though (me, prtksxna and quiddity) [20:45:47] twkozlowski: trello is a web-based system for managing work in progress in a SCRUM or Kanban type fashion [20:45:54] RECOVERY - MySQL Slave Delay on db1042 is OK: OK replication delay 0 seconds [20:46:01] Let me Google this. [20:46:04] :) [20:46:16] cmjohnson1: I'm looking for some hardware information about the OpenStack virt1xxx servers (Vendor, Model, CPU_type) https://ganglia.wikimedia.org/latest/?c=Virtualization%20cluster%20eqiad – all it says is 24 cores / ~170 GB RAM. Can you help me with that? [20:46:48] hedonil: sure [20:47:02] cmjohnson1: would be great! [20:48:54] PROBLEM - MySQL Slave Delay on db1042 is CRITICAL: CRIT replication delay 820 seconds [20:49:17] cmjohnson1: I'm updating some docs and want to enrich 'em with some nerdish facts & figures https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Overview [20:49:36] http://lists.wikimedia.org/pipermail/design/2014-March/001696.html greg-g [20:50:01] np give me a few mins and I will get what i can for you [20:50:38] twkozlowski: you kind of "buried the lead", as they say, in that message [20:51:16] twkozlowski: saying "Typography Refresh: has it been tested with color blind, low vision, or other non-perfect vision readers?" 
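[editor's note on the 19:40-19:49 and 20:15:56 exchanges] The deploy metrics went missing because statsd-formatted messages were sent to port 2003, where carbon was listening, instead of port 8125. A rough sketch of why the two are not interchangeable: statsd counters are lines like name:value|c, while carbon's plaintext protocol expects "name value timestamp". The helper names and the metric name scap.sync are hypothetical; deploy.all.count and the port numbers are taken from the log. Nothing here sends network traffic.

```python
import time

STATSD_PORT = 8125  # statsd default, per the log
CARBON_PORT = 2003  # carbon plaintext listener, per the log


def statsd_counter(name, value=1):
    """Format a statsd counter increment: '<name>:<value>|c'."""
    return f"{name}:{value}|c"


def carbon_plaintext(name, value, timestamp=None):
    """Format a carbon plaintext line: '<name> <value> <unix timestamp>\\n'."""
    ts = int(time.time() if timestamp is None else timestamp)
    return f"{name} {value} {ts}\n"


def looks_like_carbon(line):
    """True if the line has the three whitespace-separated carbon fields."""
    parts = line.split()
    if len(parts) != 3:
        return False
    try:
        float(parts[1])
        int(parts[2])
    except ValueError:
        return False
    return True


statsd_line = statsd_counter("scap.sync")  # hypothetical metric name
carbon_line = carbon_plaintext("deploy.all.count", 1, 1395171000)

print(repr(statsd_line), looks_like_carbon(statsd_line))
print(repr(carbon_line), looks_like_carbon(carbon_line))
```

A statsd line does not parse as a carbon line, so pointing a statsd client at the carbon port would not be expected to record anything. How carbon on tungsten actually handled those packets is not stated in the log.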
[20:51:16] * twkozlowski shrugs [20:51:40] twkozlowski: http://designatwikipedia.tumblr.com/ [20:52:19] twkozlowski: fwiw, the link colors aren't grey anymore [20:52:39] http://designatwikipedia.tumblr.com/ask [20:52:59] * twkozlowski doesn't have a Tumblr account [20:53:45] hedonil: They are Cisco UCS C250 M1, 8 300GB Disk [20:54:21] cmjohnson1: great! [20:54:33] * hedonil looks into specs [20:55:29] cmjohnson1: are they using 10 GBit/s network adapter yet or still 1GBit with LACP? [20:55:29] mutante|away: I can't see why I'd need to create a Tumblr account to ask a WMF team a question [20:55:37] mailing lists work OK for that sort of stuff :-) [20:56:06] They are using 1GBit [20:56:30] cmjohnson1: ahh. ok. thanks [20:57:42] hedonil thats 1Gb with LACP [20:58:17] twkozlowski: i agree with you very much [20:58:34] twkozlowski: i'm still giving you the means i have [20:58:48] mutante|away: And I'm grateful [20:58:57] I did end up posting to their mailing list :-) [20:59:35] design mailing list, or the talkpage of the BetaFeature, are best. That way everyone can see the questions and the answers (or help give answers). [21:07:26] cmjohnson1: Hmm. spec says: Up to two Intel Xeon 5500 Series multicore processors. the Intel Xeon 5500 Series is up to 4 Cores. With Hyperthreading enabled 8 Threads -> makes a max of 16 virtual CPU's. How come Ganglia is reporting 24? [21:09:34] so, grrrit-wm is back. just a fyi [21:14:01] root@virt1001:~# nproc [21:14:01] 24 [21:14:03] hedonil: i see 24 [21:15:20] (PS4) MarkTraceur: Add MMV feature flags for beta and pilot sites [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117376 [21:15:22] http://p.defau.lt/?ZYWyzz7Dk4nWhAMQPBUyKg [21:17:47] cmjohnson1: would you mind pasting $cat /proc/cpuinfo ? [21:19:37] http://p.defau.lt/?E3ZVbsEzcsIlOyW7ohAdug [21:20:52] cmjohnson1: great. thanks.
It's Intel Xeon 5600 family with up to 6 cores ;) [21:24:05] (PS1) Andrew Bogott: Create a simple role class to set up a big /mnt volume on labs [operations/puppet] - https://gerrit.wikimedia.org/r/119398 [21:24:56] yay, grrrit-wm [21:25:44] greg-g: woo :) sorry it took too long to get back. [21:25:50] greg-g: I should've migrated it before it went dark. [21:26:06] shoulda coulda woulda [21:28:57] "If I could've, I would've", to quote TBBT [21:37:34] PROBLEM - Puppet freshness on labstore1001 is CRITICAL: Last successful Puppet run was Tue 11 Mar 2014 08:47:37 PM UTC [21:43:02] (CR) Aude: [C: 1] "tried this for wikidata jenkins and it works" [operations/puppet] - https://gerrit.wikimedia.org/r/119398 (owner: Andrew Bogott) [21:43:06] (CR) Andrew Bogott: [C: 2] Create a simple role class to set up a big /mnt volume on labs [operations/puppet] - https://gerrit.wikimedia.org/r/119398 (owner: Andrew Bogott) [21:43:28] welcome back grrrit-wm [21:44:48] well, this is the wrong time to share the test phabricator instance with some friends...
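[editor's note on the 21:07-21:20 exchange] The 16 versus 24 discrepancy comes down to cores per socket. A small worked check, assuming both sockets are populated and Hyper-Threading gives two threads per core (the log only says "up to two" processors and "up to 6 cores", so these are assumptions):

```python
def logical_cpus(sockets, cores_per_socket, threads_per_core):
    """Logical CPU count as the OS would report it (e.g. via nproc)."""
    return sockets * cores_per_socket * threads_per_core


# hedonil's first estimate: Xeon 5500 series, up to 4 cores per socket.
estimate_5500 = logical_cpus(sockets=2, cores_per_socket=4, threads_per_core=2)

# After /proc/cpuinfo showed a Xeon 5600 part, up to 6 cores per socket.
estimate_5600 = logical_cpus(sockets=2, cores_per_socket=6, threads_per_core=2)

print(estimate_5500, estimate_5600)
```

With six cores per socket the product matches the 24 reported by nproc and Ganglia.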
[21:46:57] (PS1) Matanya: rt: lint role [operations/puppet] - https://gerrit.wikimedia.org/r/119399 [21:50:00] (CR) Dzahn: "is it worth doing this while this is already here: https://gerrit.wikimedia.org/r/#/c/116064/6/manifests/role/requesttracker.pp all you g" [operations/puppet] - https://gerrit.wikimedia.org/r/119399 (owner: Matanya) [21:50:47] (Abandoned) Matanya: rt: lint role [operations/puppet] - https://gerrit.wikimedia.org/r/119399 (owner: Matanya) [21:56:54] (PS1) Matanya: salt: qulify site and realm [operations/puppet] - https://gerrit.wikimedia.org/r/119403 [22:11:40] (CR) Hashar: "Coren wrote:" [operations/puppet] - https://gerrit.wikimedia.org/r/117199 (owner: Hashar) [22:14:13] I'm getting occasional error pages from the MW API when testing parsoid locally [22:14:28] Request: POST http://en.wiktionary.org/w/api.php, from 10.128.0.118 via cp4017 cp4017 ([10.128.0.117]:3128), Varnish XID 1197372587
Forwarded for: 216.38.130.164, 10.128.0.118
Error: 503, Service Unavailable at Tue, 18 Mar 2014 22:11:04 GMT [22:18:34] (PS14) Nemo bis: Add cron job to run characterEditStats.php on multilingual wikis weekly [operations/puppet] - https://gerrit.wikimedia.org/r/117250 [22:19:21] (CR) jenkins-bot: [V: -1] Add cron job to run characterEditStats.php on multilingual wikis weekly [operations/puppet] - https://gerrit.wikimedia.org/r/117250 (owner: Nemo bis) [22:26:40] (PS3) BryanDavis: [WIP] Make trebuchet work in eqiad.wmflabs [operations/puppet] - https://gerrit.wikimedia.org/r/119221 [22:30:00] anyone know the session timeout for wikitech? i keep getting logged out despite using two factor and checking "keep me logged in". [22:31:11] (Abandoned) OliverKeyes: Add r-base to Hadoop worker machines [operations/puppet] - https://gerrit.wikimedia.org/r/108633 (owner: OliverKeyes) [22:57:46] yurik: Do you have a bug/RT ticket/email etc detailing what you want? [22:58:37] Reedy, https://rt.wikimedia.org/Ticket/Display.html?id=6831 [22:58:51] Reedy, i think we settled on zero.wikimedia.org [22:59:37] (i think it was said before that there is a ssl cert for *.wikimedia.org) [23:07:57] yeah [23:08:02] *.wikimedia.org is fine [23:08:25] (PS3) Alexandros Kosiaris: osm module [operations/puppet] - https://gerrit.wikimedia.org/r/119408 [23:12:50] (PS10) Dzahn: lint admins.pp [operations/puppet] - https://gerrit.wikimedia.org/r/118794 [23:15:29] (CR) Hoo man: [C: -1] "So we've entered the rebase hell as it seems..." (1 comment) [operations/puppet] - https://gerrit.wikimedia.org/r/118794 (owner: Dzahn) [23:19:31] (CR) Dzahn: [C: -1] "yes, and since i did this literally 10 times now while admins.pp is being touched this is precisely where i stop for today" [operations/puppet] - https://gerrit.wikimedia.org/r/118794 (owner: Dzahn) [23:20:59] * hoo would have approved it...
[23:21:49] (PS1) Ori.livneh: Beta cluster MemcachedPeclBagOStuff: use PHP serialization [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119411 [23:22:11] mutante|away: so it that changeset finally frozen? can I test ? [23:23:22] (CR) Ori.livneh: "See for some tests confirming the behavior matches the description in this commit message." [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119411 (owner: Ori.livneh) [23:26:29] (PS4) Alexandros Kosiaris: osm module [operations/puppet] - https://gerrit.wikimedia.org/r/119408 [23:26:56] if mutante|away anwers you may stab him, he's supposed tobe gone [23:27:06] and btw I am gone too, good night! [23:30:38] greg-g: I can do the SWAT deploys if you like [23:30:56] ori: sweetsauce, are there any? [23:31:06] nope, that's why you volunteered! [23:31:08] ;) [23:31:27] * greg-g looks at the clock... a bit behind [23:31:34] my afternoon disappeared [23:31:41] SWAT deploy team window [23:31:41] gerrit:119313 GeoData/Nearby settings tweak for wikivoyage (Max) [23:31:41] gerrit:117376 Fixing up MMV config settings. (rdwrer) [23:31:54] am I misreading it? [23:31:56] * greg-g reloads the page [23:32:00] nope, you're right [23:32:22] ori: thanks, go forth [23:32:29] MaxSem rdwrer about? [23:32:32] Hai! [23:32:38] .!log march 18, 2014: ori interprets calendar correctly [23:32:39] greg-g, yep [23:32:47] ori: :P [23:32:51] * rdwrer throws confetti at ori [23:32:53] Good job [23:33:00] impossible is nothing [23:33:55] jdlrobson: re: , can you confirm that the condition you specified for deployment does in fact hold true?
[23:33:58] (PS4) BryanDavis: [WIP] Make trebuchet work in eqiad.wmflabs [operations/puppet] - https://gerrit.wikimedia.org/r/119221 [23:34:23] (CR) Ori.livneh: [C: 2] Add MMV feature flags for beta and pilot sites [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117376 (owner: MarkTraceur) [23:34:32] (Merged) jenkins-bot: Add MMV feature flags for beta and pilot sites [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117376 (owner: MarkTraceur) [23:35:20] ori, relevant part is https://gerrit.wikimedia.org/r/#/c/119313/1/wmf-config/CommonSettings.php [23:36:26] !log ori updated /a/common to {{Gerrit|Ifeda59963}}: Add MMV feature flags for beta and pilot sites [23:36:31] Logged the message, Master [23:36:40] yay, nil change [23:36:49] (PS2) Ori.livneh: Bump geosearch radius to 20km on Wikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119313 (owner: MaxSem) [23:36:53] (CR) Ori.livneh: [C: 2] Bump geosearch radius to 20km on Wikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119313 (owner: MaxSem) [23:37:01] (Merged) jenkins-bot: Bump geosearch radius to 20km on Wikivoyage [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119313 (owner: MaxSem) [23:37:57] !log ori synchronized wmf-config 'Ifeda59963: Add MMV feature flags for beta and pilot sites' [23:38:01] Logged the message, Master [23:38:15] !log ori updated /a/common to {{Gerrit|I936d5abe3}}: Bump geosearch radius to 20km on Wikivoyage [23:38:20] Logged the message, Master [23:38:47] !log ori synchronized wmf-config 'I936d5abe3: Bump geosearch radius to 20km on Wikivoyage' [23:38:53] Logged the message, Master [23:39:00] jdlrobson, ^^^ [23:39:05] ori: thankyou kindly good sir [23:39:35] np! glad you're driving this initiative. [23:39:48] We seem good [23:39:55] success!
[23:39:59] * greg-g calls it early [23:40:09] (PS1) Jforrester: Follow-up: Ifeda5996385 – 'mediaviwerpilot.dblist' is ugly [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119415 [23:40:17] rdwrer: ^^^ OK by you? [23:40:23] hah [23:40:27] MaxSem: https://en.wikivoyage.org/wiki/Special:Nearby seems to now be pulling from wikipedia.org ?! [23:40:27] Uh [23:40:36] * MaxSem shakes ori's hand [23:40:40] * greg-g jinxed it [23:40:43] (I've silenced grrrit-wm :)) [23:41:04] (CR) Ori.livneh: [C: 1] Follow-up: Ifeda5996385 – 'mediaviwerpilot.dblist' is ugly [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119415 (owner: Jforrester) [23:41:06] James_F: Seems fine [23:41:10] Kk. [23:41:15] ori: Good to merge, then. [23:41:16] OK, I'm going to deploy it then [23:41:28] bd [23:41:31] seeing as it is really an addendum to the previous patch [23:41:35] Yeah. [23:41:39] (CR) Ori.livneh: [C: 2] Follow-up: Ifeda5996385 – 'mediaviwerpilot.dblist' is ugly [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119415 (owner: Jforrester) [23:41:50] (Merged) jenkins-bot: Follow-up: Ifeda5996385 – 'mediaviwerpilot.dblist' is ugly [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/119415 (owner: Jforrester) [23:42:01] * James_F is such a whiny perfectionist, sorry. [23:42:08] !log ori updated /a/common to {{Gerrit|Ie7597f5f9}}: Follow-up: Ifeda5996385 – 'mediaviwerpilot.dblist' is ugly [23:42:14] Logged the message, Master [23:42:24] MaxSem: ?
[23:42:49] !log ori synchronized wmf-config 'Ie7597f5f9: Follow-up: Ifeda5996385 – 'mediaviwerpilot.dblist' is ugly' [23:42:53] Logged the message, Master [23:42:56] jdlrobson, this shouldn't have changed [23:43:07] $wgMFNearbyEndpoint is reporting as wikipedia.org now MaxSem mmm [23:43:17] pretty sure it wasn't this morning [23:45:18] ahhhrg MaxSem seems I'm too blame :-S [23:45:50] correction kaldari is :) [23:46:00] that kaldari [23:46:03] jdlrobson, wtf is that variable? [23:46:13] MaxSem: it's to allow local testing [23:47:45] so, it's broken or not? :) [23:48:01] it's broken, but was so even before;) [23:49:03] still successful SWAT window then! [23:49:31] as a result, I'll have to SWAT a fix tomorrow [23:49:31] ori, do you have anything to do with the 'eventlogging' project in labs? [23:49:36] MaxSem: /me nods [23:49:38] yes, it's a new verb! [23:49:43] :P [23:49:50] MaxSem, posted RFC :) [23:49:51] (Draft4) Alexandros Kosiaris: osm module [operations/puppet] - https://gerrit.wikimedia.org/r/119408 [23:50:00] * greg-g goes to work on taxes then [23:50:30] greg-g, on the other hand, we have 10 minutes:) [23:50:43] you have a fix for it already? [23:50:49] yes [23:50:59] simple/easy/safe? [23:51:13] :) [23:51:23] ori: your call ^ [23:51:26] [04:00:05] ori: Do you know anything about the 'eventlogging' labs project? Specifically, whether it's good for anything anymore?
[23:51:27] [04:01:28] there's an eventlogging setup on the beta cluster that is the canonical place for working on eventlogging in labs, and i'm almost certain that the dedicated eventlogging project is therefore a dupe, but let me make sure [23:51:27] [04:02:13] thanks [23:51:28] [04:39:58] andrewbogott_afk: I don't think I see an eventlogging labs project [23:51:44] MaxSem: let's have it [23:51:50] https://gerrit.wikimedia.org/r/#/c/119416/ [23:51:56] eep, guess I should've made a note when you said that :) thanks [23:52:48] * andrewbogott marks eventlogging for death [23:53:31] ori, yeah, I'll push it [23:53:51] MaxSem: OK. Don't forget the submodule update commit. [23:54:05] eventlogging is dying? [23:54:23] cmjohnson1: is the 'observium' labs project still used for anything? [23:54:45] (PS1) Alexandros Kosiaris: Enable planet.osm population on labsdb1004 [operations/puppet] - https://gerrit.wikimedia.org/r/119419 [23:55:05] yurik: just flog.wmflabs.org [23:55:56] andrewbogott, wha? [23:57:36] yurik: so… is that something you use? [23:58:11] andrewbogott, no, i was just wondering if i should update eventlogging schema editor (its up on meta), or if that extension is dying too [23:58:30] !log maxsem synchronized php-1.23wmf18/extensions/MobileFrontend/MobileFrontend.php 'https://gerrit.wikimedia.org/r/119416' [23:58:35] Logged the message, Master [23:58:41] yurik, I'm talking about the labs project 'eventlogging'. I don't know what its relationhship is to the extension, if any. [23:58:44] jdlrobson, done^ [23:58:57] andrewbogott, probably nothing, never mind than :) [23:59:22] ok