[00:00:31] Ryan_Lane: Just in case there's another Ryan (I didn't get your irc nickname during the conversation) - it was you I spoke with, right ? [00:00:48] I dunno, when? :) [00:00:56] 1 hour ago [00:01:00] where? [00:01:04] Skype ? [00:01:07] nope :) [00:01:10] OK. Thanks [00:01:15] there's also kaldari [00:01:21] and ryan faulkner [00:02:07] neilk_: Which Ryan sat next to you after we spoke ? [00:02:19] that was Kaldari [00:02:32] he prefers to just be called Kaldari [00:02:32] Oh, my bad. [00:05:31] Alright, I'm gonna close up for today [00:05:31] nn [01:29:07] hey folks, demon had something come up, and won't have time for a bit to look at this: https://bugzilla.wikimedia.org/show_bug.cgi?id=27287 [01:29:46] it's a test2 problem with CentralAuth that might actually be a real 1.17 problem, but it could just be a problem with the test infrastructure [01:30:09] is there anyone with the access, time, and inclination to look at it? [01:35:09] I will do it if I have time [01:35:35] but first of all I want to make sure we have profiling set up in a way that will let us detect what the main problem is [01:38:51] TimStarling: yeah, keep going on that front. Chad may get back before you're done [01:40:40] btw, just refreshed the review list: http://www.mediawiki.org/wiki/MediaWiki_roadmap/1.17/Revision_report [01:40:58] 7 revs left: 3 unreviewed, and 4 fixmes [02:33:31] wgLogAutocreatedAccounts should probably be set to true (default is false) in the 1.17 config files if we want autocreated accounts logged on test2 [02:34:39] I'm happy to add it to um I guess InitialiseSettings in a test2 stanza if we want it [03:13:49] *^demon is back now :) [03:14:34] see the backread (I didn't add it as I didn't want to muck with test2 when Tim was poking at it) [03:22:18] <^demon> We should set it for test2wiki, confirm that's the fix, and then set the default to true cluster-wide. [03:22:44] <^demon> Victor changed behavior in r64853. [03:41:07] excellent, thanks apergos! [03:43:29] if that default makes sense [03:43:36] also that is only for um the autocreated ones [03:44:06] the other issue (accounts created from special:login) I don't know about [03:44:34] (sorry I was away looking at a media store demo, or rather a demo of a testing infrastructure, it is coming along!) [03:52:17] I added it in InitaliseSettings enabled for test2 only and will sync it [03:52:46] <^demon> Alrighty [03:54:17] let's see how that does [03:56:01] lookie there, two accounts created automatically [03:56:01] <^demon> Weird, I didn't login via cookie... [03:56:14] (one is mine) [03:56:34] <^demon> Might be on my end. [03:56:46] <^demon> I've been having intermittent login issues for the last 2 weeks or so [03:56:51] guess I'l try creating an account the old way now and see how that is [03:57:50] <^demon> Hmmm. [03:57:56] <^demon> I wonder if my bot account isn't SUL'd [03:58:06] new user account showed up [03:58:12] created the old way [03:58:17] <^demon> Ha, I never bothered merging into SUL. [03:58:31] where's that bug report [03:58:33] Yay, I saw -tech. [03:58:40] It wasn't enabled? [03:58:42] oO [03:58:44] nope [03:58:49] Weird. [04:00:07] nice job. go merge yerself ^demon [04:00:16] I mean that in the nicest way of course :-D [04:00:30] <^demon> haha, I have to unblock it first. [04:00:34] woops [04:00:38] <^demon> I blocked my bot account on enwiki ages ago [04:01:15] I had looked earlier and I remembered that on test when we enabled 1.17 I had created an account the manual awya and it worked (so it seemed unlikely to be the code causing the issue) [04:01:30] what is left in the queue? [04:01:51] apergos: which queue? [04:02:01] fixes. reviews. whatever [04:02:12] stuff to do before 10 pm PST [04:02:33] well....I'm looking at this problem that Ryan_Lane|away reported: https://bugzilla.wikimedia.org/show_bug.cgi?id=27310 [04:02:50] <^demon> Hmm, it fixes the logging-when-auto-created. [04:02:52] hmph [04:02:58] <^demon> Still isn't logging on login. [04:03:04] it's on trunk and may not be on 1.17, but was just about to test it [04:03:19] ^demon: I created an account via special login and it showed up [04:03:35] see recent changes, look at [04:03:38] <^demon> Created, or logged in to a SUL'd account? [04:03:41] TestAcctCreation1.17 [04:03:45] I did both: [04:03:49] <^demon> I tried logging in as ^demonBot2, but I didn't get a creation. [04:03:53] first I logged in via cookie (that's ??????????????) [04:03:56] then I logged out. [04:04:09] next I created an account (that's TestAcctBlah), that worked [04:04:14] what I did not try is [04:04:24] create an account while already logged in [04:05:11] in fact I'm still logged in as testacctblah [04:05:18] *apergos logs out [04:06:05] <^demon> What about logging in normally, not via cookie. [04:06:10] <^demon> That should also create a local log entry [04:06:13] <^demon> But doesn't seem to be [04:06:30] hmm, let's see if that's how it works elsewhere [04:07:11] password reset seems fine at least [04:07:31] <^demon> Hmm, actually it doesn't seem to do that in 1.16wmf4 either. [04:07:43] <^demon> Tried logging in to frwiki, didn't log as an auto-creation [04:07:49] I don't see it in production so *gong* [04:07:56] yeah I just did it wikisource [04:08:16] <^demon> Hmm, wonder how long that's been a regression. [04:08:24] heheheh [04:08:37] long enough to ignore it for now [04:08:49] <^demon> It's certainly not a regression in 1.17 [04:08:50] you can modify the one bug (I did not close it) [04:09:01] or add a new one but let's save til later [04:10:14] I think I encountered a great bug in my script to update the rev list....looks like we're getting few enough revisions to review that I've got an uninitialized variable :) [04:10:24] nice :-D [04:10:27] what a good bug to find! [04:10:38] I thought so [04:10:46] I don't mind finding that one about now [04:10:48] <^demon> apergos: I commented. [04:11:01] who knew two years ago when I was watching brion sweat through deployment that someday... [04:11:01] <^demon> So, we should go ahead and set that to true for all wikis. [04:11:17] <^demon> Or change it in DefaultSettings to true. He changed behavior in that commit, and it's unintuitive. [04:11:20] I'd be suffering through the same crap :-D [04:11:33] yer call ^demon [04:11:58] I kinda wonder if we didn't have people complaining about the autocreate stuff [04:12:24] <^demon> I can't imagine it's expected behavior. But obviously no one was missing it... [04:12:35] (too many entries as people were bouncing around all over the place) [04:12:55] hmm that's true [04:15:22] I think he changed it in response to some sort of steward request [04:15:40] ok [04:15:46] you should ask him why, send him an email [04:15:52] bummer, not quite as rosy as I'd thought: http://www.mediawiki.org/wiki/MediaWiki_roadmap/1.17/Revision_report 6 revs left [04:16:37] we need an "okfornow" tag ;-) [04:16:44] heh [04:18:43] <^demon> r81896 is only fixme because of a db2 complaint [04:18:47] <^demon> db2 doesn't matter :) [04:20:26] ^demon: is that the only problem with that checkin? [04:21:04] <^demon> Yeah the rest of it was wfProfile fixes that Roan already merged. [04:21:05] I guess everything else is a profile in or out call [04:22:09] <^demon> Ok, e-mailed Victor. [04:22:12] ok...we won't worry about that. the obsessive compulsive side of me wants to create a "1.17wmf1ok" tag, but I'll resist the urge [04:22:12] cool [04:23:02] <^demon> robla: It doesn't really matter for release either. [04:23:07] this one looks like the only one not ok for deploy: http://www.mediawiki.org/wiki/Special:Code/MediaWiki/70498 [04:23:18] or at least, no one has declared it safe yet [04:23:57] <^demon> r70095 *should* be ok. We're not using upload-by-url (yet). The generally applicable bits look fine. [04:24:52] is there a SF location for deployment ? [04:25:04] location? [04:25:06] mdale: not really [04:25:10] I'm on my bed in the wikisuites :-P [04:25:16] ( Danese ) is asking [04:25:29] paaarty in thw wikisuites! [04:25:36] (russ and steven would kill me) [04:26:16] This IRC channel was kinda it for the last deploy [04:27:04] I totally forgot to dial into the conf line during the deploy [04:27:09] (last time) [04:27:20] <^demon> Did anyone? [04:27:29] I'm not sure...no one complained [04:27:43] I think we were all typing [04:27:47] worked well enough [04:28:24] yeah. there are times when having a headset and being able to talk and type is helpful, but IRC worked well enough [04:29:40] btw, what about r70498? [04:29:48] !r 70498 [04:29:48] --elephant-- http://www.mediawiki.org/wiki/Special:Code/MediaWiki/70498 [04:30:12] <^demon> I hadn't even looked at it yet. [04:31:12] that's the only one that no one seems to have mentally attached the "goodenoughfornow" tag to [04:31:25] need a special page with wikitext headings [04:31:33] to look at on test2 [04:32:26] <^demon> Use Special:Version [04:32:42] I did but I don't see edit tags there [04:33:01] edit section links are added to wikitext headings on special pages it says but I don't see that [04:33:12] <^demon> Oh hrm. [04:33:40] of course if it's actually fixed [04:33:43] ah, I see: http://www.mediawiki.org/wiki/Special:Code/MediaWiki/79185 [04:33:53] it was fixed there, and merged into 1.17 [04:33:57] in that rev like in a prev comment... [04:34:07] well there we go then [04:34:09] so this one looks like it's resolved, actually [04:34:12] <^demon> I think version is the only one. [04:36:15] so someone gonna close that? :-P [04:37:56] well, I actually misfollowed the version history. the comment ialex made may still not be fixed, but I guess we're in a "works for me" situation, right? [04:38:16] http://test2.wikipedia.org/wiki/Special:Version [04:38:27] yup [04:39:35] ok...I'll put it back, then [04:43:03] alright...going afk for a little bit [04:43:13] okey dokey [04:49:14] <^demon> This time I'm gonna watch James Bond movies during deployment :) [04:50:58] ok, that's just plain weird :-D [04:51:06] oohhh... time to put on the news [04:51:30] <^demon> I started with The Spy Who Loved Me [04:51:46] it's been a long time since I've seen them [04:51:53] I used to watch those bond marathons on tnt [04:52:02] waaaay back in the day [04:52:23] <^demon> I think Spike has the rights to Bond marathons these days [04:52:37] <^demon> I've got them all [04:52:51] dang [04:52:53] that's hardcore [04:53:49] sports news, bah [04:54:01] but it will be the top of the hours in a couple minutes and then we'll get more ... [04:54:01] <^demon> Oh, it's been my career goal since I was like 5 to be a secret agent [04:54:11] <^demon> Travel around the world drinking martinis and blowing stuff up. [04:54:11] rumors about egypt! [04:54:19] hahaha [04:54:45] I think you have to be involved in organized crime for that [04:56:44] woah really, in an hour? bah, there goes my afternoon nap... [04:57:09] <^demon> An hr? [04:58:01] <^demon> I think I'm going to run to the store real quick and grab some snacky sustenance. [04:59:26] http://noc.wikimedia.org/cgi-bin/pcache-hit-rate.py [04:59:32] finished in plenty of time [05:00:36] <^demon> I'll be back in like 20 or so. [05:07:10] (making food) [05:07:43] ok! [05:20:11] <^demon> (back) [05:20:36] ditto [05:27:13] food in about 10 minutes I think... [05:45:14] anyone mind if I take a more active role this time around? [05:45:47] Is the first window of 1.17 upgrade start? [05:46:39] <^demon> TimStarling: I take the silence to mean no objections. [05:47:03] TimStarling: what would the difference be? [05:47:32] I'll be the one moving wikis and syncing things [05:47:53] Roan is not here anyway [05:48:11] ah, I see. sounds reasonable [05:49:09] I'll move usability.wikimedia.org when the appointed time comes up in 7 minutes [05:49:59] that's basically an alpha test since nobody uses it [05:50:11] hi roan, just saying usability first in 5 minutes [05:51:03] then after that, the next batch can be strategy, simple x2 and meta [05:51:15] OK [05:51:18] that's hopefully a beta test since people do use meta [05:51:18] <^demon> Do any of the wikis in round 1 use FlaggedRevs? [05:51:23] Let me just make some breakfast real quick [05:51:55] Yes, we should be a little more comfortable with ourselves by the time we do meta [05:51:57] then the third set will be the rest of them, that give us our load test, of sorts [05:52:00] Because central stuff lives there [05:53:39] <^demon> Ah, enwikinews and enwikibooks use it. My followup question is moot then. [05:53:40] Is it only the 11 selected wikis will upgrade today? Countdown, it just have 2 minutes left... [05:54:02] <^demon> Time is an illusion. [05:54:09] I see stats/1.17-phase1 and 1.17-phase1 [05:54:13] is there a difference? [05:54:29] or to be more precise [05:54:41] what is the difference, since I see ther eis one? [05:54:58] waihorace: yes, it's only those ones, and only for a while. [05:55:30] It is 6 UTC now... [05:55:51] can we maybe have a little bit less waihorace this time around? [05:56:02] And a little more breakfast for Roan :) [05:56:10] anyway usability is on 1.17 now [05:56:14] Oh? [05:56:25] Oh you logged it in -ops [05:57:54] still the PHP diff engine? [05:58:17] ? [05:58:54] Hmm, abg of >500 calls to wfMsgReal per req, is that normal? [05:59:34] in which db? [05:59:35] It's elevated [05:59:38] 1.17-phase1 [05:59:53] Wait, five HUNDRED [05:59:54] On enwiki the average is 40 [06:00:01] heh [06:00:03] those are the pre-1.17 wikis [06:00:12] they are running on 1.16 [06:00:14] Oh [06:00:29] let's hope 1.17 has an efficient wfMsgReal() [06:01:05] Ah yes [06:01:24] OK so what we're seeing in 1.17-phase1 is a mix of 1.17 (usabilitywiki) and 1.16 (others) currently [06:01:36] yes, but I haven't cleared the profiling data yet [06:01:51] so it's mostly from all those wikis for the last 6 hours or so [06:02:02] I took a snapshot in /ng, I'll clear it now [06:02:16] OK [06:02:35] also you may like http://noc.wikimedia.org/cgi-bin/pcache-hit-rate.py [06:02:58] that's the same set of wikis, whether or not they are running 1.17 [06:03:24] hit-rate.py is nice [06:03:29] Which wikis does it refer to? [06:03:44] 1.17-phase1.dblist [06:03:47] The hit rate is shows seems atrociously low [06:04:03] it agrees with the data in stats/1.17-phase1 though [06:04:04] enwiki had a 78% hit rate under normal conditions earlier this week, right? [06:04:58] 74% [06:05:27] How are you counting pcache_not_possible? Not at all? [06:05:33] smaller wikis get a smaller portion of the cache space, because they are further down the frequency/rank graph [06:05:39] http://usability.wikimedia.org/wiki/Special:CommunityVoice and I got HTTP 500 using IE8 [06:05:39] not at all [06:05:44] *RoanKattouw shuts up and reads the source instead [06:05:45] Right [06:05:51] 12 Fatal error: Cannot use object of type stdClass as array in /usr/local/apache/common-local/php-1.17/extensions/CommunityVoice/Modules/Ratings.php on line 28 [06:06:05] I know exactly what happened there [06:07:14] <^demon> I'm already on it [06:07:25] <^demon> Should be $row->vot_category; [06:07:27] I am too [06:07:33] Reverting r75677 on CommunityVoice [06:07:48] But ^demon will be faster, he wins [06:08:33] So, there was a bug reported against 1.17wmf1 about logging account creations [06:08:55] <^demon> Part of it was seemingly on purpose because of a change Victor made. [06:09:07] !r 75677 [06:09:07] --elephant-- http://www.mediawiki.org/wiki/Special:Code/MediaWiki/75677 [06:09:07] already dealt with but only enabled for test2, we'll see about the rest later [06:09:13] apergos: ? [06:09:18] ^^ [06:10:09] https://bugzilla.wikimedia.org/show_bug.cgi?id=27287 for the comments if you want them [06:10:21] Right [06:10:51] doublewiki is quite the memory hog I see (that's irrelevant, just watching the messages scroll by) [06:11:04] It is, yes [06:11:11] apergos: Where did you set that var exactly? [06:11:32] initalisesettings in a stanza at the end. default left as false [06:12:09] <^demon> RoanKattouw: Fix is in trunk. Review + merge plz. [06:12:28] <^demon> apergos: It's a memory hog in 1.16 as well. I'm not sweating it right now. [06:12:55] oh me neither, just being amused [06:13:14] I pushed out a fix as a live patch [06:13:51] PHP Fatal error: Allowed memory size of 83886080 bytes exhausted (tried to allocate 51927 bytes) in /usr/local/apache/common-local/wmf-deployment/languages/LanguageConverter.php on line 390 [06:13:52] hmm [06:14:14] we're mostly interested in errors in 1.17 at this stage [06:14:39] couldn't tell if that was 1.16 or 1.17 [06:14:46] apergos: wmf-deployment in the path [06:14:53] eh duh [06:14:54] :-D [06:15:06] Oh right, I forgot [06:15:12] Running svn up I get lots of changes pulled in [06:15:20] Mostly the profiling stuff from yesterday [06:15:23] I made a script called sync-file-17, it's like sync-file but for 1.17 [06:16:33] OK [06:16:48] PHP Warning: filemtime() [function.filemtime]: stat failed for /usr/local/apache/common-local/wmf-deployment/extensions/Vector/modules/./images/open.png in /usr/local/apache/common-local/php-1.17/includes/resourceloader/ResourceLoaderFileModule.php on line 361 [06:16:54] note mixed paths [06:17:17] ^ [06:18:24] I got " TimStarling: Can I run s-c-a ? [06:18:48] I'll run it as root [06:19:09] OK [06:20:36] https://bugzilla.wikimedia.org/show_bug.cgi?id=27318 [06:21:35] getting reports of that bug ^ on #wikimedia-tech as well. I haven't seen it myself, though [06:21:40] I have [06:21:47] Use ?uselang=fr on usabilitywiki [06:22:14] is this possibly a localization cache issue? [06:22:25] Blegh [06:22:31] /* exception 'DBQueryError' with message 'A database error has occurred. Did you forget to run maintenance/update.php after upgrading? See: http://www.mediawiki.org/wiki/Manual:Upgrading#Run_the_update_script Query: DELETE FROM `msg_resource` Function: MessageBlobStore::clear Error: 1213 Deadlock found when trying to get lock; try restarting transaction (10.0.6.44) [06:22:57] And of course RL happily sets a Cache-Control header on that [06:23:09] I wanted to fix cache settings for errors at some point but forgot [06:23:54] It's in MessageBlobStore::clear() which is called from LocalisationCache::recache() [06:24:15] I'm guessing DELETE FROM msg_resource; is generally a deadlock-prone query though [06:26:18] Roan's online right now? [06:26:35] Well yeah [06:26:47] I don't usually expect you for a few more hours. [06:26:59] Shirley: we've started a deploy [06:27:01] I don't see that with el btw [06:28:02] I've written a patch for the caching issue, now testing it [06:28:15] the module_deps table seems ok now, maybe it was just a transition thing? [06:29:00] Could've been [06:29:10] My local patch for not caching RL errors works, committing [06:29:52] Hmm, I'm thinking [06:30:21] // TODO: Give this some more thought [06:30:22] This basically does if ($warnings !== '' || $exceptions !== '' ) { don't cache } which means any file-doesn't-exist warning has a quite large impact [06:30:30] TimStarling: Yeah, I meant to ask you about it [06:31:18] recache() is called for each language separately [06:31:32] was there some reason for clearing all languages? [06:31:58] Just me not realizing [06:32:10] TimStarling: nadeesha_calcey reported the bug on test2 a bit before we started here, so it may not be a transition thing [06:32:17] However, the deadlock prone-ness remains [06:32:36] put it in a transaction [06:32:42] robla: BTW, I ended up filing a bug for rewriting Wikimedia's sync scripts. [06:32:44] woops [06:32:47] 'cause I couldn't find one. [06:32:56] TimStarling: OK, I'll wrap it in begin(), end() and add a clause for the language [06:33:03] begin() and commit() [06:33:10] TimStarling: What are your thoughts on whether nocache any single PHP warning is a good idea [06:33:16] Yeah, commit(), of course. Wake up, Roan [06:33:46] PHP warnings shouldn't appear in the output [06:34:10] so they shouldn't need any special cache headers [06:34:49] They currently appear in /* comments */ [06:34:49] I wonder what that pile of segfaults was from [06:34:52] gone now anyways [06:34:53] But I can remove them I guess [06:34:59] Exceptions I would like to stay [06:35:13] Oh wait [06:35:17] Warnings appear in debug mode only anyway [06:35:50] maybe don't show warnings if display_errors is off [06:35:51] TimStarling: Also, I'll need an index on mr_lang. Can you create one across all DBs? [06:36:05] and now a batch of these: [06:36:08] Feb 11 06:40:38 10.0.2.202 apache2[13387]: [error] [client 208.80.152.85] request failed: error reading the headers [06:36:19] that means RL will be in keeping with site security policy [06:36:32] Right [06:36:38] do you have an ALTER TABLE command? [06:38:20] CREATE INDEX msg_resource_lang ON msg_resource (mr_lang); [06:38:28] it's a bit more complicated than you think to properly clear msg_resource on recache [06:38:35] in case rob la is still around, my bug report pretty much only affects the LDAP extension, though it's a general bug in mediawiki [06:38:44] because of fallbacks [06:38:44] TimStarling: How so? [06:38:49] Ugh [06:38:56] Ryan_Lane: yup...got it, thanks! [06:38:56] Well, actually, no [06:39:12] Or yes [06:39:22] Drat [06:39:46] also, recache() is called when there is no local cache, and you are clearing a shared cache [06:40:05] so every time a message file changes, you'll be clearing it 200 times [06:40:10] once for each server [06:40:14] Right [06:40:17] Hence the deadlocks [06:40:44] could that have caused the performance problems we saw last time? [06:40:53] *RoanKattouw is wondering whether we should've gone with CDB for msg_resource after all [06:41:54] I'll at least wrap this stuff in a transaction [06:42:21] I think we will need more than just a transaction before we continue deployment [06:42:24] how often are message files changing? [06:42:44] apergos: Every time we deploy code xD [06:42:48] As in every time we upgrade the site [06:43:03] mmm hmmm [06:43:05] The cache invalidations should stop at some point though [06:43:35] there are 355 messages files, and 146 apaches [06:43:46] give or take [06:43:51] so 51830 cache invalidations [06:44:03] Right [06:44:10] Yeah the transaction is a very temporary patch [06:45:34] TimStarling: Alright, shall I write a quick CDB replacement for msg_resource? [06:46:06] there's still the fallback issue [06:46:45] Yes, I'll have to do something similar to LCStore [06:46:50] CDB for WMF, DB for mortals [06:46:58] if RL did its own cache invalidations, and it did them when a message was requested instead of hooking into LocalisationCache, it could deal with fallbacks elegantly [06:47:22] That'd require RL knowing when to invalidate [06:47:49] And knowing that it has to beforehand so the module's mtime can be updated to reflect the message chance [06:47:52] *changes [06:48:42] But it seems that code is kind of broken in trunk anyway [06:48:52] LocalisationCache doesn't know when to invalidate, it just does it once per request at the first wfMsg() call [06:49:20] once per language, I should say [06:50:08] Yes, because it has CacheDependency and all that jazz [06:50:09] RL could do its checks in the startup module, that's cached pretty aggressively isn't it? [06:50:16] hmm [06:50:22] The startup module is cached for 300 seconds [06:50:33] but it's not enough, right? [06:50:42] you need the mtimes of the modules that are included in the HTML [06:50:52] and for that you would need to check on every request [06:51:05] We accept that invalidations will take 5 minutes to propagate [06:51:30] So if a module's mtime changes, the client might not find out about that for 5 mins. Once it's got the new mtime, it'll be set [06:51:51] What I would like to have here, though, is for l10ncache to know when a certain language was last cached [06:52:24] Or, better, the invalidation timestamp it comes up with using CacheDependency [06:53:27] the text of the messages is included with the JS source of the module, right? [06:53:37] so each module request has a language parameter? [06:54:02] Yes [06:54:23] can the mtime be different for different languages? [06:54:41] Yes [06:54:52] See ResourceLoaderModule::getMsgBlobMtime( $lang ) [06:54:59] (Its logic is subtly broken, I just noticed) [06:55:16] so when you are writing a