[00:03:30] !log db1050 alive. do not repool. [00:03:36] Logged the message, Master [00:09:08] There's no LD scheduled but FYI Flow deploy completed some time ago. Thanks [00:29:32] (03PS1) 10Ori.livneh: Release ordered_json.rb under the terms of the Apache license [operations/puppet] - 10https://gerrit.wikimedia.org/r/112828 [00:35:35] (03CR) 10Ryan Lane: [C: 031] "Excellent. Thanks!" [operations/puppet] - 10https://gerrit.wikimedia.org/r/112828 (owner: 10Ori.livneh) [00:35:51] (03CR) 10Ori.livneh: [C: 032] Release ordered_json.rb under the terms of the Apache license [operations/puppet] - 10https://gerrit.wikimedia.org/r/112828 (owner: 10Ori.livneh) [01:14:37] db crash again? [01:15:04] Function: User::saveOptions Error: 1213 Deadlock found when trying to get lock; try restarting transaction (*ip address*) [02:18:45] !log LocalisationUpdate completed (1.23wmf13) at 2014-02-12 02:18:44+00:00 [02:18:51] Logged the message, Master [02:22:35] who is the right person to ask about gerrit permissions? [02:29:10] gwicke: I'm not the right person, necessarily, but I can probably help [02:37:38] !log LocalisationUpdate completed (1.23wmf12) at 2014-02-12 02:37:38+00:00 [02:37:46] Logged the message, Master [03:13:44] (03CR) 10Jforrester: [C: 04-1] "For discussion." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112590 (owner: 10Jforrester) [03:16:07] (03CR) 10Jforrester: [C: 04-1] "Need to notify the community first." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/111149 (owner: 10Jforrester) [03:17:13] (03CR) 10Jforrester: "Max: The OTRS admins, whose call it is." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/100927 (owner: 10Jforrester) [03:19:05] !log LocalisationUpdate ResourceLoader cache refresh completed at 2014-02-12 03:19:05+00:00 [03:19:14] Logged the message, Master [03:22:40] RECOVERY - mysqld processes on db1036 is OK: PROCS OK: 1 process with command name mysqld [03:24:10] about topic branch names, if every single one is unique, isnt it just repeating the first line of the commit message? doesn't it make more sense to have a few topic branch names that are being reused, so like categories [04:45:31] (03CR) 10GWicke: "The salt issue is tracked at https://github.com/saltstack/salt/issues/10339" [operations/puppet] - 10https://gerrit.wikimedia.org/r/112640 (owner: 10GWicke) [04:47:58] !log disabling mysql monitoring on db48 (prep before starting OTRS upgrade) [04:48:05] Logged the message, Master [04:48:16] !log disabling https/smtpd monitoring on iodine [04:48:24] Logged the message, Master [05:11:20] PROBLEM - ElasticSearch health check on elastic1002 is CRITICAL: CRITICAL - elasticsearch (production-search-eqiad) is running. status: red: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1340: active_shards: 3895: relocating_shards: 0: initializing_shards: 1: unassigned_shards: 0 [05:12:20] RECOVERY - ElasticSearch health check on elastic1002 is OK: OK - elasticsearch (production-search-eqiad) is running. status: green: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1341: active_shards: 3896: relocating_shards: 0: initializing_shards: 0: unassigned_shards: 0 [05:26:14] Jeff_Green: You up still? [05:26:42] I get a fatal error on changing user rights as a steward. [05:26:55] ? [05:26:57] PHP fatal error in /usr/local/apache/common-local/php-1.23wmf13/includes/specials/SpecialUserrights.php line 155: [05:26:57] Call to undefined method UserRightsProxy::clearInstanceCache() [05:27:40] ohhh. I'm sort of tied up at the moment with an OTRS upgrade. [05:27:49] Oh okay. :) [05:27:55] I reported it already, sDrewthedoff. [05:27:59] ok cool [05:28:03] https://bugzilla.wikimedia.org/show_bug.cgi?id=61252 [05:28:18] jeff it seems to be a wmf13 issue [05:28:24] and only xwiki [05:28:25] ohhh. I'm sort of tied up at the moment with an OTRS upgrade. [05:28:38] This is a Reedy thing. [05:28:47] Or Tim. [05:28:48] Reedy:. [05:28:51] TimStarling [05:28:57] Thanks, Bsadowski1. [05:29:03] Gloria! :D [05:29:20] How did you all not notice until now? [05:29:20] everything will work as long as we don't need to change any xwiki rights [05:29:31] what is xwiki? [05:29:35] Crosswiki. [05:29:39] Like X-mas, or something. [05:29:43] Gloria: wmf13 only rolled five hours ago [05:29:49] TimStarling: https://bugzilla.wikimedia.org/show_bug.cgi?id=61252 [05:31:26] Gloria: changing xwiki rights is less often undertaken task [05:31:43] sDrewthedoff: Right, right. Sorry, I thought the deploy was longer ago. [05:32:23] Gloria: I don't change user rights 24/7. [05:32:28] :| [05:32:32] * sDrewthedoff is magnaminous and forgives [05:33:33] I think I remember this change. [05:33:35] To the code. [05:33:55] It was something to do with a rights conflicts or something, I think. [05:34:31] that is where I fail, I wouldn't knwo where to start looking [05:35:00] https://gerrit.wikimedia.org/r/#/c/110869/1/includes/specials/SpecialUserrights.php,unified [05:35:04] I think it's that. [05:35:57] "Note: I can't reproduce the issue or verify this patch." [05:36:03] that looks a pretty goo dmatch to the error [05:36:49] I'm still not sure why https://bugzilla.wikimedia.org/show_bug.cgi?id=38989 can't just use the same logic as edits. [05:36:57] With a start timestamp. [05:37:55] "Beware of bugs in the above code; I have only proved it correct, not tried it." :) [05:38:22] Gloria: We were trying rights at different wikis. [05:38:24] Not the same. [05:38:29] How could it be conflicting? [05:38:46] I tried to give myself rights at mediawikiwiki. [05:39:00] Bsadowski1: That code change was made to address a separate issue. [05:39:07] The separate issue should be solved differently. [05:39:23] in reflection, yes [05:39:23] And that code change likely caused your issue. [05:39:34] Are you sure? [05:39:48] No. That's why I said likely. [05:40:56] Gloria, and 38989 was due to you and I rights dancing at mw [05:41:06] sDrewthedoff: Indeed. :-) [05:41:36] legoktm: Are you a shell user yet? [05:41:43] I'm not. [05:41:46] Lame. [05:41:48] how urgent is it? [05:41:51] I can look at exception logs though [05:42:01] no xwiki rights changes until fixed [05:42:20] do you want it fixed in 1 minute or 10? [05:42:22] so as I said earlier, as long as we have no spambots and other ballistic issues, we can wait [05:42:30] 10-60-120 are all okay [05:43:09] TimStarling: 2 hours is fine, longer is okay [05:43:30] up to one day is acceptable, not preferable [05:43:33] That's quite the negotiation tactic. [05:44:14] legoktm: I think a straight revert is fine for now. [05:44:19] Gloria: always try to be reasonable and practicable [05:44:22] oh [05:44:25] I have the fix ready [05:44:27] agreed [05:45:13] [09:45:06 PM] (PS1) Legoktm: Only call ->clearInstanceCache() if $targetUser instanceof User [core] - https://gerrit.wikimedia.org/r/112847 [05:45:53] Special:GlobalGroupMembership is also probably broken [05:46:04] No topic, -1. [05:46:17] legoktm: need it tested? [05:46:30] I think it is better to introduce clearInstanceCache() to UserRightsProxy [05:46:36] I am testing a fix along those lines [05:47:27] We'd also need to add it to CentralAuthUser too then [05:47:46] that's what you get for duplicating code [05:48:38] legoktm: yes, ... Call to undefined method CentralAuthGroupMembershipProxy::clearInstanceCache() [05:48:52] thought so [05:49:25] globalgroup membership is less likely to have duelling rights [05:49:42] the other was local <-> central duel [05:52:30] $user = self::newFromId( $wikiID, $id ); // FIXME: newFromId is undefined [05:52:32] * legoktm sighs [05:52:45] Louder. [05:53:04] Hi pakaran. [05:58:13] taking too long [05:58:31] is legoktm's patch tested? [05:58:51] I tested that Special:GlobalGroupMembership works [05:59:14] what's the worst thing that can happen? [05:59:14] I don't have cross-wiki userrights set up, so that part isn't tested [05:59:25] !worstcase ;) [06:05:55] !log tstarling synchronized php-1.23wmf13/includes/specials/SpecialUserrights.php [06:06:03] Logged the message, Master [06:06:39] sDrewthedoff, Bsadowski1: ^ [06:06:46] here for testing, and if it fails, hardly a mega issue [06:07:10] testing [06:08:14] we have a win [06:08:21] thx legoktm [06:08:23] :) [06:08:34] will test a little further soon [06:08:55] and s:ggm is a tick too [06:09:13] Special:GlobalGroupMembership [06:14:38] (03PS1) 10TTO: Initial setup for legalteamwiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112850 [06:28:35] !log restarted apache on mw1184 [06:28:43] Logged the message, Master [06:29:20] RECOVERY - Apache HTTP on mw1184 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.058 second response time [06:41:14] (03CR) 10TTO: "recheck (see bug 61246)" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112850 (owner: 10TTO) [06:57:17] (03PS1) 10Ryan Lane: Code documentation for trebuchet's deployment module [operations/puppet] - 10https://gerrit.wikimedia.org/r/112855 [07:11:50] (03CR) 10TTO: "recheck" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112850 (owner: 10TTO) [07:20:43] !log aaron synchronized php-1.23wmf13/includes/filebackend/SwiftFileBackend.php '288941aa53adffb7de1af8ff4fbae4c1f0c26937 and 1bb7ef5a7816a57b38caa4a302fbff3e53f16d16' [07:20:49] Logged the message, Master [07:25:57] ori: https://upload.wikimedia.org/wikipedia/en/thumb/0/02/Barbie_and_the_Magic_of_Pegasus_poster.jpg/402px-Barbie_and_the_Magic_of_Pegasus_poster.jpg [07:25:59] ugh [07:27:37] the pegasus in the distant background knows something we don't, clearly [07:28:32] who says it is in the background, maybe it is a faerie on a mini-pony [07:28:53] maybe it clearer when you see it in 3D [07:29:03] perspective is everything [07:30:44] * ori gets that Thai food. [08:19:34] (03PS3) 10GWicke: Remove old parsoid init script [operations/puppet] - 10https://gerrit.wikimedia.org/r/112640 [08:32:50] PROBLEM - ElasticSearch health check on elastic1009 is CRITICAL: CRITICAL - elasticsearch (production-search-eqiad) is running. status: red: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1340: active_shards: 3895: relocating_shards: 6: initializing_shards: 1: unassigned_shards: 0 [08:32:50] PROBLEM - ElasticSearch health check on elastic1004 is CRITICAL: CRITICAL - elasticsearch (production-search-eqiad) is running. status: red: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1340: active_shards: 3895: relocating_shards: 6: initializing_shards: 1: unassigned_shards: 0 [08:32:51] PROBLEM - ElasticSearch health check on elastic1012 is CRITICAL: CRITICAL - elasticsearch (production-search-eqiad) is running. status: red: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1340: active_shards: 3895: relocating_shards: 6: initializing_shards: 1: unassigned_shards: 0 [08:33:50] RECOVERY - ElasticSearch health check on elastic1004 is OK: OK - elasticsearch (production-search-eqiad) is running. status: green: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1341: active_shards: 3896: relocating_shards: 6: initializing_shards: 0: unassigned_shards: 0 [08:33:51] RECOVERY - ElasticSearch health check on elastic1009 is OK: OK - elasticsearch (production-search-eqiad) is running. status: green: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1341: active_shards: 3896: relocating_shards: 6: initializing_shards: 0: unassigned_shards: 0 [08:33:51] RECOVERY - ElasticSearch health check on elastic1012 is OK: OK - elasticsearch (production-search-eqiad) is running. status: green: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1341: active_shards: 3896: relocating_shards: 6: initializing_shards: 0: unassigned_shards: 0 [08:35:01] * apergos raises an eyebrow [08:35:08] concise and actionable, no? [08:35:18] i like the color-coding [08:35:24] I don't have color coding [08:35:32] it's spelled out [08:35:39] unless you mean it has a color in th line which I would have to dig out [08:35:40] status: red [08:37:35] (03CR) 10Alexandros Kosiaris: [C: 032] Remove old parsoid init script [operations/puppet] - 10https://gerrit.wikimedia.org/r/112640 (owner: 10GWicke) [08:42:50] PROBLEM - Parsoid on wtp1004 is CRITICAL: Connection refused [08:43:42] morning [08:46:21] hashar: good morning to you too [08:48:13] that is interesting... parsoid listening on another port [08:48:17] hmmm [08:48:59] (03PS1) 10Matanya: zuul: remove lookupvar and replace with same scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112862 [08:49:10] RECOVERY - Parsoid on wtp1004 is OK: HTTP OK: HTTP/1.1 200 OK - 970 bytes in 0.004 second response time [08:49:18] hashar: morning ^ :) [08:49:58] got restart comp [08:49:59] !log restarted parsoid on wtp1004. After https://gerrit.wikimedia.org/r/112640 is was for some reason listening on 56547 and not 8000. The restart fixed it [08:50:08] Logged the message, Master [08:51:41] akosiaris: I think parsoid attempt to listen on whatever port is configured (8000) and if it is busy listen on another port [08:51:49] or might be related to some env variable not being set properly [08:51:58] why would it do that ? [08:52:04] what sense does it make ? [08:52:20] I think that is for testing/dev purposes [08:52:44] you probably want to bug fill it [08:53:15] api/ParsoidService.js: var port = process.env.VCAP_APP_PORT || process.env.PORT || 8000; [08:53:33] I already mailed gwicke. I 'll wait his assesment. And if it confirms yours, a bug it is [08:53:50] so environment variables with the fallback being 8000 [08:54:09] yeah and VCAP_APP_PORT is for a cloud service ( https://www.appfog.com ) [08:54:11] but a restart had it to 8000 and not 56657 (or whatever it was) [08:54:25] 56547 [08:54:25] I am pretty sure some folks were talking about that env variable yesterday night [08:54:51] I guess 56547 would be a port assigned automatically whenever you open a listen socket on port 0 [08:55:00] i.e. the OS gives you a free port [08:55:02] yeah, an ephemeral port [08:55:23] so might be because one of the env variable is unproperly set [08:55:49] so one of those variables being set to 0 ... [08:55:59] * | 92b3b1d - Rename PARSOID_PORT to PORT and PARSOID_INTERFACE to INTERFACE (11 hours ago) [08:56:00] but it must also be some race [08:56:01] nice [08:56:27] how do we know which parsoid we are running btw ? [08:56:33] git log in the deploy repo ? [08:57:04] I think it is deployed using git deploy, so potentially by inspecting the parsoid process running, you can find the path it runs from and then look at that git repo [08:57:32] that is a bit messy :( [08:57:40] it it git-deploy indeed [08:57:44] it is* [08:57:56] i was hoping something more generic though [08:57:58] anyway [08:59:08] seems like we are a bit before that commit hashar [08:59:10] Gabriel is working on a debian package to ship parsoid + its node modules [08:59:23] yeah I know [08:59:28] akosiaris: yup probably, but the init / upstart conf might already have been updated in puppet [08:59:36] potentially leading to the env var not being set properly somehow [08:59:44] could be [09:03:15] (03CR) 10Hashar: "I keep forgetting why we had/have to use scope.lookupvar(). Apparently the @statsd_host should just work." [operations/puppet] - 10https://gerrit.wikimedia.org/r/112862 (owner: 10Matanya) [09:03:22] at least it is monitored [09:03:49] akosiaris: easy one to review ^^ ? :) [09:07:05] scope.lookupvar() must die apparently [09:07:16] if you get it merged, i can apply the fix on Zuul server right away [09:08:45] heh. scope.lookupvar in the same scope ? [09:09:06] can't remember the details [09:09:35] probably caused by puppet noobiness back in that time :-] [09:10:12] merging [09:10:36] (03CR) 10Alexandros Kosiaris: [C: 032] zuul: remove lookupvar and replace with same scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112862 (owner: 10Matanya) [09:11:08] thanks akosiaris :) [09:11:26] running puppet [09:13:22] nothing changed \O/ [09:21:08] (03CR) 10Tim Landscheidt: "@ottomata: Please adjust the commit message in such cases in the future then :-). Seeing "DO NOT MERGE" in the log of merged changes can " [operations/puppet] - 10https://gerrit.wikimedia.org/r/111523 (owner: 10Ottomata) [09:58:43] YuviPanda: do you do mobile stuff ? [09:58:51] some of it, matanya [09:58:53] why [09:59:00] have you seen https://play.google.com/store/apps/details?id=animaonline.android.wikiexplorer ? [09:59:34] matanya: looking [09:59:43] matanya: ah, I haven't! [09:59:47] this one is up my alley [09:59:48] installing [09:59:57] you can grabe some ideas for it, really a great one :) [10:00:51] was published yesterday iirc [10:01:13] matanya: thanks! [10:01:26] my pleasure :) [10:01:29] matanya: the wikipedia app reboot is going to be out in a month or two. Will have editing! [10:01:38] YAY! [10:02:01] looking forward for it, the current one annoyies me a lot ... [10:02:27] matanya: yup. do you have an android device? [10:02:37] yes, i do [10:02:41] (03CR) 10Alexandros Kosiaris: [C: 032] Bind nrpe to a primary address [operations/puppet] - 10https://gerrit.wikimedia.org/r/112698 (owner: 10Alexandros Kosiaris) [10:02:44] matanya: we put out builds every other wednesday. I can get you one now to try it out with [10:02:46] matanya: what version of Android? [10:03:09] last time i rooted it, i through 4.0 [10:03:17] but didn't change that in months [10:03:28] so it is probably still it [10:03:49] matanya: that's fair enough. can you give me your email address so I can mail you a recent build? [10:03:58] the last public one was 2 weeks ago, a lot has changed since then [10:04:57] matanya: sent! [10:05:00] matanya: it has rough edges still [10:05:27] thanks a lot, i'll test and push you bugs :) RTL would be the first most likely [10:06:07] matanya: :D [10:06:16] matanya: I already did an RTL pass with aharoni a few weeks ago [10:06:28] matanya: so it shouldn't be *too* bad, but I am sure it has edges [10:06:36] ah, stole my thunder [10:06:59] when i'll get home tonight i'll give it a shot [10:07:39] matanya: :) [10:07:50] matanya: I tried out WikiExplorer. Honestly even unfinished I like our app better :P [10:07:57] even without the editing [10:08:13] why may i ask? [10:08:52] matanya: their font rendering seems terrible, and they seem to be stripping inline styles not too properly [10:08:59] matanya: plus if I tap a link, I have to tap again to actually read it [10:09:37] matanya: wikigrid is an interesting concept but thoroughly unoptimized for a mobile device [10:09:53] makes sense [10:10:19] matanya: the 'you have to tap twice to follow links' got annoying really fast [10:10:44] didn't note it until you memtioned it [10:11:27] matanya: the font rendering is also 'off'. I don't know why [10:11:54] matanya: they have attempted to restyle infoboxes and such, but feels a bit meh. [10:12:20] PROBLEM - ElasticSearch health check on elastic1006 is CRITICAL: CRITICAL - elasticsearch (production-search-eqiad) is running. status: red: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1341: active_shards: 3898: relocating_shards: 6: initializing_shards: 1: unassigned_shards: 0 [10:13:09] matanya: it's also a trademark violation. I'll usually be ok with letting it go, but there are ads on it... [10:13:20] RECOVERY - ElasticSearch health check on elastic1006 is OK: OK - elasticsearch (production-search-eqiad) is running. status: green: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1341: active_shards: 3898: relocating_shards: 1: initializing_shards: 0: unassigned_shards: 0 [10:13:37] so poke legal [10:14:22] matanya: might. [10:15:46] matanya: done [10:17:30] matanya: I'm now super interested in having you try out the app I link I gave you :) [10:18:49] if you will still be aroud at approx 17:00 UTC you will be able to find some feedback [10:18:57] * YuviPanda does some math [10:19:04] matanya: yup, will definitely be around [10:19:31] ok, i'll look for you [10:19:34] matanya: thanks :) [10:19:46] matanya: I'll send you another build if I fix some CentralAuth issues in the meantime :) [10:19:54] great, thanks [10:20:35] matanya: :) [10:28:18] YuviPanda: you won't be happy :P [10:28:25] hmm? [10:28:27] i showed it to a fellow at work [10:28:39] 'it' -> ? [10:28:44] the app [10:28:59] oh, the build I sent you? [10:29:02] he said: "on english it is great, but on hebrew it is bad" [10:29:05] yes [10:29:21] matanya: heh :D I *am* happy, because we have people who care and who can tell me what's wrong and then I can fix it :) [10:29:46] i'll through his comments at you [10:29:56] and you decide what to do with it, fair? [10:30:06] matanya: sure! [10:30:12] but we should probably move to some -mobile channel [10:30:16] sure [10:30:18] #wikimedia-mobile [10:30:19] is there one? [10:33:50] PROBLEM - ElasticSearch health check on elastic1004 is CRITICAL: CRITICAL - elasticsearch (production-search-eqiad) is running. status: red: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1341: active_shards: 3898: relocating_shards: 2: initializing_shards: 1: unassigned_shards: 0 [10:34:50] RECOVERY - ElasticSearch health check on elastic1004 is OK: OK - elasticsearch (production-search-eqiad) is running. status: green: timed_out: false: number_of_nodes: 11: number_of_data_nodes: 11: active_primary_shards: 1342: active_shards: 3901: relocating_shards: 0: initializing_shards: 0: unassigned_shards: 0 [11:01:20] (03PS1) 10Patchon: Adding new configuration option "listenLocalOnly" [operations/debs/lucene-search-2] - 10https://gerrit.wikimedia.org/r/112871 [11:05:18] (03PS1) 10Matanya: base: remove lookupvar and replace with top scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112872 [11:12:15] akosiaris: is realm a top scope var that can be used in .erb templates as @realm ? [11:12:56] matanya: exactly [11:13:20] but do avoid referencing it deep inside modules please [11:13:29] oh, a big new batch of commits coming then :) [11:13:37] it more like for role classes [11:13:48] no, only fixing exsiting ones [11:14:11] can't see scope.lookupvar all over the place [11:18:55] (03PS1) 10Matanya: search: remove lookupvar and replace with top scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112876 [11:52:01] PROBLEM - HTTP 5xx req/min on tungsten is CRITICAL: CRITICAL: reqstats.5xx [crit=500.000000 [12:13:25] paravoid: regarding hk --> commons issue, have a sec? [12:32:43] (03PS1) 10Matanya: nginx: remove lookupvar and replace with top scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112881 [12:44:26] gone for a little while (lunch out) [12:46:03] (03PS1) 10Matanya: torrus: remove lookupvar and replace with top scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112884 [12:52:14] (03PS1) 10Matanya: swoft: remove lookupvar and replace with fact @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112885 [13:02:01] RECOVERY - HTTP 5xx req/min on tungsten is OK: OK: reqstats.5xx [warn=250.000 [13:07:19] (03CR) 10MaxSem: [C: 04-1] "lsearchd is curently in a (very) maintenance mode. Third-party users are not recommended to use it, instead they're encouraged to try the " (033 comments) [operations/debs/lucene-search-2] - 10https://gerrit.wikimedia.org/r/112871 (owner: 10Patchon) [13:09:58] (03CR) 10MaxSem: "s/connect to/bind to/" [operations/debs/lucene-search-2] - 10https://gerrit.wikimedia.org/r/112871 (owner: 10Patchon) [13:19:00] (03PS1) 10Matanya: misc: remove lookupvar and replace with top scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112887 [13:38:33] (03PS1) 10Matanya: ganglia: remove lookupvar and replace with top scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112888 [13:44:10] PROBLEM - HTTP 5xx req/min on tungsten is CRITICAL: CRITICAL: reqstats.5xx [crit=500.000000 [13:45:45] (03CR) 10Patchon: ">> lsearchd is curently in a (very) maintenance mode. Third-party users are not recommended to use it, instead they're encouraged to try t" [operations/debs/lucene-search-2] - 10https://gerrit.wikimedia.org/r/112871 (owner: 10Patchon) [13:45:51] (03PS1) 10Matanya: ganglia_new: remove lookupvar and replace with top scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112889 [13:50:25] (03PS1) 10Matanya: solr: remove lookupvar and replace with top scope @ var [operations/puppet] - 10https://gerrit.wikimedia.org/r/112892 [14:12:19] apergos: i have over 30 patches pending review, mind reduce that a bit? [14:17:34] (03CR) 10Matanya: (WIP) Completely overhaul admins.pp & modularize (032 comments) [operations/puppet] - 10https://gerrit.wikimedia.org/r/107848 (owner: 10Faidon Liambotis) [14:20:08] matanya: how old is the oldest? [14:20:48] about month i guess [14:21:13] jan 13 [14:21:27] a month tommorow [14:31:25] not too bad [14:42:21] Nemo_bis: i don't how that is not too bad, but meh [14:42:51] *don't see [14:46:11] https://www.mediawiki.org/wiki/Gerrit/Reports/Oldest_open_changesets [14:46:47] You do hard and needed work so ops review your stuff quickly, but most are not so lucky. ;) [14:53:23] (03CR) 10Ottomata: [C: 031] "Oh, I totally read that patchset wrong. This looks good to go." [operations/puppet] - 10https://gerrit.wikimedia.org/r/112613 (owner: 10Dzahn) [14:57:26] matanya: for simple changes such as the lookupvar/@ thing, you can just submit a single patchset that touches multiple modules [14:57:48] thanks paravoid didn't know that [14:58:07] (03CR) 10ArielGlenn: "actually this is already handled; sahar was already in admins.pp, and if he gets stat1002 access he doesn't need stat1... he also got adde" [operations/puppet] - 10https://gerrit.wikimedia.org/r/112613 (owner: 10Dzahn) [14:59:17] !log just finished in place (Cirrus) reindex of all group1 wikis except commons [14:59:25] Logged the message, Master [15:05:12] (03PS1) 10Faidon Liambotis: Switch Hong Kong, Macao & Bhutan to ulsfo [operations/dns] - 10https://gerrit.wikimedia.org/r/112897 [15:06:39] (03CR) 10Faidon Liambotis: [C: 032] Switch Hong Kong, Macao & Bhutan to ulsfo [operations/dns] - 10https://gerrit.wikimedia.org/r/112897 (owner: 10Faidon Liambotis) [15:11:26] Ops, do you have opinions on how to handle bounces in MediaWiki? https://bugzilla.wikimedia.org/show_bug.cgi?id=46640#c4 [15:11:45] Or should this be asked on the ops list? :) [15:17:10] RECOVERY - HTTP 5xx req/min on tungsten is OK: OK: reqstats.5xx [warn=250.000 [15:46:08] (03CR) 10Ottomata: "Looks good, only one suggestion (and an idea!)" (032 comments) [operations/puppet] - 10https://gerrit.wikimedia.org/r/112855 (owner: 10Ryan Lane) [15:59:44] apergos: do you deal with media storage wierdnesses? [15:59:59] a little of it [16:00:03] what's up? [16:00:10] compare https://he.wikipedia.org/wiki/%D7%A6%D7%94%D7%9C%D7%94 [16:00:20] with file page: https://he.wikipedia.org/wiki/%D7%A7%D7%95%D7%91%D7%A5:Zahala.jpg [16:00:32] with the full lengh file: https://upload.wikimedia.org/wikipedia/he/7/75/Zahala.jpg [16:01:35] what should I be seeing? [16:01:49] wrong rotationg? 90* [16:02:33] they are all vertical except for the oldest thumb for me [16:02:52] I am logged in viewing them [16:03:35] ok, so anon complaing it wrong, caching issue? [16:04:39] yes, and maybe a purge of the image would get it [16:05:12] (03CR) 10Faidon Liambotis: "Why aren't you doing this on the MediaWiki side?" [operations/puppet] - 10https://gerrit.wikimedia.org/r/112805 (owner: 10Yurik) [16:06:40] (03CR) 10Yurik: "Because many responses (especially API) are not varied on X-CS." [operations/puppet] - 10https://gerrit.wikimedia.org/r/112805 (owner: 10Yurik) [16:19:51] (03CR) 10Chad: "I see no harm in merging this if it helps someone in the short term, once the style nits are taken care of that Max pointed out." [operations/debs/lucene-search-2] - 10https://gerrit.wikimedia.org/r/112871 (owner: 10Patchon) [16:20:38] <^d> MaxSem: I'll probably merge ^ if he amends to fix your style complaints. It's basically no big deal. [16:21:09] no objctions from me [16:21:33] I just had to warn him that we don't care [16:21:59] thanks [16:22:08] -1 was only on stylistics grounds:) [16:22:53] <^d> MaxSem: Yeah, we don't care, but it's harmless :) [16:43:20] RECOVERY - Host labnet1001 is UP: PING OK - Packet loss = 0%, RTA = 1.04 ms [17:14:33] !log stopping hadoop daemons on an12, oracle java 6 is running there, don't know why yet, apt says it is not installed….grrr [17:14:41] Logged the message, Master [17:14:43] !log demon synchronized php-1.23wmf12/extensions/CirrusSearch 'Performance fixes' [17:14:52] Logged the message, Master [17:15:14] !log demon synchronized php-1.23wmf13/extensions/CirrusSearch 'Performance fixes' [17:15:22] Logged the message, Master [17:17:40] PROBLEM - Hadoop DataNode on analytics1012 is CRITICAL: PROCS CRITICAL: 0 processes with command name java, args org.apache.hadoop.hdfs.server.datanode.DataNode [17:17:50] PROBLEM - Hadoop NodeManager on analytics1012 is CRITICAL: PROCS CRITICAL: 0 processes with command name java, args org.apache.hadoop.yarn.server.nodemanager.NodeManager [17:20:50] ottomata is on it: ^^ [17:21:04] (03PS1) 10ArielGlenn: Add shell account for santosh, admins restricted + stats1002 [operations/puppet] - 10https://gerrit.wikimedia.org/r/112912 [17:22:16] (03PS3) 10Se4598: Replace easter egg by a more explaining message [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/110926 (owner: 10Hashar) [17:23:30] (03PS4) 10Se4598: Replace easter egg by a more explaining message [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/110926 (owner: 10Hashar) [17:25:40] RECOVERY - Hadoop DataNode on analytics1012 is OK: PROCS OK: 1 process with command name java, args org.apache.hadoop.hdfs.server.datanode.DataNode [17:25:50] RECOVERY - Hadoop NodeManager on analytics1012 is OK: PROCS OK: 1 process with command name java, args org.apache.hadoop.yarn.server.nodemanager.NodeManager [17:25:54] ok [17:26:08] !log purged oracle java 6 from analytics1012 and restarted hadoop daemons [17:26:16] Logged the message, Master [17:26:21] (03PS2) 10Nemo bis: Add shell account for santosh, admins restricted + stats1002 [operations/puppet] - 10https://gerrit.wikimedia.org/r/112912 (owner: 10ArielGlenn) [17:30:36] hey Ryan_Lane, when I run git deploy start [17:30:40] should that create a .deploy file? [17:33:48] yo jgage, yt? [17:45:47] !log Updated scholaships.wm.o to fbffb96; l10n updates [17:45:55] Logged the message, Master [17:46:01] greg-g: Done [17:46:10] bd808: yay [17:50:11] (03PS1) 10Dzahn: lower TTL for bugzilla, upcoming switch [operations/dns] - 10https://gerrit.wikimedia.org/r/112915 [17:50:24] greg-g: fyi: i'm not igniring your email re deployment calendar - been a crazy week - sorry [17:52:26] (03CR) 10Dzahn: "in case you're wondering why, see f.e. http://techtalk.virendrachandak.com/migrating-servers-using-dns-ttl-for-minimum-downtime/" [operations/dns] - 10https://gerrit.wikimedia.org/r/112915 (owner: 10Dzahn) [17:53:05] Lydia_WMDE: no worries :) [17:53:06] thaks [17:53:07] +n [18:14:24] manybubbles: elastic1007 is fixed [18:14:44] cmjohnson1: sweet! [18:15:00] I'll get it back in the loop! [18:16:47] should i just drop bugs.wikiPedia.org [18:16:56] or is it important to redirect.. hrmm hrmm [18:17:26] (we already have bugzilla.wikipedia.org, bugzilla.wikimedia.org, bugs.mediawiki.org etc ...) [18:18:03] it's more of an annoyance now .. [18:19:06] mutante: I say drop it; feel free to ignore me tho :-P [18:20:00] (03Draft6) 10Alexandros Kosiaris: PostgreSQL/Postgis module [operations/puppet] - 10https://gerrit.wikimedia.org/r/112469 [18:20:15] mutante: drop all [18:20:21] andre__ will be happy [18:20:36] I just don't care :P [18:20:37] no more bugs to worry about [18:20:42] heh, ok, thanks guys, the mantra to not kill old URLs is strong ,,, BUT [18:20:45] akosiaris: weeeee!!!!! [18:20:48] [Bug 14546] New: Remove bugs.wikipedia.org redirect - Archivum.info [18:20:52] duh :p [18:20:56] uh [18:21:03] this is what is among the 4(sic!) results i found with [18:21:10] inurl:bugs.wikipedia.org [18:21:17] break! break! break! [18:21:59] We will be completely bug-free in about 6 hours once we've purged the Bugzilla database for the update anyway ;) [18:22:01] [18:22:06] hehehe [18:22:49] Ok, now how do I interpret that message andre__ [18:22:57] you did not open the tag [18:23:02] :P [18:23:06] parser break :p [18:23:12] 500 [18:23:19] revert, revert [18:23:33] twkozlowski, after a few decades of existence I decided to become serious. Next step might be to grow up. [18:23:47] :)) [18:23:53] LOL andre__ [18:24:00] andre__: liar [18:25:56] mutante: I saw links to bugs.wikipedia.org around [18:26:31] sih [18:26:33] sigh :) [18:26:52] i was about to change the patch set [18:32:14] Nemo_bis: want to delete per https://bugzilla.wikimedia.org/show_bug.cgi?id=14546 :) [18:32:28] it shall not be confused with bug.wp , which is a language [18:33:08] Close the wiki! [18:33:16] lol [18:33:43] if you ask for wiki renaming next ... [18:34:03] nah, just delete and drop the tables [18:34:22] a bug based on "I mistyped the URL once" is not particularly convincing [18:35:00] Nemo_bis: [18:35:03] Wow :p [18:35:24] Nemo_bis: Don't delete the wiki! It has important 1 liners about French Communes :p [18:36:12] oh come on .. i also don't care and i already know no matter what i do today there will be a bug, so..i'm picking one [18:36:28] JohnLewis: I hadn't thought about it, you're right! A bug speaker might be saved by that information while travelling clueless in France! [18:36:51] but we're not being helpful to Daniel with our chatter :( [18:37:13] haha [18:37:23] :) it is kind of amusing though [18:37:45] Nemo_bis: No one cares about Daniel >:D [18:38:40] fair, you do the Apache change [18:39:11] mutante: Tell me what to do and I'll try my best to break it like my last patch did :p [18:39:24] (03Abandoned) 10Dzahn: point bugs. and bugzilla. within wikiPedia to -lb [operations/dns] - 10https://gerrit.wikimedia.org/r/109868 (owner: 10Dzahn) [18:39:42] greg-g, seems like i need to push a minor js change out [18:39:59] yurik: k, you have time now [18:40:03] ok [18:40:06] JohnLewis: i don't know what to do , i just asked you guys [18:40:26] mutante: What was the question again you asked? [18:40:28] as others have on the multiple bugs [18:40:32] lol, ttyl [18:40:42] Cya mutante. We all love you really :D [18:40:43] wow, it seems apache has been crashing steadily and massively for the past few days :( greg-g, you know about that? [18:40:51] .... [18:41:00] ? [18:41:07] ~300 seg faults in fatalmonitor [18:41:08] yurik: please tell me more [18:41:20] i saw that almost a week ago i think [18:41:30] pfft, i [18:42:08] so much about the beloved community consensus [18:42:32] Those are usually PCRE segfaults [18:42:45] We might have tweaked a regex in MW in a way that exposes a PCRE bug again [18:43:24] Usually the problem is excessive worst-case backtracking exhausting the stack (like, the actual stack in memory) [18:44:08] IIRC libpcre either doesn't have a stack limit option, or the option it has is unusable in that any reasonable value cripples other, sane regexes [18:48:45] hey, does anyone here have sysop privs on metawiki for deleting a stale Zero config? [18:51:15] dr0ptp4kt, i can do it i think [18:51:29] dr0ptp4kt, which page? [18:51:31] yurik, ok, 405-25 per dan [18:55:46] !log yurik synchronized php-1.23wmf12/extensions/ZeroRatedMobileAccess/ [18:55:53] Logged the message, Master [18:58:17] !log yurik synchronized php-1.23wmf13/extensions/ZeroRatedMobileAccess/ [18:58:25] Logged the message, Master [19:00:28] greg-g, done [19:08:22] (03PS1) 10Dzahn: switch bugzilla.wikipedia.org to wikimedia-lb [operations/dns] - 10https://gerrit.wikimedia.org/r/112930 [19:14:36] can I just point out how much I hate gnome shell. seriously. [19:14:59] one more week and if it doesn't shape up I jump ship (kde, mint, whatever) [19:16:04] (03PS1) 10Dzahn: drop bugs.wikipedia.org [operations/dns] - 10https://gerrit.wikimedia.org/r/112932 [19:16:15] apergos: is there a release planned in a week, or was that a safe ultimatum? :) [19:17:39] (03CR) 10Dzahn: "dropping this but still fixing bugzilla.wikipedia.org SSL cert error in I303339f00178eeca4c95a6baaca74bb9a6c8cf71" [operations/dns] - 10https://gerrit.wikimedia.org/r/112932 (owner: 10Dzahn) [19:17:42] mw1115 mediawikiwiki ApiWikiLove::saveInDb 10.64.16.27 1146 Table 'mediawikiwiki.wikilove_log' doesn't exist [19:18:01] Reedy: ? [19:18:17] haven't see him all day [19:20:05] Error: undefined at line undefined: [object Event] [19:20:14] that's a helpful message [19:20:48] no release, it was an 'I am gonna start smashing hardware if something doesn't improve and I figure about one week I will run out of nervers including my last one' [19:21:14] [sn]erver?S [19:21:24] nerves [19:21:44] as in 'I've got one nerve left and gnome-shell's getting on it' [19:25:26] * twkozlowski hugs apergos [19:25:36] :-) [19:25:42] to quote a persona that shall remain unnamed: "apergos? apergos is cool :-)" [19:25:51] awww1 [19:25:52] ! [19:26:50] apergos is awsome :) [19:28:56] * apergos takes a bow [19:29:13] * twkozlowski proclaims Feb 12 to be the Apergos Appreciation Day :-) [19:29:36] hm, I don't know if I'm ready for one of those yet :-D [19:32:18] Or maybe it's time to pick up Sumana's idea of an appreciation thread again [19:47:17] (03PS1) 10Ottomata: Putting analytics users on analytics1027 so they can ssh tunnel to hue [operations/puppet] - 10https://gerrit.wikimedia.org/r/112937 [19:47:30] (03CR) 10Ottomata: [C: 032 V: 032] Putting analytics users on analytics1027 so they can ssh tunnel to hue [operations/puppet] - 10https://gerrit.wikimedia.org/r/112937 (owner: 10Ottomata) [19:48:05] !log when elastic1007 came back Elasticsearch wasn't able to balance shards to my liking so I'm forcing it to shuffle them some by temporarily lowering the disk thresholds to 75/80 instead of 85/95 [19:48:13] Logged the message, Master [20:07:34] (03PS2) 10Dzahn: lower TTL for bugzilla, upcoming switch [operations/dns] - 10https://gerrit.wikimedia.org/r/112915 [20:12:02] (03CR) 10Dzahn: [C: 032] lower TTL for bugzilla, upcoming switch [operations/dns] - 10https://gerrit.wikimedia.org/r/112915 (owner: 10Dzahn) [20:13:09] !log DNS update - lowering bugzilla TTL [20:13:17] Logged the message, Master [20:13:26] mutante :D [20:19:03] (03PS2) 10Dzahn: switch bugzilla.wikipedia.org to wikimedia-lb [operations/dns] - 10https://gerrit.wikimedia.org/r/112930 [20:23:49] !log csteipp started scap: bug 60771 [20:23:56] Logged the message, Master [20:59:35] (03PS1) 10Ottomata: Adding git-fat support for trebuchet deployment [operations/puppet] - 10https://gerrit.wikimedia.org/r/112944 [21:00:47] !log csteipp finished scap: bug 60771 (duration: 36m 58s) [21:00:53] (03CR) 10jenkins-bot: [V: 04-1] Adding git-fat support for trebuchet deployment [operations/puppet] - 10https://gerrit.wikimedia.org/r/112944 (owner: 10Ottomata) [21:00:55] Logged the message, Master [21:01:40] (03CR) 10Dzahn: [C: 032] "just influences wikiPedia, not wikiMedia and let's the cluster do the redirect to fix a cert warning" [operations/dns] - 10https://gerrit.wikimedia.org/r/112930 (owner: 10Dzahn) [21:01:56] (03CR) 10Ottomata: Adding git-fat support for trebuchet deployment (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/112944 (owner: 10Ottomata) [21:02:04] Ryan_Lane: ^ :) [21:02:45] !log DNS update - switch bz.wp to cluster redirect [21:02:54] Logged the message, Master [21:08:16] (03CR) 10Dzahn: "fixed SSL cert warning. openssl s_client -connect bugzilla.wikipedia.org:443 used to get wrong cert, now gets cluster wp cert then redire" [operations/dns] - 10https://gerrit.wikimedia.org/r/112930 (owner: 10Dzahn) [21:14:59] !log updated parsoid to deploy repo 77f4aaf2 and code repo 96c1274 [21:15:06] Logged the message, Master [21:15:16] service-restart returns an error [21:15:42] can any root do a manual parsoid restart? [21:15:51] dsh -g parsoid service parsoid restart [21:16:24] gwicke: ok [21:16:41] gwicke: done [21:16:49] ori, thanks! [21:17:09] logs are looking good [21:17:52] we should think about how we can get the right to restart parsoids back after the init -> upstart migration [21:18:30] (and/or fix service-restart) [21:19:00] PROBLEM - Parsoid on wtp1016 is CRITICAL: Connection refused [21:19:00] PROBLEM - Parsoid on wtp1020 is CRITICAL: Connection refused [21:19:09] gwicke: ? [21:19:12] ^,there it is again [21:19:15] do it again [21:19:19] on just those boxes [21:19:40] PROBLEM - Parsoid on wtp1019 is CRITICAL: Connection refused [21:19:45] done [21:19:58] on 19 as wel [21:20:00] l [21:20:00] RECOVERY - Parsoid on wtp1016 is OK: HTTP OK: HTTP/1.1 200 OK - 970 bytes in 0.010 second response time [21:20:00] RECOVERY - Parsoid on wtp1020 is OK: HTTP OK: HTTP/1.1 200 OK - 970 bytes in 0.009 second response time [21:20:00] last time i also had to start _some_ again [21:20:06] then looked at icinga [21:20:13] mutante: can you take over? i have to run [21:20:14] looking [21:20:19] yea [21:20:22] i did restart all the ones that alerted [21:20:22] thanks [21:20:31] alex had the same issue last night, but then wasn't able to reproduce it manually [21:20:31] use initctl to control the upstart service specifically [21:20:40] RECOVERY - Parsoid on wtp1019 is OK: HTTP OK: HTTP/1.1 200 OK - 970 bytes in 0.007 second response time [21:20:41] man initctl for details [21:20:41] that was after a puppet change [21:21:04] well, they are all up now [21:21:09] so i think its ok, right [21:23:43] all Parsoid green per Icinga (we can improve though by putting them in the hostgroups, so it's a nice overview link) [21:27:09] (03CR) 10BryanDavis: "Looking forward to seeing how this works out." (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/112944 (owner: 10Ottomata) [21:27:58] (03PS1) 10Chad: WIP: Remove unused third parameter for wikiversions [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112955 [21:29:16] the weird thing about the restart is that the init script should now be gone [21:32:26] (03PS2) 10Dzahn: drop bugs.wikipedia.org [operations/dns] - 10https://gerrit.wikimedia.org/r/112932 [21:32:54] (03CR) 10Fabriceflorin: [C: 04-1] "Hi guys," [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [21:42:56] (03CR) 10BryanDavis: WIP: Remove unused third parameter for wikiversions (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112955 (owner: 10Chad) [21:44:38] !change 112932 | Gloria [21:44:38] Gloria: https://gerrit.wikimedia.org/r/#q,112932,n,z [21:46:45] (03CR) 10Dzahn: [C: 031] "bugzilla.wp fixed, bugs should go..plz:)" [operations/dns] - 10https://gerrit.wikimedia.org/r/112932 (owner: 10Dzahn) [21:48:50] brion: how is our release looking ? [21:50:00] (03CR) 10Matanya: [C: 031] "all for it, although the holy rule of note breaking links." [operations/dns] - 10https://gerrit.wikimedia.org/r/112932 (owner: 10Dzahn) [21:50:19] gerrit must have edit button! [21:51:27] matanya: It does for *some* things :p Mainly, editing commit messages [21:52:11] are we migrating to a different bug tracking system?! eiii :D [21:52:38] you wish yurik [21:53:48] yurik: Yes, call Bugzilla x.x.x whatever version it is :p [21:54:12] andre__: Fill in my x.x.x :p Which version are you shoving bugzilla up to? [21:54:31] JohnLewis, 4.4.1 [21:54:46] and that will be different how? :-P [21:55:10] PROBLEM - Varnishkafka Delivery Errors on cp3022 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 1851.06665 [21:55:37] not in tampa, new major version, newer puppet code [21:56:00] yurik: the same way we are no longer running MediaWiki 1.12.1 [21:57:08] mutante: andre__ many thanks for this hard work on bugzilla [21:57:43] yurik: Well... 4.2.7 -> 4.4.1, A 0.1.4 increase assuming Bugzilla calls 4.2.10 4.3.0 :p [21:57:44] matanya, heh, say that after it worked out well, not before :P but thanks [21:57:58] yurik, https://bugzilla.wikimedia.org/show_bug.cgi?id=49597#c12 [21:58:03] yay! [21:58:05] p858snake|l, i would have hoped the *useful* functionality on MW has changed since 1.12.1... Afaik, there is noone out there who supports as many things as MW... and yes, we are still laging in usability for a few things, but VE and other things have been making huge improvements... With bugzilla, my feeling has been that it hasn't changed for the last 10 years ;) [21:58:19] I made it before Bugzilla goes AWWL [21:58:23] yurik, 5.0 in upstream looks surprisingly promising [21:58:48] sigh... i'll believe it when i see it :) [21:58:57] work on a comment preview mode, usernames instead of email address exposing, etc; but we'll see how much really lands, yeah [21:59:10] RECOVERY - Varnishkafka Delivery Errors on cp3022 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0 [21:59:29] andre__: Out of interest; Why 4.4.1 and not 4.5.2? [21:59:38] JohnLewis, because 4.5 is unstable [21:59:42] yet [21:59:44] and 4.1 and 4.3 etc [21:59:46] Fair enough :p [21:59:54] old school versioning system [22:01:29] (03PS4) 10Dzahn: switch Bugzilla to zirconium [operations/dns] - 10https://gerrit.wikimedia.org/r/108906 [22:01:34] I'm trying another service-restart on parsoid after pushing a small fix [22:02:05] might need a root to fix it up when that fails again [22:02:10] PROBLEM - Varnishkafka Delivery Errors on cp3022 is CRITICAL: kafka.varnishkafka.kafka_drerr.per_second CRITICAL: 6248.366699 [22:02:56] mutante, ori: another dsh -g parsoid service parsoid restart needed [22:03:08] sorry, i am in the bugzilla migration [22:03:12] if somebody else could [22:03:23] k [22:03:34] eh, causing breakage? then i will [22:03:40] PROBLEM - Parsoid on wtp1019 is CRITICAL: Connection refused [22:03:42] can still stop the other thing [22:03:45] arr, ok, hold on [22:04:10] i think one of the biggest annoyances with bugzilla is the extremely cumbersome search system. Even searching for "stuff i wrote, or stuff that mentions me" requires typing in my email in some obscure field on the huge search form [22:04:20] PROBLEM - Parsoid on wtp1022 is CRITICAL: Connection refused [22:04:33] i would have thought that SQL like lookups are a thing of the past [22:04:35] yurik: pro tip: use quick search [22:04:35] yurik, let me fix part of that later today [22:04:38] !bug reporter:yurik [22:04:38] https://bugzilla.wikimedia.org/reporter:yurik [22:04:44] heh [22:04:49] bah, this channel has broken bangcodes. [22:04:54] lol [22:05:05] my point exactly :DDD [22:05:07] https://bugzilla.wikimedia.org/buglist.cgi?quicksearch=reporter:yurik (when bugzilla goes back up) [22:05:18] !bug [22:05:19] https://bugzilla.wikimedia.org/$1 [22:05:22] !bug del [22:05:22] You are not authorized to perform this, sorry [22:05:26] mutante, most boxes seem to have worked this time [22:05:27] lol [22:05:28] bah. [22:05:37] someone to this, then !bug is https://bugzilla.wikimedia.org/buglist.cgi?quicksearch=$url_encoded_* for me? [22:05:56] !bug del [22:05:56] Successfully removed bug [22:06:00] !bug is https://bugzilla.wikimedia.org/buglist.cgi?quicksearch=$url_encoded_* [22:06:00] Key was added [22:06:26] yurik: https://bugzilla.mozilla.org/page.cgi?id=quicksearch.html enjoy [22:06:37] thanks :) [22:06:40] "Successfully removed bug"??? [22:06:41] i understand reading docs is not something people normally do, but this one is worth it :P [22:06:50] MatmaRex, it's still on my to-do list I must admit [22:06:58] (reading that quick search docs page) [22:07:10] !log restarting parsoids [22:07:18] Logged the message, Master [22:07:24] heh, i guess i am too used to http://youtrack.jetbrains.com/dashboard [22:08:02] gwicke: ok now? sorry, had issues with key first [22:08:03] try typing things in - like "repor" [22:08:13] i did via dsh [22:08:19] from good old fenari [22:08:21] (03PS1) 10Ottomata: Initial debian release [operations/debs/git-fat] (debian) - 10https://gerrit.wikimedia.org/r/113018 [22:08:38] mutante, most boxes were fine to start with [22:09:26] wtp1019 still refuses connections [22:09:34] (03CR) 10Catrope: [C: 032] Move VisualEditor to opt-in status on eswiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/109639 (owner: 10TTO) [22:09:42] RECOVERY - Parsoid on wtp1019 is OK: HTTP OK: HTTP/1.1 200 OK - 970 bytes in 0.009 second response time [22:09:44] !log root@wtp1019:~# service parsoid restart [22:09:52] Logged the message, Master [22:09:53] wtp1022 too [22:10:04] I think those were the only ones [22:10:06] yes, done [22:10:14] ack, also per icinga that way [22:10:19] mutante, thanks! [22:10:43] (03Merged) 10jenkins-bot: Move VisualEditor to opt-in status on eswiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/109639 (owner: 10TTO) [22:10:46] np [22:10:48] will do some restart testing under load before next deploy [22:10:51] andre__: ok, i'm coming back :) [22:10:57] heh [22:11:03] RECOVERY - Parsoid on wtp1022 is OK: HTTP OK: HTTP/1.1 200 OK - 970 bytes in 0.003 second response time [22:11:07] gwicke: What happened to those two boxes? [22:11:15] (03CR) 10Catrope: [C: 032] Enable VisualEditor in opt-in mode for French Wikiversity [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112582 (owner: 10Jforrester) [22:11:25] (03Merged) 10jenkins-bot: Enable VisualEditor in opt-in mode for French Wikiversity [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112582 (owner: 10Jforrester) [22:11:53] RoanKattouw, the restart failed [22:12:06] via salt [22:12:07] stop: Rejected send message, 1 matched rules; type="method_call", sender=":1.8" (uid=575 pid=15476 comm="stop parsoid ") interface="com.ubuntu.Upstart0_6.Job" member="Stop" error name="(unset)" requested_reply="0" destination="com.ubuntu.Upstart" (uid=0 pid=1 comm="/sbin/init") [22:12:12] fwiw [22:12:18] (03PS1) 10Diederik: Enable ResourceManager API in hue.ini to enable JobHistory in Hue. [operations/puppet/cdh4] - 10https://gerrit.wikimedia.org/r/113021 [22:12:30] start: Rejected send message, 1 matched rules; type="method_call", sender=":1.9" ... [22:12:39] (03PS2) 10Ottomata: Initial debian release [operations/debs/git-fat] (debian) - 10https://gerrit.wikimedia.org/r/113018 [22:12:39] aha [22:13:09] upstart issue? [22:13:15] (03CR) 10Ottomata: [C: 032 V: 032] "Awesome, thanks DrDee!" [operations/puppet/cdh4] - 10https://gerrit.wikimedia.org/r/113021 (owner: 10Diederik) [22:13:27] (03CR) 10Catrope: [C: 032] Add config ability to set wgTemplateDataUseGUI for wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/111148 (owner: 10Jforrester) [22:13:35] (03Merged) 10jenkins-bot: Add config ability to set wgTemplateDataUseGUI for wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/111148 (owner: 10Jforrester) [22:14:28] google results suggest permission issues vs. upstart [22:14:31] that output is what you get when doing a parsoid service restart via dsh [22:14:42] but yea, bbl.. on this other thing [22:14:43] (03PS5) 10Tim Starling: Replace easter egg by a more explaining message [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/110926 (owner: 10Hashar) [22:14:44] http://askubuntu.com/questions/112655/what-is-the-meaning-of-this-upstart-init-error [22:15:01] (03CR) 10Tim Starling: "PS5: fix spelling error" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/110926 (owner: 10Hashar) [22:15:13] (03CR) 10Ebe123: "@TTO, consensus was reached." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/110876 (owner: 10Ebe123) [22:15:27] (03CR) 10Tim Starling: [C: 032] Replace easter egg by a more explaining message [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/110926 (owner: 10Hashar) [22:15:27] good thing Debian decided on systemd ;) [22:15:39] (03CR) 10Gergő Tisza: [C: 04-1] "> A more detailed report/recap by Fabrice will follow, for" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [22:17:03] RECOVERY - Varnishkafka Delivery Errors on cp3022 is OK: kafka.varnishkafka.kafka_drerr.per_second OKAY: 0.0 [22:17:11] !log tstarling updated /a/common to {{Gerrit|I24ff4d72a}}: Replace easter egg by a more explaining message [22:17:18] Logged the message, Master [22:17:20] !log catrope synchronized visualeditor.dblist 'Enable VE on frwikiversity' [22:17:28] Logged the message, Master [22:17:42] !log tstarling synchronized docroot/noc/conf/highlight.php [22:17:51] Logged the message, Master [22:17:54] yay TimStarling [22:18:32] <^d> #lame [22:19:08] ^d: well, even if you liked the joke, the nature of jokes is that they are only funny so many times [22:19:13] !log catrope synchronized visualeditor-default.dblist 'VE no longer default on eswiki' [22:19:13] did TimStarling kill the Easter Bunny? [22:19:20] Logged the message, Master [22:19:21] that one was getting a bit old [22:19:45] Eloquence: no, he killed the 'secret password' bunny [22:19:53] the one with hashes in its retinas, you know [22:19:57] TimStarling: Indeed. [22:20:09] * James_F +1s the decision. [22:20:10] they killed Kenny! [22:20:25] !log catrope synchronized wmf-config/InitialiseSettings.php 'secondary tabs and disableforanons for eswiki' [22:20:33] Logged the message, Master [22:20:42] TimStarling: Would you mind checking the deployment calendar before randomly touching the deployment directories? [22:20:59] sorry RoanKattouw [22:21:15] * RoanKattouw wants a locking mechanism for this kind of stuff [22:21:26] I did check the deployed diff, I made sure I was only merging that one change [22:21:28] <^d> James_F: Luckily +1s mean nothing! :) [22:21:34] We need that deploy-queue bot that greg-g has talked about [22:21:54] Yeah [22:22:04] !log catrope synchronized wmf-config/CommonSettings.php 'plumbing for as-yet-unused TemplateDataUseGUI setting' [22:22:13] Logged the message, Master [22:22:27] RoanKattouw: Make sure that your favorite deploy system requirements are on https://etherpad.wikimedia.org/p/DeploymentSystemRequirements [22:22:50] Yeah the locking thing is on there [22:22:57] I'll probably start turning that into a wiki page with some analysis soon-ish [22:23:03] Yes, please make it a wiki page [22:23:09] (03PS1) 10Dzahn: switch new Bugzilla db from _testing to prod [operations/puppet] - 10https://gerrit.wikimedia.org/r/113028 [22:24:37] It will become something under https://www.mediawiki.org/wiki/Deployment_tooling [22:24:49] (03PS1) 10Chad: Error message haiku++ [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/113029 [22:24:58] (03CR) 10jenkins-bot: [V: 04-1] Error message haiku++ [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/113029 (owner: 10Chad) [22:25:19] bd808: RoanKattouw yes :( [22:25:24] bd808: +1. [22:25:27] https://github.com/etsy/PushBot [22:25:32] * James_F grins. [22:25:36] a modified version of that would work [22:25:51] (just of the statuses/states) [22:26:56] ^d: :-D [22:26:59] (03CR) 10Nemo bis: "Gergő, your interpretation is surely very misleading. Consensus has never been enough by itself to enable an extension, if that extension " [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [22:27:15] (03PS1) 10Diederik: Updating cdh4 submodule with hue jobbrowser fix [operations/puppet] - 10https://gerrit.wikimedia.org/r/113030 [22:28:04] greg-g: Have you opened up a bug about pushbot yet? [22:29:34] (03CR) 10Ottomata: [C: 032 V: 032] Updating cdh4 submodule with hue jobbrowser fix [operations/puppet] - 10https://gerrit.wikimedia.org/r/113030 (owner: 10Diederik) [22:30:12] PROBLEM - MySQL Slave Running on db1001 is CRITICAL: CRIT replication Slave_IO_Running: Yes Slave_SQL_Running: No Last_Error: Error There is no such grant defined for user bugs on host % on [22:31:03] heh [22:31:04] !log catrope synchronized php-1.23wmf13/extensions/VisualEditor/ 'cherry-picks' [22:31:12] Logged the message, Master [22:32:12] RECOVERY - MySQL Slave Running on db1001 is OK: OK replication Slave_IO_Running: Yes Slave_SQL_Running: Yes Last_Error: [22:33:14] bd808: maybe? sorry, in meeting otherwise would search [22:34:09] greg-g: Also bugzilla is "in flux" :) [22:39:38] !log Bugzilla in scheduled maintenance/upgrade period - be back in a bit [22:39:46] Logged the message, Master [22:45:44] !deployed change to jsduck jobs that will cause them to fail more often (but in a good way) [22:45:48] Er [22:45:50] !log deployed change to jsduck jobs that will cause them to fail more often (but in a good way) [22:45:58] Logged the message, Master [22:47:25] <^d> !log gerrit: disabled bz plugin during bz maintenance, spamming errors since it can't connect [22:47:31] Logged the message, Master [22:47:42] ^d: thanks! [22:48:00] we were just talking about that, that was quick [22:49:08] thanks [22:52:05] (03PS4) 10Nemo bis: Remove ArticleFeedbackv5 from Wikimedia wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [22:52:17] (03PS5) 10Nemo bis: Remove ArticleFeedbackv5 from Wikimedia wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [22:54:59] (03CR) 10Tim Starling: [C: 031] "This redirect looks pretty pointless, but any other redirects to bugzilla should be handled by the main application cluster. A DNS pointer" [operations/dns] - 10https://gerrit.wikimedia.org/r/112932 (owner: 10Dzahn) [22:56:06] (03CR) 10Dzahn: "thanks Tim, yes, i just did that to fix the cert error on bugzilla.wikipedia.org today in I303339f00178eeca4c95a6baaca74bb9a6c8cf71" [operations/dns] - 10https://gerrit.wikimedia.org/r/112932 (owner: 10Dzahn) [22:58:32] (03Abandoned) 10Chad: Error message haiku++ [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/113029 (owner: 10Chad) [22:58:51] (03CR) 10Dzahn: [C: 032] drop bugs.wikipedia.org [operations/dns] - 10https://gerrit.wikimedia.org/r/112932 (owner: 10Dzahn) [23:02:28] (03CR) 10Dzahn: [C: 032] "springle confirmed new db is populated" [operations/puppet] - 10https://gerrit.wikimedia.org/r/113028 (owner: 10Dzahn) [23:03:27] mutante: So productive :D [23:06:38] !log checksetup.pl on zirconium doing db upgrades .. [23:06:46] Logged the message, Master [23:09:22] PROBLEM - Apache HTTP on mw1154 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [23:09:52] PROBLEM - Apache HTTP on mw1157 is CRITICAL: Connection timed out [23:10:12] PROBLEM - LVS HTTP IPv4 on rendering.svc.eqiad.wmnet is CRITICAL: Connection timed out [23:10:32] PROBLEM - Apache HTTP on mw1156 is CRITICAL: Connection timed out [23:11:02] PROBLEM - Apache HTTP on mw1160 is CRITICAL: Connection timed out [23:11:12] PROBLEM - HTTP 5xx req/min on tungsten is CRITICAL: CRITICAL: reqstats.5xx [crit=500.000000 [23:11:12] PROBLEM - Apache HTTP on mw1159 is CRITICAL: Connection timed out [23:11:12] PROBLEM - Apache HTTP on mw1158 is CRITICAL: Connection timed out [23:11:12] PROBLEM - Apache HTTP on mw1153 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [23:12:12] PROBLEM - Apache HTTP on mw1155 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [23:13:33] (03CR) 10Fabriceflorin: "Nemo," [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [23:14:02] RECOVERY - Apache HTTP on mw1153 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.076 second response time [23:14:03] RECOVERY - Apache HTTP on mw1159 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.803 second response time [23:14:12] RECOVERY - LVS HTTP IPv4 on rendering.svc.eqiad.wmnet is OK: HTTP OK: HTTP/1.1 200 OK - 66460 bytes in 0.334 second response time [23:14:34] ...? [23:14:52] RECOVERY - Apache HTTP on mw1160 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.721 second response time [23:14:56] (03CR) 10Fabriceflorin: "Here are the links that I forgot to include in my earlier message:" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [23:15:03] RECOVERY - Apache HTTP on mw1158 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.062 second response time [23:15:03] RECOVERY - Apache HTTP on mw1155 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.723 second response time [23:15:12] RECOVERY - Apache HTTP on mw1154 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.069 second response time [23:15:41] what was that? [23:16:32] RECOVERY - Apache HTTP on mw1156 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.076 second response time [23:16:50] (03CR) 10Gergő Tisza: [C: 04-1] "As far as I know nothing like that has been told to those communities who expressed interest (it certainly hasn't been told to me when I d" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [23:16:53] RECOVERY - Apache HTTP on mw1157 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.965 second response time [23:17:04] hmmm all the image scalers? [23:18:22] PROBLEM - Apache HTTP on mw1154 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [23:18:29] (03CR) 10Gergő Tisza: "...does NOT directly alter any major feature like article editing or storage..." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [23:19:12] PROBLEM - Apache HTTP on mw1153 is CRITICAL: Connection timed out [23:19:12] PROBLEM - Apache HTTP on mw1158 is CRITICAL: Connection timed out [23:19:12] PROBLEM - Apache HTTP on mw1155 is CRITICAL: Connection timed out [23:19:12] PROBLEM - Apache HTTP on mw1159 is CRITICAL: Connection timed out [23:19:12] RECOVERY - Apache HTTP on mw1154 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.072 second response time [23:19:21] Feb 12 23:19:05 mw1154 kernel: [23374557.228211] Memory cgroup out of memory: Kill process 22349 (convert) score 1000 or sacrifice child [23:19:42] PROBLEM - Apache HTTP on mw1156 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [23:19:52] PROBLEM - Apache HTTP on mw1157 is CRITICAL: Connection timed out [23:20:12] PROBLEM - LVS HTTP IPv4 on rendering.svc.eqiad.wmnet is CRITICAL: Connection timed out [23:20:32] also Task in /mediawiki/job/21889 killed as a result of limit of /mediawiki/job/21889 [23:21:03] PROBLEM - Apache HTTP on mw1160 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [23:21:12] RECOVERY - LVS HTTP IPv4 on rendering.svc.eqiad.wmnet is OK: HTTP OK: HTTP/1.1 200 OK - 66460 bytes in 0.214 second response time [23:21:31] rendering, it's imagemagick,yes, did you kill convert ? [23:21:41] arg, i am on the BZ thing [23:22:02] RECOVERY - Apache HTTP on mw1153 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.050 second response time [23:22:27] apparently oom-death of various image-scaler processes in a normal thing in the logs? [23:22:32] RECOVERY - Apache HTTP on mw1156 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.230 second response time [23:22:58] Yeah we OOM in the scalers quite often [23:23:02] RECOVERY - Apache HTTP on mw1158 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.060 second response time [23:23:03] RECOVERY - Apache HTTP on mw1155 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.093 second response time [23:23:03] RECOVERY - Apache HTTP on mw1159 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.652 second response time [23:23:15] mw1154 has it happening multiple times per minute going back as far as local syslogs go :( [23:23:52] RECOVERY - Apache HTTP on mw1160 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.070 second response time [23:24:52] RECOVERY - Apache HTTP on mw1157 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.066 second response time [23:27:33] !log mw1185 continues to segfault at a average rate of ~800/hr for the last 48 hours [23:27:40] Logged the message, Master [23:28:00] bd808: https://gerrit.wikimedia.org/r/#/c/107419/ btw ;) [23:28:46] AaronSchulz: Looking. I remember seeing an email but then I forgot to review [23:29:53] !log xtrabackup clone db1049 to db1056 [23:30:00] Logged the message, Master [23:30:22] springle: I heard we were getting more slaves a while ago, right? [23:31:03] yes [23:31:07] given our high connection limits and errors...just curious what is going on there [23:31:56] <^d> springle: I wrote a change for your WikiPage::newFromId() request :) [23:32:04] <^d> Maybe we can pester AaronSchulz into reviewing it. [23:32:32] AaronSchulz: additional s1 slaves are ordered (128G boxes), plus several pmtpa 96G slaves are shipping [23:32:42] PROBLEM - MySQL Processlist on db1049 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 0 copy to table, 127 statistics [23:32:57] I wonder how many will arrive still alive ;) [23:33:12] eek don't say that [23:33:14] :) [23:35:42] PROBLEM - MySQL Processlist on db1049 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 0 copy to table, 673 statistics [23:37:27] wow TRANSACTION 1A3FF8532, ACTIVE 16233 sec [23:37:44] wikiuser [23:42:42] RECOVERY - MySQL Processlist on db1049 is OK: OK 0 unauthenticated, 0 locked, 0 copy to table, 0 statistics [23:44:27] mutante: if you find some minutes, please review a patch or two of mine, they are rather simple [23:44:34] most of them [23:44:48] matanya: yes, "if", but that is unfortunately not right now [23:45:06] BZ [23:45:11] I know [23:45:25] whenever you can :) thanks and good night [23:45:42] PROBLEM - MySQL Processlist on db1049 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 0 copy to table, 785 statistics [23:45:43] sure, just add gerrit, ttyl [23:47:42] RECOVERY - MySQL Processlist on db1049 is OK: OK 0 unauthenticated, 0 locked, 0 copy to table, 1 statistics [23:51:37] (03PS5) 10Dzahn: switch Bugzilla to zirconium [operations/dns] - 10https://gerrit.wikimedia.org/r/108906 [23:51:54] (03CR) 10MZMcBride: "Fabrice: thank you for your posts. I agree that it seems fine to hold off on merging this for a bit. Let's tentatively say Monday, March 3" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/112639 (owner: 10MZMcBride) [23:58:22] (03CR) 10MZMcBride: "Completely agree with Tim L." [operations/puppet] - 10https://gerrit.wikimedia.org/r/111523 (owner: 10Ottomata) [23:59:28] mutante/andre__: Good luck with the move over - look forward to using the new Bugzilla in the morning :) [23:59:37] JohnLewis, thanks