[00:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 00:02:37 UTC 2013 [00:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [00:05:38] twkozlowski: my unederstanding is the default was changed back for all. I think the patch was by Krinkle? [00:05:48] understanding* [00:05:52] I'll do some hunting. [00:06:54] I don't recall changing wgCookieExpiration in either core of wmf. [00:06:58] but its possible [00:07:23] core: $wgCookieExpiration = 180 * 86400; [00:07:41] New patchset: Ori.livneh; "Clean-up of 'eventlogging::archive' class." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74562 [00:07:42] wmf: $wgCookieExpiration = 30 * 86400; [00:07:48] StevenW: twkozlowski [00:07:58] (latest master) [00:08:20] I may be confusing it was something related but different. [00:09:03] That patch about removing the anon cookie that expired waaay too long? [00:09:44] LeslieCarr: do you happen to know where the pybal config file for the squids live? [00:09:59] mwalker: fenari [00:10:07] like everything else ;) [00:10:26] ah; cool; where on that beast? [00:10:37] /home/w/conf/pybal [00:10:50] *thumbs up* [00:10:54] thanks! [00:10:57] yw [00:11:07] https://www.mediawiki.org/w/index.php?title=Manual:%24wgCookieExpiration&action=purge [00:14:35] thanks Krinkle [00:15:03] StevenW: Yes, but you responded to Raymond's question about that on the privacy policy discussion page on Meta :-) [00:15:45] I mean, the guy's question was about the Wikimedia set up, not the general MediaWiki login expiration. So I think the answer still applies [00:16:48] "This year we noted the discrepancy in engineering and reset $wgCookieExpiration back to 30 days, and I've updated the MediaWiki manual page you linked to note this." [00:18:05] https://www.mediawiki.org/w/index.php?title=Manual:$wgCookieExpiration&diff=724468&oldid=723903 [00:18:17] so Krinkle says it actually wasn't, that's why I pinged you StevenW :) [00:23:30] twkozlowski: The commit date doesn't resemble deployment for the svn commit (probably several months later), but yes, as I see it, I'd say there was about a year where the expiration was 180 days. [00:23:43] However thinking back I find that unlikely, afaik we've always had 30 days in production. [00:24:07] perhaps it was reset to 30 by other means, or maybe it wasn't deployed as-is. [00:24:16] in svn days we had quite a branch mess [00:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 00:24:52 UTC 2013 [00:25:02] Krinkle: certainly we didn't always have 30 days in production [00:25:22] People have been complaining about the change from 180 to 30, and I myself remember we used to have 180 [00:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [00:27:49] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 00:27:45 UTC 2013 [00:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [00:29:16] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 00:29:07 UTC 2013 [00:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [00:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 00:32:41 UTC 2013 [00:33:07] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [00:34:37] is there a good pattern for composing a single configuration file out of multiple resources? i don't like the rsync module's approach of creating a directory of file fragments and combine them at the end of the run; it seems weird to use /etc as storage medium for intermediate results [00:34:47] ori-l: nope [00:34:50] twkozlowski: I don't know if we ever had 180 in production, but looking purely at the commits I'd say that does indeed seem possible. However without other proof I wouldn't be so sure. [00:34:52] twkozlowski: I guess you have more data? [00:35:04] ori-l: that's something that puppet basically can't handle [00:35:16] hence the hack for rsync and puppet (and probably a bunch of other things) [00:35:17] it's really incredible the sort of things you can't do or can't do well with puppet [00:35:22] yep [00:35:23] Krinkle: Just my memory and a talk page post by a different user [00:35:32] twkozlowski: ok [00:35:33] ori-l: if there was iteration it would be doable [00:35:43] but alas, none in puppet [00:36:25] what about using the fact that arrays and hashes are mutable? [00:37:05] you could update a global var from multiple places, and use stages / virtual resources / some other sequencing trick to ensure that the variable's value is queried and the config file generated at the end [00:37:08] I guess it's possible to put some chaining in and add to arrays and hashes [00:37:12] then at the end make the file [00:37:22] but then you need to deal with chaining [00:37:29] or stages (which are totally fucking broken in puppet) [00:37:53] i like that puppet labs is at least fairly candid and apologetic of the basic design flaws, of which there are many [00:38:11] also, I'm not sure if you can actually modify a global variable that way [00:38:20] it seems like every third documentation page has a box with a note like: [00:38:28] "we botched this horribly; you probably shouldn't use this." [00:38:41] unfortunately a lot of those were best practices at some point [00:38:51] yes. [00:38:54] parametrize all classes! [00:38:55] wait, don't! [00:38:59] :D [00:40:02] it really seems like that realize they fucked up and they rush in some hack to fix it [00:40:15] and later realize "OH NO, this is horrible too!" [00:40:18] yes, and always by introducing a sweeping, new abstraction [00:40:29] @seen bblack [00:40:46] of course, I'm fighting puppet right now and I'm pissed off at it, so maybe I'm being overly harsh [00:41:37] here's what just happened to me: a module was broken, so it didn't compile it and used an old cached copy [00:41:58] I restart the puppet master, it shows me the error, I run puppet again, it shows me a different error [00:42:02] and caches the broken module [00:42:26] heh [00:42:26] so much fail [00:42:41] Wait, so if there's a parse error it's just, like, hey, I'll try this instead! [00:43:11] andrewbogott: it did for me [00:43:21] not with a parse error, i don't think [00:43:24] That makes it /very/ hard to debug! [00:43:36] ori-l: mine was a parse error :( [00:43:42] but with other errors it could get stuck in different places because the order of execution is non-determinsitic, i think [00:43:50] oh, heh, wonderful. [00:43:53] Although I guess it fits with the "This is terrible but seems to work for now" design ethos [00:45:52] i think the ease with which you can extend puppet with custom resource types, providers and ruby functions is one thing they got profoundly right; you can bend it to create concise DSLs for managing the configuration of complex software [00:46:07] i really enjoy that part of it [00:46:13] yeah, that's nice [00:46:50] but why param classes an definitions? [00:46:54] *and [00:47:00] what's the major difference? [00:47:33] there are many things in puppet which are roughly equivalent and where the salient difference boils down to 'broken in different ways' [00:47:36] including and using functions is a pretty massive pain in the ass [00:47:42] though I guess it's easier in modules [00:47:57] it's also discouraged [00:48:19] yeah, but i don't take puppet labs' pronouncements on these things as binding [00:48:23] custom facter is also a pain [00:48:33] because you have to ensure it's deployed before it's used [00:48:33] because unfortunately a lot of those were best practices at some point [00:49:17] what's custom facter? [00:49:21] some external tool that generates facts? [00:49:35] it's just adding a ruby facter script [00:49:47] that adds some fact [00:49:47] that you then want to use in puppet [00:49:49] but you deploy it from puppet [00:50:03] ah, vagrant has a nice hook for that that i've used; never tried implementing it from scratch [00:50:22] New patchset: TTO; "(bug 51605) add alias for project namespace on ckbwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74567 [00:50:36] this is one thing that salt handles amazingly well. you have grains, modules, pillars, etc on the master and it'll fetch it before its used [00:51:06] grain = fact, pillar = global variable, module = custom function [00:51:15] state = class [00:51:42] hmm, that sounds pretty cool [00:51:43] you can use custom modules in states and it'll ensure it's installed before use. [00:52:04] and you can also use those modules for remote execution [00:52:14] you can also call individual states via remote execution [00:52:53] salt's states are deficient in functionality compared to puppet, though, so it's a give and take [00:54:22] ori-l: hm. is that hook used in vagrant common in puppet too? [00:54:30] I'm not sure how we're currently doing it, but it sucks [00:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 00:54:41 UTC 2013 [00:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [00:56:33] i think vagrant just assembles all custom facts into FACTER_%s environment variables and it uses the shell trick of defining a variable for a single command by including its definition as a prefix of the invocation [00:57:01] like IFS='|' awk ... [00:57:25] ah [00:57:51] ok. time to go climbing [00:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 00:58:48 UTC 2013 [00:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [01:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 01:02:42 UTC 2013 [01:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [01:14:47] New patchset: TTO; "(bug 51537) require 10 edits for autoconfirmed on ckbwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74568 [01:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 01:24:47 UTC 2013 [01:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [01:26:42] gitblit seems to be unresponsive: https://git.wikimedia.org/tree/mediawiki%2Fcore [01:27:44] New patchset: Vogone; "Enabling the 'property-create' right for all users on testwikidatawiki Bug: 51637" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74528 [01:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 01:27:39 UTC 2013 [01:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [01:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 01:28:50 UTC 2013 [01:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [01:32:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 01:32:46 UTC 2013 [01:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [01:39:14] hello operations [01:39:14] can I ask something ? [01:40:52] I'd like to pitch an idea [01:41:34] but probably not the best time since it's late [01:42:20] I'd like to pitch the idea of having skeleton git repositories for different projects that require packaging [01:42:59] so for example, there would be a Python skeleton , a Ruby skeleton, a Java skeleton, a C or C++ skeleton , a Perl skeleton and a PHP skeleton [01:43:04] each in different repos [01:43:27] and the aim would be, that if someone needs to do a debianization, he could clone on of those skeletons, and fit his applications in it, and push [01:43:47] and in doing this, he would have more chances of doing it the right way [01:44:28] it's the opposite of what we're doing now(at least in terms of order) [01:44:43] right now we're developing and considering debianization as the last step [01:45:36] but if we had a way to use somethng that operations provided from the get-go, which they agree on, then we'd be able to build on top of that and then the whole process would be easier IMHO [01:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 01:54:42 UTC 2013 [01:55:42] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [01:58:06] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 01:58:04 UTC 2013 [01:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [01:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 01:58:45 UTC 2013 [01:59:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [02:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 02:02:39 UTC 2013 [02:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [02:15:30] !log LocalisationUpdate completed (1.22wmf10) at Fri Jul 19 02:15:29 UTC 2013 [02:15:41] Logged the message, Master [02:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 02:24:50 UTC 2013 [02:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [02:27:36] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 02:27:35 UTC 2013 [02:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [02:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 02:28:47 UTC 2013 [02:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [02:29:56] PROBLEM - Puppet freshness on manutius is CRITICAL: No successful Puppet run in the last 10 hours [02:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 02:32:43 UTC 2013 [02:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [02:33:48] !log LocalisationUpdate completed (1.22wmf11) at Fri Jul 19 02:33:48 UTC 2013 [02:33:59] Logged the message, Master [02:49:50] !log LocalisationUpdate ResourceLoader cache refresh completed at Fri Jul 19 02:49:50 UTC 2013 [02:50:01] Logged the message, Master [02:56:04] New review: Hazard-SJ; "Shouldn't that be after test2wiki? That's my only issue." [operations/mediawiki-config] (master) C: -1; - https://gerrit.wikimedia.org/r/74528 [02:57:36] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 02:57:34 UTC 2013 [02:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [02:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 02:58:46 UTC 2013 [02:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [03:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 03:02:37 UTC 2013 [03:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [03:03:54] New review: Vogone; "If you want to keep an alphabetical order, test2wiki would also have to be before testwiki." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74528 [03:13:32] New review: Hazard-SJ; "Since the others are also misplaced, might as well a separate patch be created to fix that, so this ..." [operations/mediawiki-config] (master) C: 1; - https://gerrit.wikimedia.org/r/74528 [03:24:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 03:24:45 UTC 2013 [03:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [03:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 03:27:40 UTC 2013 [03:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [03:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 03:28:46 UTC 2013 [03:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [03:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 03:32:44 UTC 2013 [03:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [03:39:16] https://git.wikimedia.org appears to be down. [03:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 03:54:44 UTC 2013 [03:55:40] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [03:58:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 03:58:08 UTC 2013 [03:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [03:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 03:58:49 UTC 2013 [03:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [04:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 04:02:43 UTC 2013 [04:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [04:11:46] PROBLEM - Disk space on ms-be1 is CRITICAL: DISK CRITICAL - /srv/swift-storage/sdh1 is not accessible: Input/output error [04:24:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 04:24:45 UTC 2013 [04:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [04:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 04:27:37 UTC 2013 [04:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [04:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 04:28:49 UTC 2013 [04:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [04:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 04:32:40 UTC 2013 [04:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [04:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 04:54:44 UTC 2013 [04:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [04:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 04:57:38 UTC 2013 [04:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [04:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 04:58:44 UTC 2013 [04:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [05:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 05:02:38 UTC 2013 [05:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [05:24:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 05:24:44 UTC 2013 [05:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [05:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 05:27:38 UTC 2013 [05:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [05:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 05:28:50 UTC 2013 [05:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [05:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 05:32:40 UTC 2013 [05:32:56] PROBLEM - Puppet freshness on fenari is CRITICAL: No successful Puppet run in the last 10 hours [05:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [05:44:56] PROBLEM - Puppet freshness on bast1001 is CRITICAL: No successful Puppet run in the last 10 hours [05:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 05:54:47 UTC 2013 [05:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [05:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 05:57:40 UTC 2013 [05:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [05:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 05:58:46 UTC 2013 [05:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [06:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 06:02:38 UTC 2013 [06:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [06:13:26] PROBLEM - HTTP radosgw on ms-fe1004 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.004 second response time [06:15:26] PROBLEM - HTTP radosgw on ms-fe1003 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.003 second response time [06:15:42] oh ceph [06:16:06] PROBLEM - HTTP radosgw on ms-fe1002 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.014 second response time [06:19:17] grrr [06:20:33] ceph? [06:21:49] yeah [06:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 06:24:50 UTC 2013 [06:25:37] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [06:27:06] RECOVERY - HTTP radosgw on ms-fe1002 is OK: HTTP OK: HTTP/1.1 200 OK - 311 bytes in 0.003 second response time [06:27:47] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 06:27:41 UTC 2013 [06:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [06:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 06:28:48 UTC 2013 [06:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [06:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 06:32:41 UTC 2013 [06:32:56] PROBLEM - Puppet freshness on neon is CRITICAL: No successful Puppet run in the last 10 hours [06:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [06:36:41] New patchset: Ori.livneh; "Clean-up of 'eventlogging::archive' class." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74562 [06:48:40] still up ori-l? [06:48:54] yeah; what's up? [06:49:00] no just saying [06:49:06] since i saw your commit :) [06:50:11] ah, yeah, i get a bit fanatical sometimes about a set of changes. [06:50:20] "sometimes" [06:50:33] New review: Faidon; "Unparameterizing it and hardcoding all_networks inside EL seems like a step in the opposite directio..." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74562 [06:50:39] er, s/opposite/wrong/ even [06:50:58] Aaron|home: past your bed-time, too :P [06:51:11] * Aaron|home has laundry still spinning [06:51:35] of course that's just an excuse since I'm almost always up at this time [06:52:09] i took you to be speaking euphemistically anyhow ;) [06:52:26] * Aaron|home set his s4 lock from pin -> password -> pin again since the typo rate was too high [06:52:39] and all the letters cover up my Liz Taylor wallpaper [06:53:03] S4? [06:53:23] samsung galazy [06:54:04] the number keypad fits just right though [06:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 06:54:51 UTC 2013 [06:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [06:57:36] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 06:57:35 UTC 2013 [06:58:27] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [06:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 06:58:41 UTC 2013 [06:58:56] PROBLEM - Puppet freshness on analytics1019 is CRITICAL: No successful Puppet run in the last 10 hours [06:58:57] New review: Ori.livneh; "Because the module does not otherwise employ parametrized classes or rely on the ordering of resourc..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74562 [06:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [06:59:39] knight to d6, faidon! :P [07:00:28] how's ordering related? [07:00:54] you mean that you can include ::archive from multiple places [07:01:22] right, but the resource declaration style of loading the class would then have to load first [07:02:05] so IIRC it's possible (and happens in our codebase) that you have class { 'eventlogging::archive': param => 'foo', } in one place, and 'include eventlogging::archive' in others [07:02:36] which is OK unless some other dependency causes puppet to instantiate the class on the include statement before it acts on the resource declaration style instantiation [07:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 07:02:40 UTC 2013 [07:02:56] PROBLEM - Puppet freshness on analytics1018 is CRITICAL: No successful Puppet run in the last 10 hours [07:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [07:03:23] it sounds obscure, but i come across it regularly. it bit me when i made the changes to the scap scripts. [07:03:38] why would you include it in one place and include it parameterized in another? [07:03:56] PROBLEM - Puppet freshness on analytics1020 is CRITICAL: No successful Puppet run in the last 10 hours [07:03:59] hi paravoid [07:04:16] good morning [07:04:58] good morning [07:05:21] you might include it in eventlogging::consumer, reasoning that any node that consumes events ought to have log rotation for /var/log/eventlogging, but you don't know in the resource definition about what rsync hosts will pull the data [07:05:28] paravoid: pushed some new patches in => https://gerrit.wikimedia.org/r/#/c/73860/ [07:05:54] paravoid: I addressed your comments and Brandon's [07:06:12] poor paravoid :) [07:06:21] average: you grab the left arm, i'll grab the right arm [07:07:00] hahaha [07:07:10] :D [07:07:17] average: will you be around in 20'? [07:07:26] I need to finish up with something else [07:07:31] paravoid: I have insomnia, I'll be around in * [07:07:38] *' [07:07:38] and Ori, and it's late for him too :) [07:07:52] but you're in Europe aren't you? [07:07:59] I am in Europe yes [07:08:12] ok, I got confused with the insomnia comment :) [07:08:25] my comments? huh? [07:08:35] jorm: brandon black [07:08:40] your evil alter-ego [07:08:53] hehe [07:08:54] the False Brandon. [07:09:25] he says the same about you [07:09:31] told us not to believe a word you said [07:09:45] ori-l: hm, logrotate and rsync at the same place feels weird [07:09:55] maybe that's the root cause of your issue [07:10:18] maybe rsync should move to the role? [07:10:28] I WAS THE FIRST. [07:10:46] jorm: https://en.wikipedia.org/wiki/Genetic_fallacy [07:10:49] maybe, yeah [07:10:57] or a different class, parameterized [07:11:19] there's just something nice about having this entire module completely dodge an entire class of puppet pitfalls (duplicate class declarations), and it somehow seems unfair to compromise that over such a small detail as rsync hosts_allowed [07:11:34] so role class it is [07:13:05] lol [07:13:19] I don't think we need to fear parameterized classes that much [07:13:34] your rsync accessor class could fail if you don't pass its parameters, if you prefer that [07:13:52] (if ... { fail("You need to supply a parameter to ...") } [07:15:11] i just flipped through this o'reilly book, "puppet types and providers" [07:15:27] and like a man-child with a new toy i keep looking for an excuse to implement a type or a provider in ruby [07:15:45] and it's taking every ounce of self-restraint that i don't have not to invent some new abstractions [07:15:59] so i'm just going to move it to the role class and call it a night :P [07:17:04] hahaha [07:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 07:24:51 UTC 2013 [07:25:26] RECOVERY - HTTP radosgw on ms-fe1004 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 186 bytes in 0.001 second response time [07:25:26] RECOVERY - HTTP radosgw on ms-fe1003 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 186 bytes in 0.011 second response time [07:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [07:26:02] oh, ceph.... [07:28:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 07:28:07 UTC 2013 [07:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [07:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 07:28:52 UTC 2013 [07:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [07:33:06] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 07:33:01 UTC 2013 [07:34:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [07:35:17] this is me sleeping btw, but now I'm not :) http://garage-coding.com/sleeping.png [07:35:56] that was random [07:38:56] PROBLEM - Puppet freshness on erzurumi is CRITICAL: No successful Puppet run in the last 10 hours [07:38:56] PROBLEM - Puppet freshness on lvs1004 is CRITICAL: No successful Puppet run in the last 10 hours [07:38:56] PROBLEM - Puppet freshness on lvs1005 is CRITICAL: No successful Puppet run in the last 10 hours [07:38:56] PROBLEM - Puppet freshness on lvs1006 is CRITICAL: No successful Puppet run in the last 10 hours [07:38:56] PROBLEM - Puppet freshness on virt1 is CRITICAL: No successful Puppet run in the last 10 hours [07:38:56] PROBLEM - Puppet freshness on virt3 is CRITICAL: No successful Puppet run in the last 10 hours [07:38:56] PROBLEM - Puppet freshness on virt4 is CRITICAL: No successful Puppet run in the last 10 hours [07:43:06] New patchset: Ori.livneh; "Remove 'eventlogging::archive' class." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74562 [07:46:53] Aaron|home: still around? [07:47:34] * Aaron|home is not around [07:48:16] I just want to show you something [07:48:49] Aaron|home: http://tracker.ceph.com/issues/5675 [07:48:52] epic bug is epic [07:50:40] :/ [07:52:22] also http://tracker.ceph.com/issues/5674 [07:52:24] and http://tracker.ceph.com/issues/5655 [07:52:33] and 5671 but this was a misunderstaing [07:52:37] misunderstanding [07:52:46] but the peering bug seems to be fixed, so at least there are some good news [07:53:37] and they wanted to release an rc today [07:53:39] bwahahaha [07:54:47] okay, stopping all radosgws [07:54:50] no point really [07:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 07:54:46 UTC 2013 [07:55:25] !log stopping radosgw across the ceph cluster, post-upgrade issues [07:55:35] Logged the message, Master [07:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [07:57:06] PROBLEM - HTTP radosgw on ms-fe1002 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.001 second response time [07:57:24] it gets the semantics of HTTP 500 exactly right [07:57:26] PROBLEM - HTTP radosgw on ms-fe1003 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.003 second response time [07:57:36] PROBLEM - HTTP radosgw on ms-fe1001 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.002 second response time [07:58:06] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 07:58:03 UTC 2013 [07:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [07:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 07:58:43 UTC 2013 [07:58:47] okay, reviewing mode now [07:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [07:59:17] oh niiice [07:59:28] thank you [08:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 08:02:39 UTC 2013 [08:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [08:03:25] New review: Faidon; "Perfect, thank you!" [operations/puppet] (production) C: 2; - https://gerrit.wikimedia.org/r/74562 [08:03:26] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74562 [08:04:24] weeee, thanks [08:06:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [08:07:05] average: hey [08:07:07] 26 [08:07:07] 27 [08:07:07] dclass_java.h: 28 [08:07:07] » » javac -classpath ./java java/dclass/dClass.java 29 [08:07:09] » » javah -o java/jni/dclass_java.h -classpath ./java dclass.dClass [08:07:13] what does that do? [08:07:27] (first of all, one tab is enough, but let's ignore that for a moment) [08:07:36] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.123 second response time [08:07:57] creating libdclass_java.h out of dClass.java? [08:08:01] paravoid: do you think the peering bug was likely fully fixed? [08:08:18] Aaron|home: it's much, *much* better now [08:08:24] they fixed about 5 different things [08:08:49] the latest thing is a bug that exists back from bobtail [08:09:00] is there a list of blocking bugs somewhere? I thought that was the main one [08:09:12] and it was work that was done peering-time that didn't need to happen [08:09:22] and it was related to deletes, so that's why we experienced it more than others did [08:09:46] yes, that's the main one, but now I'm dealing with the fallout of the new version, see the bug above [08:10:37] New patchset: Mark Bergsma; "Don't run the default vcl_fetch function on mobile caches" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74590 [08:10:44] the number of bugs creeping up though is a bit worrying [08:11:08] especially given the prospect of relying 100% on ceph in production in a few months [08:11:15] so we're still contemplating what to do [08:11:29] maybe set up a small swift cluster in eqiad, just as a backup [08:12:36] RECOVERY - HTTP radosgw on ms-fe1001 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 186 bytes in 0.002 second response time [08:12:50] 20:45 the java bits ==> 1) there's a custom dclass_java.h header(you will see a target called dclass_java.h inside Makefile.am) that is automatically generated through javah, basically javac compiles dClass.java into dClass.class, and then javah reads dClass.class and says "oh, I know these function prototypes, I can now generate a C header files with the prototypes of those native API calls that I saw in J [08:12:52] paravoid: aside from the 301 what is the worst regression? [08:13:01] paravoid: ^^ [08:13:05] paravoid: pasted from a convo with bblack [08:13:53] Aaron|home: that's it for now [08:14:11] Aaron|home: I had #5651 that we debugged extensively yesterday and it got fixed [08:14:30] Aaron|home: then I had #5674 which I worked around and now I have the 301 [08:14:34] and after that, who knows :) [08:15:08] would this second cluster just be originals? [08:15:25] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74590 [08:15:39] no formulated plan yet [08:15:54] but I guess we'd start from originals, yes [08:16:26] PROBLEM - HTTP radosgw on ms-fe1004 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.004 second response time [08:16:36] PROBLEM - HTTP radosgw on ms-fe1001 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.008 second response time [08:16:39] what [08:16:45] ah, yes [08:16:58] average: okay, so the target needs a dependency on dClass.class [08:17:41] paravoid: yes, that is correct. I totally missed that, thank you [08:17:46] I'll fix it now [08:17:52] fix the tabs while at ti :) [08:17:53] it [08:20:27] but that's minor, ok [08:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 08:24:52 UTC 2013 [08:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [08:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 08:27:41 UTC 2013 [08:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [08:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 08:28:49 UTC 2013 [08:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [08:30:32] mark: can we kill these servers? these nagios flaps annoy me :) [08:30:51] i'm still using them for testing though [08:31:00] hmm [08:31:05] oh well [08:31:11] i'll ask chris/rob to decommission them properly later [08:32:04] paravoid: new patch [08:32:52] paravoid: splitted the dclass_java.h in two, namely: dClass.class and dclass_java.h targets. Added dClass.class target as dep to dclass_java.h target [08:32:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 08:32:46 UTC 2013 [08:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [08:36:58] I'm gonna get some groceries because I want to get a big timechunk to hit the keyboard hard afterwards [08:37:08] bb in 30-40m [08:38:19] paravoid: I'll take into account any of you feedback and incorporate it into review #73860 [08:38:34] *your [08:39:21] average: dClass.class is still missing a dep :) [08:39:49] paravoid: dClass.java right ? [08:39:53] you tell me [08:39:57] I guess so [08:40:01] but you know better [08:41:46] but looks good otherwise [08:41:49] new patch, did what I mentioned above [08:41:52] I haven't tested it but I guess you did [08:42:12] paravoid: I have yes [08:42:13] rm -rf dclass.jar java/jni/dclass_java.h libdclass*la *.la *.lo .libs/ .deps/ missing ; autoreconf -i ; ./configure ; make [08:42:24] make distclean? [08:42:31] trying make distclean [08:42:36] New patchset: Reedy; "Properly puppeti[sz]e purge-checkuser" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74591 [08:43:20] puppet[sz]e, hahaha [08:43:31] https://bugzilla.wikimedia.org/show_bug.cgi?id=51663 [08:43:34] oh, yeah, make distclean is cool too, however it doesn't clean everything, some bits still hang around, but they're overwritten anyway by new builds [08:43:46] "Emails rejected by the server of @wikimedia.org" [08:43:46] then I do ls .libs/ ; jar -tvf dclass.jar [08:43:50] I do that in order to check what I get.. [08:44:02] you realize it's a bug if it doesn't clean everything? :) [08:44:19] I wasn't aware [08:44:21] I'll fix it [08:45:04] Reedy: this address doesn't exist, is it supposed to? [08:45:16] I think it should go to OTRS [08:45:18] no permission-* addresses either [08:45:37] there's permissionS [08:45:45] New patchset: Reedy; "Properly puppeti[sz]e purge-checkuser" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74591 [08:47:21] Thehelpfulone: About? [08:47:46] resolved, invalid [08:47:47] thanks. [08:48:13] heh [08:49:07] New patchset: Reedy; "Properly puppeti[sz]e purge-checkuser" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74591 [08:50:02] New patchset: Reedy; "Properly puppeti[sz]e purge-checkuser" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74591 [08:51:21] New patchset: Reedy; "Properly puppeti[sz]e purge-checkuser" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74591 [08:51:57] I should probably install something locally to do the validation [08:53:40] Reedy, apt-get install puppet && puppet parser validate [08:54:12] New review: Reedy; "(1 comment)" [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74591 [08:54:35] MaxSem: i'm looking at the mobile cache hit rate atm [08:54:48] yep?:) [08:54:55] I suspect the variation of the Cookie header and all crap in there is the problem ;) [08:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 08:54:46 UTC 2013 [08:55:10] so Varnish doesn't support X-Vary-Options [08:55:18] but it also seems that the XVO header sent by MF is not complete [08:55:24] so if Varnish did, it would break [08:55:31] * MaxSem looks [08:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [08:55:44] you currently have this: [08:55:46] if( req.http.Cookie ~ "disable" || [08:55:47] req.http.Cookie ~ "optin" || [08:55:47] req.http.Cookie ~ "session" || [08:55:47] req.http.Cookie ~ "forceHTTPS" ) { [08:55:48] /* Do nothing; these are the cookies we pass. [08:55:50] * this is a hack in the absence of X-V-O support [08:55:53] */ [08:55:55] } else { [08:55:57] unset req.http.Cookie; [08:55:59] } [08:56:04] but most of that is not reflected in XVO [08:56:24] paravoid: solved the remaining bits being cleaned through the addition of DISTCLEANFILES = $(CURDIR)/java/jni/dclass_java.h $(CURDIR)/dclass.jar [08:56:33] paravoid: make distclean now does what it's supposed to [08:56:41] ok [08:56:46] you might be interested in .INTERMEDIATE too [08:56:55] owchie Accept-Encoding;list-contains=gzip,X-WAP,Cookie [08:57:05] if it's an intermediate file that you don't intend to ship but just use internally to build something [08:57:09] but that would do [08:57:44] ok, I will look into .INTERMEDIATE too [08:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 08:57:44 UTC 2013 [08:58:24] but if you are happy with this, I look forward for it being merged ( have a subsequent one to push immediately afterwards with the debian/ part) [08:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [08:58:29] MaxSem: seems like the mobile hit rate of the varnish backends is around 30-40% atm [08:58:35] and on the frontend, a surprisingly good 75% [08:58:59] I built a Varnish instance which ignores PURGEs for hit rate calculation [08:59:02] it's only deployed on cp1046 atm [08:59:16] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 08:59:06 UTC 2013 [08:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [08:59:47] btw https://gerrit.wikimedia.org/r/#/c/73342/ [09:00:37] yeah nice [09:00:47] mark, so the plain is again to use XVO? [09:01:01] no, probably not, or at least not in its current form [09:01:13] but I was trying to use the XVO header to make compatible VCL that does sort of the same [09:01:27] because i need to know what cookies MF really needs to vary on and how [09:01:32] but it seems the XVO header is not complete, so I can't [09:01:44] ok [09:01:48] we're gonna have similar problems for text varnish [09:01:51] sec [09:02:37] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 09:02:34 UTC 2013 [09:02:47] so yeah [09:02:59] i'll be looking at this for both clusters [09:03:05] and we should probably implement similar strategies ;) [09:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [09:03:35] on the text cluster, we simply unset the cookie header, and build a new one for the few cookies that varnish needs to vary on [09:03:46] and restore the original cookie header when sending it to mediawiki [09:04:09] that's probably a good option for MF too [09:07:17] average: yeah sure, it doesn't matter at all [09:07:22] average: I'm just giving tips :) [09:09:58] reedy@ubuntu64-web-esxi:~/git/operations/puppet$ puppet parser validate [09:09:58] notice: No manifest specified. Validating the default manifest /home/reedy/.puppet/manifests/site.pp [09:10:24] Oh, nvm [09:12:08] New patchset: Reedy; "Make puppet cronjob to run SecurePoll/cli/purgePrivateVoteData.php" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74592 [09:12:25] New review: Reedy; "Need to confirm run frequency" [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74592 [09:13:20] New review: Faidon; "See inline, I think what you are trying to do won't work." [operations/puppet/cdh4] (master) C: -1; - https://gerrit.wikimedia.org/r/69805 [09:17:35] mark, apparently because Varnish doesn't support XVO we're not setting it at all - will fix that [09:17:45] please do :) [09:17:57] also, squid does [09:18:02] so if anyone uses MF behind squid, it'll break [09:18:15] and even if not, it's good to keep track of how cookies should be varied on [09:18:19] Why is modules/cdh4 showing like it's got a local submodule change? [09:18:24] -Subproject commit c728ce56df9ee764b3f8807ad04e533f89f3ed7d [09:18:24] +Subproject commit 02718e3af97b2b0cddf9f428d32e4904b0389576 [09:18:26] I don't think we support Squid [09:18:52] nor does anyone else besides WMF use MF with frontend caches [09:20:17] Have you checked everywhere? ;) [09:21:12] indeed [09:21:13] bbl [09:24:39] Reedy, I can count all the third-party MF users with my hand's fingers:P [09:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 09:24:49 UTC 2013 [09:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [09:25:52] New review: Faidon; "I do like it, although ::install still feels a bit too complex." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74380 [09:27:56] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 09:27:46 UTC 2013 [09:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [09:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 09:28:48 UTC 2013 [09:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [09:39:10] Reedy: around? [09:39:13] Aye [09:39:44] ok, have settings change to do [09:39:50] tiny change [09:39:51] heh [09:39:57] I think there's a testwikidatawiki one too [09:40:01] we made the site link widget disappear [09:40:05] :D [09:40:37] New patchset: Reedy; "Enabling the 'property-create' right for all users on testwikidatawiki Bug: 51637" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74528 [09:40:52] yeah, that one would be nice [09:40:57] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74528 [09:41:15] New patchset: Reedy; "Revert "Remove bigdelete from sysops"" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74596 [09:41:31] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74596 [09:41:58] New patchset: Reedy; "(bug 51608) Configure Babel-related variables for uk.wikisource" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74381 [09:42:16] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74381 [09:42:21] New patchset: Reedy; "(bug 51605) add alias for project namespace on ckbwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74567 [09:42:42] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74567 [09:43:01] New patchset: Reedy; "(bug 51537) require 10 edits for autoconfirmed on ckbwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74568 [09:43:12] New patchset: Aude; "readd enableSiteLinkWidget setting for Wikibase" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74597 [09:43:20] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74568 [09:43:46] New patchset: Reedy; "(bug 51328 and bug 51232) add autopatrolled and uploader groups to ckbwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74308 [09:44:08] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74308 [09:44:32] New review: Addshore; "+1" [operations/mediawiki-config] (master) C: 1; - https://gerrit.wikimedia.org/r/74597 [09:44:55] New patchset: Reedy; "readd enableSiteLinkWidget setting for Wikibase" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74597 [09:45:04] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74597 [09:46:02] !log reedy synchronized wmf-config/ [09:46:12] Logged the message, Master [09:46:48] thanks reedy [09:46:52] that fixed it [09:48:05] Making Gerrit do cherry picks is great [09:48:42] * aude not done that ye [09:48:43] t [09:49:00] does it actually work? [09:49:10] Looks to be fine https://gerrit.wikimedia.org/r/#/c/74598/ [09:49:26] You can almost do it without touching your keyboard [09:49:27] wow, nice [09:49:46] You just need to press 1 key to get a list of branches [09:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 09:54:40 UTC 2013 [09:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [09:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 09:57:37 UTC 2013 [09:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [09:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 09:58:49 UTC 2013 [09:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [10:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 10:02:40 UTC 2013 [10:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [10:04:46] RECOVERY - Solr on vanadium is OK: All OK [10:10:12] hello [10:11:02] !log reedy synchronized php-1.22wmf11/includes/Setup.php [10:11:11] Logged the message, Master [10:11:42] New patchset: Hashar; "solr: __version__ magic field to GeoData schema" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74521 [10:19:09] !log reedy synchronized wmf-config/InitialiseSettings.php 'Disable ShortUrl on testwiki' [10:19:20] Logged the message, Master [10:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 10:24:46 UTC 2013 [10:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [10:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 10:28:53 UTC 2013 [10:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [10:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 10:32:36 UTC 2013 [10:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [10:39:39] New patchset: Reedy; "Disable ShortUrl on testwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74601 [10:40:09] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74601 [10:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 10:54:45 UTC 2013 [10:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [10:58:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 10:58:08 UTC 2013 [10:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [10:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 10:58:45 UTC 2013 [10:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [11:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 11:02:39 UTC 2013 [11:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [11:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 11:24:49 UTC 2013 [11:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [11:27:56] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 11:27:46 UTC 2013 [11:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [11:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 11:28:52 UTC 2013 [11:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [11:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 11:32:43 UTC 2013 [11:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [11:35:13] New patchset: Addshore; "Revert "Enabling the 'property-create' right for all users on testwikidatawiki Bug: 51637"" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74609 [11:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 11:54:45 UTC 2013 [11:55:44] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [11:57:15] New patchset: Vogone; "Enabling the 'property-create' right for all users on testwikidatawiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74609 [11:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 11:57:40 UTC 2013 [11:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [11:59:16] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 11:59:08 UTC 2013 [12:00:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [12:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 12:24:51 UTC 2013 [12:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [12:26:19] New patchset: Faidon; "(power)dns: support multiple listen addresses" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74615 [12:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 12:27:39 UTC 2013 [12:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [12:28:45] New patchset: Faidon; "(power)dns: support multiple listen addresses" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74615 [12:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 12:28:50 UTC 2013 [12:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [12:30:56] PROBLEM - Puppet freshness on manutius is CRITICAL: No successful Puppet run in the last 10 hours [12:32:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 12:32:46 UTC 2013 [12:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [12:39:41] New patchset: Manybubbles; "Add elasticsearch module and role." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74534 [12:40:17] New review: Manybubbles; "Patch set 2 just converts tabs to spaces. I'll get the next set of problems now." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74534 [12:43:24] hey manybubbles :) [12:43:30] I'm around if you need anything [12:43:42] New patchset: Hashar; "lanthanum as a jenkins slave" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/64601 [12:44:03] New review: Hashar; "rebased and inserted admins::mortals" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/64601 [12:44:58] paravoid: cool. I'm not worried about puppet so much as our standards. I only implemented things the way I did because I was getting clues from the solr module. Probably the wrong place to learn from. [12:47:25] yep [12:51:10] manybubbles: hiii there :-] [12:51:22] yo [12:51:41] manybubbles: so setting up EL/Solr/whatever in labs should be pretty simple : create instance / apply manifest / tweak mw config , done :-] [12:51:57] though there are various steps in between such as writing code and debugging it hehe [12:52:06] but beta itself should not be a trouble hehe [12:52:13] sweet! [12:52:31] I'll have puppet code to setup the instances sometime today once it passes our standards [12:52:36] PROBLEM - Disk space on cp1058 is CRITICAL: DISK CRITICAL - free space: /srv/sda3 12433 MB (3% inode=99%): /srv/sdb3 17843 MB (5% inode=99%): [12:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 12:54:50 UTC 2013 [12:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [12:58:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 12:58:07 UTC 2013 [12:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [12:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 12:58:42 UTC 2013 [12:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [13:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 13:02:37 UTC 2013 [13:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [13:05:56] New review: Aude; "this is not the issue." [operations/mediawiki-config] (master) C: -1; - https://gerrit.wikimedia.org/r/74609 [13:24:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 13:24:44 UTC 2013 [13:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [13:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 13:27:37 UTC 2013 [13:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [13:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 13:28:49 UTC 2013 [13:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [13:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 13:32:42 UTC 2013 [13:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [13:33:27] https://gerrit.wikimedia.org/r/#/c/74158/ could use some love. Just sayin'. :-) [13:49:42] New patchset: Manybubbles; "Add elasticsearch module and role." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74534 [13:50:19] New patchset: Aude; "Move property-create for * to after loading of Wikibase" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74620 [13:50:58] New review: Manybubbles; "Publishing some comments on an old revision. There are two things Faidon wanted me to do that I'm n..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74534 [13:52:48] New review: Manybubbles; "Patch set 3 does most the things that Faidon asked for except the two the I'm not sure are good idea..." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74534 [13:52:49] New patchset: Aude; "Move property-create for * to after loading of Wikibase" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74620 [13:53:40] New patchset: Aude; "Move property-create for * to after loading of Wikibase" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74620 [13:55:06] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 13:55:01 UTC 2013 [13:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [13:58:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 13:58:08 UTC 2013 [13:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [13:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 13:58:44 UTC 2013 [13:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [14:01:36] New review: Ottomata; "Ayyye, but I'm trying to do a rewrite here, just moving what was already there into a different plac..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74380 [14:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 14:02:35 UTC 2013 [14:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [14:14:14] manybubbles: no package ensure => absent in the decom class? [14:15:06] ottomata: with the new code this:https://gerrit.wikimedia.org/r/#/c/74534/3/modules/elasticsearch/manifests/init.pp becomes "include java; Class['java'] -> Class['elasticsearch']", right? [14:15:10] oh wait, it needs the jdk? [14:16:07] paravoid: sure. is it normal to remove all the dependencies? [14:16:29] no, I think just removing the package is fine [14:17:14] paravoid: the jdk has all the debugging tools for java - stuff like the command to dump heap memory from a frozen vm [14:17:48] oh it's fine, I'm just wondering how that would work with the new java stuff :) [14:18:17] New review: Faidon; "Good enough reason then. But we need to follow up and clean this up :)" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/74380 [14:18:51] paravoid: the new java stuff doesn't ship the jdk? [14:19:06] not with "include java" I think [14:19:30] * paravoid looks [14:19:59] ah, yes it does [14:20:35] ottomata: wanna fix ori-l's comment and push it? [14:20:39] let me push the latest and you can have a look. I imagine I should wait until after that java stuff merges and do another revision with the new stuff then. [14:20:46] or maybe it won't be long... [14:20:49] yeah, it's pretty close [14:21:39] I think we need to be explicit though [14:21:48] so with the new java stuff it'd be [14:22:29] java::install { ensure => 'installed', distribution => 'openjdk', jdk => true, version => '7' } [14:22:41] er [14:23:07] grr, it's so broken [14:23:20] because it's a define [14:23:23] wtf is it a define [14:24:08] ideally we'd just have include java::openjdk or something... [14:24:39] now it's a define, which means that someone else can set up a differently named java::install that does the same [14:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 14:24:47 UTC 2013 [14:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [14:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 14:27:41 UTC 2013 [14:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [14:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 14:28:53 UTC 2013 [14:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [14:29:35] New review: Faidon; "So, I'm thinking over again and I think the whole thing here is pretty broken." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74380 [14:31:11] paravoid: https://gerrit.wikimedia.org/r/#/c/74158/ could use some love. Just sayin'. :-) [14:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 14:32:36 UTC 2013 [14:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [14:34:23] New review: Faidon; "So, I'm OK with you pushing this minimal change and not getting blocked by a rework as described abo..." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74380 [14:34:30] New patchset: Manybubbles; "Add elasticsearch module and role." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74534 [14:37:10] New review: Manybubbles; "Last commit sets the node.name of each instance to its hostname and remove the elasticsearch package..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74534 [14:37:28] New review: Faidon; "LGTM. Maybe the ops folks who help with search want to have a look too though." [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/74534 [14:39:24] !log continuous integration : slightly adjusted main page https://integration.wikimedia.org/ (see integration/docroot.git ) [14:39:35] Logged the message, Master [14:41:15] manybubbles: so are we setting up ES in production or is the role class ::production just the reference of how we'd use it in production? [14:50:18] !log Upgraded mobile caches to varnish 3.0.3~rc1-wm14 [14:50:28] Logged the message, Master [14:53:30] New patchset: Faidon; "(power)dns: support multiple listen addresses" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74615 [14:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 14:54:45 UTC 2013 [14:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [14:56:25] paravoid: consider it a blind guess. If we'd prefer to not have it I can just drop it and we'll add it when we set it up. [14:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 14:57:38 UTC 2013 [14:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [14:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 14:58:50 UTC 2013 [14:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [15:02:36] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 15:02:35 UTC 2013 [15:03:04] Change merged: Andrew Bogott; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74385 [15:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [15:07:06] looks like git.wm.org is down again, can someone take a look? [15:07:26] (last time Chad had to restart gitblit) [15:11:04] no init or upstart script [15:11:05] sigh [15:24:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 15:24:44 UTC 2013 [15:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [15:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 15:27:42 UTC 2013 [15:28:22] New review: Ottomata; "I thought about creating classes for all of the different options and keeping java::install around j..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74380 [15:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [15:29:06] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 15:29:04 UTC 2013 [15:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [15:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 15:32:41 UTC 2013 [15:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [15:33:56] PROBLEM - Puppet freshness on fenari is CRITICAL: No successful Puppet run in the last 10 hours [15:45:56] PROBLEM - Puppet freshness on bast1001 is CRITICAL: No successful Puppet run in the last 10 hours [15:58:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 15:58:07 UTC 2013 [15:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [15:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 15:58:42 UTC 2013 [15:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [16:00:32] Change merged: Andrew Bogott; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/73888 [16:03:57] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 16:03:11 UTC 2013 [16:04:32] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [16:09:55] New patchset: Ottomata; "Slight restructure for java module." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74380 [16:15:53] manybubbles: since it seems like we're going down the elastic search route, can I close out https://rt.wikimedia.org/Ticket/Display.html?id=5323 [16:17:00] manybubbles: since it seems like we're going down the elastic search route, can I close out https://rt.wikimedia.org/Ticket/Display.html?id=5323 [16:18:47] notpeter: I think so. Do you happen to see what my login to rt is? I can't remember but I know it isn't manybubbles or nikeverett [16:19:00] or neverett [16:19:20] it should be your email address? [16:20:25] New patchset: Ottomata; "Puppetizing hue" [operations/puppet/cdh4] (master) - https://gerrit.wikimedia.org/r/69805 [16:24:50] Reedy: about? [16:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 16:24:48 UTC 2013 [16:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [16:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 16:27:38 UTC 2013 [16:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [16:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 16:28:50 UTC 2013 [16:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [16:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 16:32:39 UTC 2013 [16:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [16:33:56] PROBLEM - Puppet freshness on neon is CRITICAL: No successful Puppet run in the last 10 hours [16:50:59] New review: CSteipp; "I have no idea on the puppet syntax, but the feature is very much desired." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74592 [16:55:06] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 16:55:00 UTC 2013 [16:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [16:56:40] New review: Ottomata; "Actually, hm. I'm not up for a bunch of changes inside of java::install right now. I'd be happy to..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74380 [16:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 16:57:43 UTC 2013 [16:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [16:59:16] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 16:59:09 UTC 2013 [16:59:56] PROBLEM - Puppet freshness on analytics1019 is CRITICAL: No successful Puppet run in the last 10 hours [17:00:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [17:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 17:02:37 UTC 2013 [17:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [17:03:56] PROBLEM - Puppet freshness on analytics1018 is CRITICAL: No successful Puppet run in the last 10 hours [17:04:56] PROBLEM - Puppet freshness on analytics1020 is CRITICAL: No successful Puppet run in the last 10 hours [17:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 17:24:49 UTC 2013 [17:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [17:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 17:27:40 UTC 2013 [17:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [17:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 17:28:46 UTC 2013 [17:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [17:30:24] New patchset: Manybubbles; "Add elasticsearch module and role." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74534 [17:30:43] stupid gerrit I just changed the desription, didn't make a new patchset [17:31:01] oh wow! that did make a new patch set..... [17:31:43] New patchset: Manybubbles; "Add elasticsearch module and role." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74534 [17:31:55] there, if it has to be a commit message now it wraps at 80 columns [17:32:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 17:32:49 UTC 2013 [17:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [17:34:24] ^demon: I've been wondering about moving some of what we do in CirrusSearch to the job queue - it'd give us more resiliency and would give a nice path for reindexing documents when templates that they include change. [17:34:35] does that sound sane? [17:35:20] about resiliency, i'm kinda new here, but i was informed that just because a job is in the queue, its not guaranteed to run. Things occasionally(i dunno if thats 1 in a million?) get lost [17:37:02] <^demon> manybubbles: My worry with the jobqueue is that you can't assume it'll run in $someTimeFrame. For something like reindexing, I dunno if that'd be acceptable. [17:38:03] ^demon: huh.... I'm not sure we have a choice with things like transcluded pages - we just can't afford to reindex them all in process. [17:38:58] <^demon> Well with the transcluded ones, we could possibly do it. [17:39:15] <^demon> But if I edit a page, I think doing it in-process like we do now makes more sense. [17:39:56] PROBLEM - Puppet freshness on erzurumi is CRITICAL: No successful Puppet run in the last 10 hours [17:39:56] PROBLEM - Puppet freshness on lvs1004 is CRITICAL: No successful Puppet run in the last 10 hours [17:39:56] PROBLEM - Puppet freshness on lvs1006 is CRITICAL: No successful Puppet run in the last 10 hours [17:39:56] PROBLEM - Puppet freshness on lvs1005 is CRITICAL: No successful Puppet run in the last 10 hours [17:39:56] PROBLEM - Puppet freshness on virt1 is CRITICAL: No successful Puppet run in the last 10 hours [17:39:56] PROBLEM - Puppet freshness on virt3 is CRITICAL: No successful Puppet run in the last 10 hours [17:39:56] PROBLEM - Puppet freshness on virt4 is CRITICAL: No successful Puppet run in the last 10 hours [17:40:53] furthermore, with job queue jobs can be sometimes lost:) [17:41:16] manybubbles & ^demon, so elastic is a final choice? [17:41:50] MaxSem: do we track that at all? I'd be curious to learn more about lost jobs [17:42:48] MaxSem: I'm 90% sure it'll be elastic. Right now we're talking to some of the parties that were more in the solr camp and seeing what they think we lose with it. After playing with it for a week I see it as nicer. [17:43:55] ebernhardson, previously when the queue was in DB, there was a pattern of wikis feeling bad, someone noticing an overgrown queue and truncating it. while this particular scenario is unlikely now, I personally still don't consider JQ 100% reliable:) [17:44:56] PROBLEM - Puppet freshness on ms-fe1002 is CRITICAL: No successful Puppet run in the last 10 hours [17:46:19] MaxSem: good to know, thanks. I suppose in general there is no way to actually guarantee reliability from php(see for example, kraken integration) [17:47:13] MaxSem: even if we dont truncate things :) [17:49:24] Ryan_Lane, since you'll be on duty next week, will you be available for a Varnish flush on Wednesday 11 AM PST? [17:49:38] yes [17:49:46] cool:) [17:50:56] PROBLEM - Puppet freshness on ms-fe1003 is CRITICAL: No successful Puppet run in the last 10 hours [17:53:51] ^demon: cool.... Would you mind if I have a look at transcluded pages via the job queue then? [17:54:17] <^demon> Yeah sure :) [17:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 17:54:45 UTC 2013 [17:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [17:55:56] PROBLEM - Puppet freshness on ms-fe1004 is CRITICAL: No successful Puppet run in the last 10 hours [17:58:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 17:58:08 UTC 2013 [17:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [17:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 17:58:43 UTC 2013 [17:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [17:59:28] New patchset: Lcarr; "Revert "virt1007 is new aggregator"" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74654 [18:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 18:02:40 UTC 2013 [18:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [18:05:06] RECOVERY - Puppet freshness on fenari is OK: puppet ran at Fri Jul 19 18:04:59 UTC 2013 [18:07:05] manybubbles, ^demon: do you have any plans (or documents!) on how would you structure ES? [18:07:29] i.e. how many shards, how many indexes, how many wikis per each, if we're going to reuse database shard groupings [18:08:04] how will you structure your document json [18:08:04] paravoid: we're not really sure how we want to do it - I don't think we'll get it right unless we experiment with it some and do more research [18:08:09] I realize this might be way too early [18:08:11] ah, heh :) [18:08:13] right [18:08:14] okay :) [18:08:18] I was just wondering [18:08:24] paravoid: the document json we have something - but it could change [18:08:25] (no reason in particular) [18:08:46] but yeah, I'm pertty sure we'll want something along the lines of the database layout [18:09:10] nod [18:10:16] Change merged: Lcarr; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74654 [18:10:44] also, another thing that might be worth it for your RFC [18:11:05] (and I think I've heard this over IRC, I'm just saying it for the rest of your readers) [18:11:19] is the use cases you're trying to cover with this work [18:11:33] that is, not just an lsearchd replacement but also geodata etc. [18:11:56] PROBLEM - Puppet freshness on ms-fe1001 is CRITICAL: No successful Puppet run in the last 10 hours [18:12:37] New patchset: Demon; "Kill current gerrit-wm since gerrit-wm's replacement is ready" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74656 [18:12:56] RECOVERY - Puppet freshness on bast1001 is OK: puppet ran at Fri Jul 19 18:12:48 UTC 2013 [18:13:32] paravoid: I think it is a phased thing - but ultimately I think geodata is on the radar as it were. ^demon will have to correct me if I'm wrong though. [18:14:24] <^demon> Yeah that's right. [18:16:12] New review: Faidon; "LGTM" [operations/puppet/cdh4] (master) C: 2; - https://gerrit.wikimedia.org/r/69805 [18:25:26] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 18:25:18 UTC 2013 [18:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [18:27:41] paravoid: ok, this si the debian/* patchset https://gerrit.wikimedia.org/r/#/c/74651/ [18:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 18:27:42 UTC 2013 [18:27:51] <^demon> Ryan_Lane: We're going to move to the new gerrit-wm today. Mind looking at https://gerrit.wikimedia.org/r/#/c/74656/ and its parent? [18:27:57] paravoid: there are few additions in comparison to https://gerrit.wikimedia.org/r/#/c/68711/ [18:28:03] paravoid: the only addition is the libdclass-java [18:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [18:28:27] paravoid: we should drop #68711 as we decided we'll do that [18:28:27] ^demon: you're deleting so much of my code :( [18:28:30] heh [18:28:45] s#:(#\o/# [18:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 18:28:52 UTC 2013 [18:29:01] <^demon> Ryan_Lane: Considering how much we hate that bot's code, I assumed you'd be like "OMG I OWE YOU ONE" [18:29:13] paravoid: please review #74651 [18:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [18:30:37] ^demon: I should move lolrrit-wm's code to gerrit, I think. [18:30:39] it's on my github now [18:30:52] Ryan_Lane: ^demon doesn't want to call the bot lolrrit-wm, which is sad. [18:31:00] hahaha [18:31:05] github? :( [18:31:16] <^demon> YuviPanda loves him some github. [18:31:23] Ryan_Lane: yeah, I'd move it to gerrit soon. [18:31:28] put it in gerrit and have your bot ... [18:31:28] hah [18:31:32] Ryan_Lane: but I'll still use GitHub to do pull requests [18:31:32] nevermind then [18:31:38] that's fine [18:31:41] you have a bot for this [18:31:42] Ryan_Lane: I do that all the time :P My patch to core yesterday was via that too, for example [18:31:58] I <3 that bot, btw [18:32:00] Ryan_Lane: I need to enable it for operations/puppet so my toollabs puppet patches can also go that way. [18:32:10] Ryan_Lane: oh? YOu don't like GitHub do you? [18:32:44] just because I don't like it doesn't mean everyone should be forced to not use it [18:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 18:32:43 UTC 2013 [18:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [18:33:10] awww :) [18:33:29] give github some love, it will love you back [18:33:30] :D [18:34:14] <^demon> I don't want github's love. Gerrit is a jealous mistress and wouldn't be happy with me ;-) [18:34:14] it does, it does :) [18:34:24] ^demon: StockHolm syndrome :) [18:35:35] Change merged: Ottomata; [operations/puppet/cdh4] (master) - https://gerrit.wikimedia.org/r/69805 [18:41:04] Ryan_Lane, I'm getting a git-deploy error: http://pastebin.com/r5mYkbJU [18:41:48] Ryan_Lane: "This repo isn't configured. Have you added it in puppet? Exiting." That looks very scary [18:42:21] continuing for now [18:42:36] it's not going to work if you continue [18:42:44] let me check something [18:42:49] as in not doing anything? [18:43:18] did someone merge my change!? [18:43:18] one sec [18:43:28] Parsoid is still up, so no hurry [18:43:29] nope. wasn't that.... [18:43:39] I did a refactor recently [18:43:45] but it's not merged yet [18:44:28] oh [18:44:31] actually, it'll continue to work [18:44:39] reporting on the client is just broken [18:44:58] wait.. the pillar data is missing [18:45:01] Ryan_Lane: it finished both the sync and restart very quickly [18:45:15] I don't think it actually did anything on the clients [18:45:19] it didn't [18:45:26] k [18:45:33] was tin rebooted? [18:45:41] nope [18:46:07] ahhhhh [18:46:09] damn it [18:46:22] found it [18:46:34] it was a refactor from last week [18:48:29] I'll fix. one sec [18:48:40] temporary workaround for now... [18:51:06] what happens to all git patchsets/reviews ? [18:51:15] they all stack up on a server somewhere [18:51:21] right ? [18:52:54] gwicke: fixed [18:53:00] try now [18:54:09] ok, forcing it.. [18:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 18:54:46 UTC 2013 [18:55:03] looks promising [18:55:15] heh [18:55:18] !log updated Parsoid to d5fe6c9 [18:55:28] I had accidentally wiped out all the pillar data on all the minions [18:55:28] Logged the message, Master [18:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [18:55:48] Ryan_Lane: seems to have worked [18:55:50] because I changed how I was targeting minions to use grains, but I forgot the matching statement for grains [18:55:52] great [18:55:56] off to lunch I go [18:55:56] hey - my instance has locked up [18:56:37] parsoid is up, load on the parsoid cluster is about to drop to zeroish again after all those hanging workers were killed [18:56:48] https://ganglia.wikimedia.org/latest/graph_all_periods.php?c=Parsoid%20eqiad&m=cpu_report&r=hour&s=by%20name&hc=4&mc=2&st=1374260135&g=cpu_report&z=large [18:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 18:57:43 UTC 2013 [18:58:15] and now all is well [18:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [18:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 18:58:54 UTC 2013 [18:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [19:03:16] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 19:03:13 UTC 2013 [19:04:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [19:19:45] New patchset: Hashar; "beta: creates loginwiki project" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74670 [19:20:32] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74670 [19:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 19:24:52 UTC 2013 [19:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [19:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 19:27:40 UTC 2013 [19:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [19:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 19:28:51 UTC 2013 [19:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [19:31:02] greg-g: I'm updating Parsoid in wmf10 and 11 to what has been deployed before [19:31:44] had not updated the not-deployed-yet branches, which caused them to switch back to old code now [19:34:50] !log gwicke synchronized php-1.22wmf10/extensions/Parsoid/php/Parsoid.php 'restore Parsoid to previously deployed newer code' [19:35:00] Logged the message, Master [19:35:25] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 19:35:18 UTC 2013 [19:35:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [19:35:48] !log gwicke synchronized php-1.22wmf10/extensions/Parsoid/php/ParsoidCacheUpdateJob.php 'restore Parsoid to previously deployed newer code' [19:36:00] Logged the message, Master [19:36:44] !log gwicke synchronized php-1.22wmf11/extensions/Parsoid/php/Parsoid.php 'restore Parsoid to previously deployed newer code' [19:36:54] Logged the message, Master [19:37:23] !log gwicke synchronized php-1.22wmf11/extensions/Parsoid/php/ParsoidCacheUpdateJob.php 'restore Parsoid to previously deployed newer code' [19:37:34] Logged the message, Master [19:38:58] New review: Yuvipanda; "The replacement is ready to deploy now :)" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/74656 [19:43:17] !log shutting down spence for decommissioning [19:43:26] Logged the message, Master [19:46:42] * gwicke is very surprised to find .ppt files in mediawiki-config [19:47:52] * MaxSem puts a couple of .exes on gwicke's disk [19:48:35] * gwicke throws in some .bat files for good measure [19:52:20] commit 9df62516a0c2ccbf894e0972a294c8a92888512d [19:52:21] Author: Timo Tijhof [19:52:22] Date: Wed Jul 17 16:23:20 2013 +0000 [19:52:23] Untracked deployed files: docroot/foundation/presentations/**/*.ppt [19:52:30] haha [19:52:56] I suppose someone will revertwar that :) [19:52:57] too late, it is already in the history [19:53:09] so everybody will have to download a few megs of ppts [19:53:28] force push! rewrite history! :D [19:54:55] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 19:54:45 UTC 2013 [19:54:56] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [19:55:20] hmm, there are merged but undeployed changes to CommonSettings.php [19:55:36] commit b7f242bd1b8eed454e8592a58edc9f3b135fb124 [19:55:38] Author: aude [19:55:39] Date: Fri Jul 19 09:42:32 2013 +0000 [19:55:41] readd enableSiteLinkWidget setting for Wikibase [19:57:45] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 19:57:38 UTC 2013 [19:57:46] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [19:58:55] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 19:58:45 UTC 2013 [19:59:45] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [20:02:45] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 20:02:39 UTC 2013 [20:02:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [20:14:01] is there an easy way to check which version of CommonSettings.php is currently deployed? [20:14:56] RoanKattouw: ^^ [20:14:57] gwicke: It's in the mediawik-config repo [20:15:16] master of CommonSettings.php should be current, always [20:15:20] right, but when somebody did not sync after merge that is not helpful [20:15:23] If it's not, someone's been naughty [20:15:25] I think that is the case currently [20:15:26] Yes, exactly [20:15:28] OK [20:15:39] Well, so let's see [20:16:17] git pull also pulled in aude's commit [20:16:26] Great [20:16:59] and some more that did not touch CommonSettings actually [20:17:31] Oh OK [20:17:37] It looks like the only CommonSettings change is yours [20:18:00] no, I got a two-line diff, so aude's b7f242bd1b8 is likely not yet deployed [20:18:10] could you check if that is the case? [20:18:16] Checking [20:18:30] No it is [20:18:59] catrope@fenari:~$ scp mw1011:/apache/common/wmf-config/CommonSettings.php . [20:19:04] catrope@fenari:~$ scp tin:/a/common/wmf-config/CommonSettings.php CommonSettings-new.php [20:19:16] catrope@fenari:~$ diff -u CommonSettings.php CommonSettings-new.php [20:19:16] Which only shows a 1-line Parsoid-related change [20:19:30] ok [20:19:32] I'll sync then [20:19:36] thanks! [20:20:16] !log gwicke synchronized wmf-config/CommonSettings.php 'Slightly increase Parsoid dequeue rate' [20:20:25] Logged the message, Master [20:21:52] RoanKattouw: I upped the number of titles per job from 8 to 10 [20:22:18] if there are issues with template updates being processed too quickly, you can always reduce that number again [20:22:29] b7f242bd1b8 is deployed? [20:22:43] i have another settings patch [20:22:44] aude: Best I can tell it is [20:22:47] OK [20:22:56] yeah, it was fixed this morning [20:23:12] https://gerrit.wikimedia.org/r/#/c/74620/ is the other one, but not critical [20:23:19] If people would, you know, actually deploy things immediately after merging them like they're supposed to, we wouldn't be asking this question :( [20:23:26] i think reedy did [20:23:41] i'm sure he did [20:25:02] He probably did [20:25:03] just re-checked, and indeed your patch was deployed [20:25:05] He also did things after that [20:25:10] ok [20:25:13] but Antoine's patch was not [20:25:29] checked out revision before my pull was 80726d4 [20:25:59] commit d1b1e21948026a455fbdc742c6df415757c38f4e [20:26:00] Author: Antoine Musso [20:26:02] Date: Fri Jul 19 21:19:11 2013 +0200 [20:26:03] beta: creates loginwiki project [20:26:05] oh no [20:26:08] that is not deployed yet [20:26:16] oh my god sorry [20:26:20] forgot to rebase on tin [20:26:46] that adds entry under all-labs.dblist and wikiversions-labs.dat [20:26:50] (for beta) [20:26:53] yup [20:26:55] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 20:26:49 UTC 2013 [20:26:57] New patchset: Demon; "Kill current gerrit-wm since gerrit-wm's replacement is ready" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74656 [20:27:08] gwicke: could you rebase it ? [20:27:16] hashar: should that be deployed now to be consistent again? [20:27:31] yeah I can do it if you want [20:27:34] hashar: it is merged [20:27:45] do you want to revert it? [20:27:55] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [20:28:06] that is only for beta [20:28:17] I simply forgot to apply the patch on tin.eqiad.wmnet [20:28:36] I don't know if deploying it would be safe [20:28:47] I do it constantly :) [20:29:05] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 20:28:56 UTC 2013 [20:29:07] that only impacts labs (hence the -labs prefix). [20:29:16] prod uses different files [20:29:28] let me sync the files :) [20:29:38] ok, thanks! [20:29:45] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [20:29:54] !log hashar synchronized all-labs.dblist [20:30:05] Logged the message, Master [20:30:20] hashar@tin:/a/common$ sync-file wikiversions-labs.dblist [20:30:20] Could not open input file: /a/common/wikiversions-labs.dblist [20:30:20] pfff [20:30:46] I am tired [20:30:58] gwicke: done. Sorry for the confusion :( [20:30:59] !log hashar synchronized wikiversions-labs.dat [20:31:08] Logged the message, Master [20:31:09] hashar: no worries [20:31:12] nothing broke [20:31:29] why are you still awake btw? [20:31:35] ;) [20:31:38] the mediawiki conf files refers to wikiversions.dat / all.dblist using a wrapper getRealmSpecificFilename( $file ); [20:31:45] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 20:31:40 UTC 2013 [20:31:51] which returns the proper file name based on $wmfRealm and $wmfSite [20:32:09] it is 10pm30 only but you are right I should head to bed [20:32:32] my time zone conversions are getting a bit rusty [20:32:45] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [20:32:45] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 20:32:42 UTC 2013 [20:33:42] 6 (europe to nyc) + 3 (nyc to sf) :) [20:33:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [20:33:56] that's how i remember [20:34:19] when everyone says "out for lunch" I know it is 9pm in Europe. [20:34:21] (noon SF) [20:34:27] heh [20:42:35] finally bed time. have a good weekend [20:43:47] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74656 [20:55:15] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 20:55:06 UTC 2013 [20:55:55] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [20:57:55] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 20:57:50 UTC 2013 [20:58:45] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [20:58:55] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 20:58:45 UTC 2013 [20:59:45] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [21:02:45] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 21:02:35 UTC 2013 [21:02:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [21:03:29] gerrit-wm cannot apparently send to this channel, and I'm trying to investigate why. [21:07:10] ^demon: hmm, I can't send to channel even from irssi, nicked to gerrit-wm [21:08:36] testing again? [21:08:49] ^demon: this is very confusing now. it works if I'm *not* nicked to gerrit-wm [21:09:02] <^demon> Was gerrit-wm +v'd or something before? [21:09:03] <^demon> Or something else weird? [21:09:18] no idea, and even if it were it should stick across restarts, no? [21:09:31] ^demon: hmm, the other bots are +v [21:09:31] 'd [21:09:40] ^demon: can you +v the bot in this channel? [21:09:41] <^demon> One would think [21:09:48] <^demon> No, I can't, no ops :( [21:09:57] ^demon: but why does that mean I can speak nick'd to something else? [21:10:23] ^demon: I'm starting it up as lolrrit-wm, see if that works [21:10:52] (CR) Ottomata: "Faidon, I'd appreciate a review of this one too. I'm not so sure that /etc/hue is a good place to store the default self-signed hue certs. I tried to store these in /etc/ssl, but /etc/ssl/private is not readable by the hue user. Is there a better place?" [operations/puppet/cdh4] - https://gerrit.wikimedia.org/r/74686 [21:10:54] (PS1) Demon: Remove gitweb, we don't use it anymore [operations/puppet] - https://gerrit.wikimedia.org/r/74687 [21:10:55] (PS1) Demon: Allow spiders to index gerrit again [operations/puppet] - https://gerrit.wikimedia.org/r/74688 [21:11:02] ^demon: :) [21:11:12] <^demon> weird. [21:12:41] ^demon: I'm tempted to call it done and leave it as is [21:12:56] (PS1) Odder: (bug 51684) Modify two variables for Ukrainian Wikisource [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/74691 [21:13:14] <^demon> We can figure it out later, not the most pressing problem. [21:13:24] ^demon: yeah. let me email wikitech-l [21:13:51] ^demon: this was a nice quick and fun project. Thanks for poking me about it :) Let me know if there's more that can be done with the gerrit -> redis infrastructure :) [21:15:55] <^demon> Thank you! You have made me and Ryan_Lane very very happy campers. [21:16:12] <^demon> One step closer to killing gerrit2 user ;-) [21:16:14] YuviPanda: indeed, this is an awesome change [21:16:19] I hated those hooks [21:16:42] Ryan_Lane: :) it also is easier to maintain now, I think. Living on toollabs, etc. [21:16:53] it is, yeah [21:17:04] YuviPanda: is the config living anywhere? [21:17:13] in case people want to push in changes? people did so with the ircecho config [21:17:18] Ryan_Lane: https://github.com/yuvipanda/lolrrit-wm/blob/master/config.yaml [21:17:26] Ryan_Lane: and ^demon is co-maintainer on toollabs [21:17:31] <^demon> We should move that to gerrit too. [21:17:31] cool [21:17:37] one less thing I need to maintain :) [21:17:39] <3 [21:18:17] Ryan_Lane: hehe :D [21:18:24] ^demon: want to create a gerrit repo for it? [21:18:37] <^demon> Yeah, where should we stash it? [21:19:03] ^demon: good question. I've no idea. [21:19:29] <^demon> Naming things is hard ;-) [21:19:50] <^demon> Oh, I suppose labs/tools/something would make sense. [21:20:06] ^demon: hmm, Coren already had a scheme in mind [21:20:07] I think? [21:20:14] /labs/tools/lolrrit [21:20:53] ^demon: stopping to prevent l10n bot spam [21:21:18] ^demon: nothing will be lost though, thanks to redis queing. [21:21:32] <^demon> labs/tools/lolrrit now created, empty so we can push the history from your github repo. [21:21:51] ^demon: sweet. [21:22:50] <^demon> Made you and me owners of it. Should be able to review stuff for it now and so forth. [21:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 21:24:48 UTC 2013 [21:25:55] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [21:27:55] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 21:27:46 UTC 2013 [21:28:45] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [21:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 21:28:53 UTC 2013 [21:29:45] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [21:30:05] PROBLEM - SSH on pdf3 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [21:30:55] RECOVERY - SSH on pdf3 is OK: SSH OK - OpenSSH_4.7p1 Debian-8ubuntu3 (protocol 2.0) [21:35:45] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 21:35:40 UTC 2013 [21:36:35] (PS4) Ottomata: Adding role::analytics::hue [operations/puppet] - https://gerrit.wikimedia.org/r/74388 [21:36:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [21:50:04] (PS1) Andrew Bogott: Updated a bunch of file headers. [operations/puppet] - https://gerrit.wikimedia.org/r/74794 [21:50:05] (PS1) Andrew Bogott: Replace generic::sysctl::advanced-routing [operations/puppet] - https://gerrit.wikimedia.org/r/74795 [21:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 21:54:44 UTC 2013 [21:54:55] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [21:56:10] (CR) Andrew Bogott: [C: 2] Updated a bunch of file headers. [operations/puppet] - https://gerrit.wikimedia.org/r/74794 [21:56:11] (Merged) Andrew Bogott: Updated a bunch of file headers. [operations/puppet] - https://gerrit.wikimedia.org/r/74794 [21:57:35] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 21:57:33 UTC 2013 [21:57:45] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [21:58:55] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 21:58:50 UTC 2013 [21:59:46] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [22:03:15] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 22:03:06 UTC 2013 [22:03:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [22:10:53] (PS1) Andrew Bogott: Replace generic::sysctl::advanced-routing-ipv6 [operations/puppet] - https://gerrit.wikimedia.org/r/74798 (owner: Andrew Bogott) [22:18:40] (PS1) Andrew Bogott: Replace generic::sysctl::ipv6-disable-ra [operations/puppet] - https://gerrit.wikimedia.org/r/74799 (owner: Andrew Bogott) [22:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 22:24:53 UTC 2013 [22:24:57] (PS1) Andrew Bogott: Replace generic::sysctl::lvs [operations/puppet] - https://gerrit.wikimedia.org/r/74800 (owner: Andrew Bogott) [22:25:59] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [22:27:55] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 22:27:46 UTC 2013 [22:28:45] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [22:29:05] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 22:28:57 UTC 2013 [22:29:45] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [22:31:55] PROBLEM - Puppet freshness on manutius is CRITICAL: No successful Puppet run in the last 10 hours [22:32:34] (PS1) Andrew Bogott: Replace generic::sysctl::high-bandwidth-rsync [operations/puppet] - https://gerrit.wikimedia.org/r/74802 (owner: Andrew Bogott) [22:34:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 22:34:44 UTC 2013 [22:35:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [22:38:09] gerrit-wm: can you talk? [22:40:09] clearly not [22:49:04] (PS1) Andrew Bogott: Use the sysctlfile module in base::sysctl. [operations/puppet] - https://gerrit.wikimedia.org/r/74804 (owner: Andrew Bogott) [22:51:40] (PS1) Ori.livneh: Clean-up of 'eventlogging::monitor' class. [operations/puppet] - https://gerrit.wikimedia.org/r/74806 (owner: Ori.livneh) [22:53:16] andrewbogott: can this ^ ride shotgun with your patch? [22:54:55] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 22:54:46 UTC 2013 [22:54:55] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [22:54:56] ori-l, I don't feel up to/qualified to review that just now, but I don't mind if it conflicts… I'll have to rebase/resolve several of those before they can go in since I didn't commit them as a series. [22:54:57] it's the last in a series of clean-up commits, i'll finally be able to sleep at night [22:55:34] well, probably not, but still. [22:55:45] Ah, sorry to leave you in suspense :) I'm just about to punch out for the night. [22:55:56] * YuviPanda looks for his time card to punch [22:56:04] no problem. anyone else feeling puppet-y? [22:58:05] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 22:57:59 UTC 2013 [22:58:45] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [22:58:45] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 22:58:44 UTC 2013 [22:59:45] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [23:01:42] (PS1) Andrew Bogott: Use the sysctlfile module in udp2log::sysctl. [operations/puppet] - https://gerrit.wikimedia.org/r/74808 (owner: Andrew Bogott) [23:02:45] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 23:02:43 UTC 2013 [23:03:35] LeslieCarr, at this point on a Friday I expect you have no more inclination to read puppet code than I have, but… I'm about to add you as a reviewer to half a dozen snack-sized patches that may be of interest. Rearranging sysctl stuff. (Should all be no-ops, except for the typos.) [23:03:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [23:04:13] ok [23:07:30] andrewbogott: relocating to hammock to review [23:07:36] the official reviewing hammock [23:08:12] * andrewbogott is in a chaise longue [23:10:45] * YuviPanda does the matrix thing with closed mouth to gerrit-wm [23:17:07] (CR) Lcarr: [C: 2] Replace generic::sysctl::advanced-routing [operations/puppet] - https://gerrit.wikimedia.org/r/74795 (owner: Andrew Bogott) [23:17:08] (Merged) Lcarr: Replace generic::sysctl::advanced-routing [operations/puppet] - https://gerrit.wikimedia.org/r/74795 (owner: Andrew Bogott) [23:17:15] `hehehe lolrit [23:18:55] (CR) Lcarr: [C: 2] Replace generic::sysctl::advanced-routing-ipv6 [operations/puppet] - https://gerrit.wikimedia.org/r/74798 (owner: Andrew Bogott) [23:19:03] lolrrit is in ur code merging ur patchez [23:19:59] andrewbogott: grrr gerrit is now being like "must rebase patches" and won't do it via the button [23:20:04] (CR) Lcarr: [C: 2] Replace generic::sysctl::ipv6-disable-ra [operations/puppet] - https://gerrit.wikimedia.org/r/74799 (owner: Andrew Bogott) [23:20:16] LeslieCarr, some of those patches touch lines next to each other, so probably there are conflicts. [23:20:20] s'alright, I'll resolve [23:20:59] (CR) Lcarr: [C: 2] Replace generic::sysctl::lvs [operations/puppet] - https://gerrit.wikimedia.org/r/74800 (owner: Andrew Bogott) [23:21:20] LeslieCarr: andrewbogott I'm going to make it grrrit-wm shortly. [23:21:42] ori-l: i'm on yours now... [23:22:36] weird, no conflicts when I rebase locally [23:22:50] (PS2) Andrew Bogott: Replace generic::sysctl::advanced-routing-ipv6 [operations/puppet] - https://gerrit.wikimedia.org/r/74798 (owner: Andrew Bogott) [23:23:42] (CR) Andrew Bogott: [C: 2] Replace generic::sysctl::advanced-routing-ipv6 [operations/puppet] - https://gerrit.wikimedia.org/r/74798 (owner: Andrew Bogott) [23:23:43] (Merged) Andrew Bogott: Replace generic::sysctl::advanced-routing-ipv6 [operations/puppet] - https://gerrit.wikimedia.org/r/74798 (owner: Andrew Bogott) [23:23:47] (CR) Lcarr: [C: -1] "(1 comment)" [operations/puppet] - https://gerrit.wikimedia.org/r/74806 (owner: Ori.livneh) [23:24:55] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 23:24:47 UTC 2013 [23:25:55] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [23:26:19] (PS2) Ori.livneh: Clean-up of 'eventlogging::monitor' class. [operations/puppet] - https://gerrit.wikimedia.org/r/74806 (owner: Ori.livneh) [23:28:55] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 23:28:46 UTC 2013 [23:29:05] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 23:29:03 UTC 2013 [23:29:45] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [23:29:45] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [23:30:23] bah, this is a tangled mess of patch conflicts… should've done this as a series. [23:30:47] (CR) Lcarr: [C: 2] Clean-up of 'eventlogging::monitor' class. [operations/puppet] - https://gerrit.wikimedia.org/r/74806 (owner: Ori.livneh) [23:30:48] (Merged) Lcarr: Clean-up of 'eventlogging::monitor' class. [operations/puppet] - https://gerrit.wikimedia.org/r/74806 (owner: Ori.livneh) [23:30:53] Anyway, thanks for reviews, LeslieCarr! I have to run but will catch up with rebases and conflicts soon [23:31:00] * ori-l hugs LeslieCarr [23:31:01] yay! [23:31:02] thank you [23:31:09] now ori-l can sleep in peace [23:31:52] (CR) Lcarr: [C: 2] Use the sysctlfile module in udp2log::sysctl. [operations/puppet] - https://gerrit.wikimedia.org/r/74808 (owner: Andrew Bogott) [23:31:53] (Merged) Lcarr: Use the sysctlfile module in udp2log::sysctl. [operations/puppet] - https://gerrit.wikimedia.org/r/74808 (owner: Andrew Bogott) [23:32:45] (CR) Lcarr: [C: 2] Use the sysctlfile module in base::sysctl. [operations/puppet] - https://gerrit.wikimedia.org/r/74804 (owner: Andrew Bogott) [23:32:46] (Merged) Lcarr: Use the sysctlfile module in base::sysctl. [operations/puppet] - https://gerrit.wikimedia.org/r/74804 (owner: Andrew Bogott) [23:33:05] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Fri Jul 19 23:33:02 UTC 2013 [23:33:13] (PS2) Lcarr: Replace generic::sysctl::high-bandwidth-rsync [operations/puppet] - https://gerrit.wikimedia.org/r/74802 (owner: Andrew Bogott) [23:33:45] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [23:33:56] (CR) Lcarr: [C: 2] Replace generic::sysctl::high-bandwidth-rsync [operations/puppet] - https://gerrit.wikimedia.org/r/74802 (owner: Andrew Bogott) [23:33:57] (Merged) Lcarr: Replace generic::sysctl::high-bandwidth-rsync [operations/puppet] - https://gerrit.wikimedia.org/r/74802 (owner: Andrew Bogott) [23:47:17] andrewbogott: err: /Stage[main]/Base::Sysctl/Exec[/sbin/start procps]: Failed to call refresh: /sbin/start procps returned 1 instead of one of [0] at /var/lib/git/operations/puppet/manifests/base.pp:304 [23:47:38] well, that's disappointing [23:47:53] Although maybe it always did that :/ [23:48:26] ori-l, what's an example of a system that's happening on? [23:48:48] andrewbogott: vanadium [23:50:11] andrewbogott: error: "Invalid argument" setting key "net.core.rmem_max" [23:50:19] that's why it's failing [23:50:40] ok [23:53:41] That's in 99-big-rmem and I don't even see where that came from [23:53:49] It's not the version of that file that I puppetized... [23:54:10] do you purge => true / recurse => true / force => true /etc/sysctl.d? [23:54:55] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Fri Jul 19 23:54:51 UTC 2013 [23:55:23] no, but the patches I wrote don't add or rename any files, just change existing ones. [23:55:37] And, actually, should only change comments in those [23:55:55] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [23:57:33] andrewbogott: ah, looks like your change is blameless; it just had the misfortune of triggering a sysctl refresh [23:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Fri Jul 19 23:57:40 UTC 2013 [23:57:58] that's what I thought… although plenty of the patches I touch add sysctl settings like that one [23:58:03] I didn't add new ones, but... [23:58:16] i deleted that file and it was not recreated by puppet, indicating that it was probably a remnant of some old puppet resource that was not ensure => absent'ed [23:58:32] Oh, yeah, I figured it was a remnant, just didn't know if it was an important remnant :) [23:58:45] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [23:59:45] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Fri Jul 19 23:59:41 UTC 2013 [23:59:56] * andrewbogott wonders if it's safe to step away from the laptop again