[01:42:09] Thehelpfulone: hey. let me take a look [01:42:24] although -> #-labs [01:43:22] PROBLEM - MySQL Slave Delay on db1025 is CRITICAL: CRIT replication delay 388 seconds [01:45:46] RECOVERY - MySQL Slave Delay on db1025 is OK: OK replication delay 0 seconds [03:51:10] RECOVERY - Puppet freshness on spence is OK: puppet ran at Mon May 21 03:50:41 UTC 2012 [04:15:53] PROBLEM - udp2log log age for emery on emery is CRITICAL: CRITICAL: log files /var/log/squid/orange-ivory-coast.log, have not been written in a critical amount of time. For most logs, this is 4 hours. For slow logs, this is 4 days. [04:44:30] Reedy: still around? [04:46:06] i wonder if stat1 ever got the reinstall? there should be an RT on it [04:46:12] May 11: 11:35 mutante: stat1 - installed new kernel, but waiting to reboot. schedule with aotto [04:46:19] but nothing more recent [04:48:34] * jeremyb tries to decide if he's reading this right [04:48:43] stat1 has nearly 10TB free? [04:49:00] and not much more than that total? [04:49:05] (ganglia) [04:50:43] also seems to have a lot of ram (30ish GB?) and CPU (16 cores?) [04:51:27] anyway, i see no obvious issues in ganglia but i don't really know what to look for [04:51:48] http://stats.wikimedia.org/EN/TablesWikipediaEN.htm is up to date through april (the top row of the table [04:51:57] http://stats.wikimedia.org/EN/TablesWikipediaID.htm is only through march [04:52:16] same with Summary${lang}.htm [04:56:00] hrmm... so those files are derived from apergos's DB dumps apparently [04:56:30] anyone have a TZ on ezachte? [05:01:20] hrmm, i wonder if it's just crond not running? [05:02:03] (how often do these pages get updates? they all seem to say they were last updated on the 16th. maybe that's normal or maybe it's frozen in time) [05:02:40] but puppet ran recently... [05:04:35] i don't see any sign that puppet ensures crond is running [05:04:41] (in the repo) [05:04:54] * jeremyb has quite a monologue going! 
;P [05:17:15] RECOVERY - udp2log log age for emery on emery is OK: OK: all log files active [06:03:58] PROBLEM - Puppet freshness on db29 is CRITICAL: Puppet has not run in the last 10 hours [06:34:16] ok, see wikitech-l for same issue as above [06:44:40] also, gerrit-wm still needs a boot. (post #mediawiki banning) [06:47:05] bye [06:47:18] bye [06:47:26] sleep!? [06:47:34] maybe! [06:48:16] * aude eating breakfast [06:48:25] {{fact}} [06:48:33] :) [07:43:28] helllo [08:12:38] New patchset: Dzahn; "add detail page for a single wiki and link in id column globally" [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8299 [08:12:39] New patchset: Dzahn; "add pages / offset to display.php, get rid of old lines / threshold, shorten sort= to s=,..." [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8300 [08:12:40] New patchset: Dzahn; "remove timestamp colors from code and into css classes, add the css file, add "..hours ago" to ts column," [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8301 [08:12:41] New patchset: Dzahn; "retab css file, add classes for version colors, make timestamp oldness configurable,..." [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8302 [08:12:41] New patchset: Dzahn; "adjust require pathes, use $IP, add required coalesced_query" [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8303 [08:12:42] New patchset: Dzahn; "enhance ugly sortswitch, get rid of undefined constant / index warnings" [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8304 [08:12:43] New patchset: Dzahn; "remove sorting stuff from detail page, move table creation and more into functions, add CSS, fix invalid HTML,..." 
[operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8305 [08:14:00] New patchset: Dzahn; "adjust device names / partition sizes in partman for analytics" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8306 [08:14:19] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8306 [08:21:03] New review: Dzahn; "(no comment)" [operations/debs/wikistats] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8299 [08:21:05] Change merged: Dzahn; [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8299 [08:21:53] New review: Dzahn; "(no comment)" [operations/debs/wikistats] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8300 [08:21:55] Change merged: Dzahn; [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8300 [08:23:02] New review: Dzahn; "(no comment)" [operations/debs/wikistats] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8301 [08:23:04] Change merged: Dzahn; [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8301 [08:23:54] New review: Dzahn; "(no comment)" [operations/debs/wikistats] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8302 [08:23:55] Change merged: Dzahn; [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8302 [08:24:49] New review: Dzahn; "(no comment)" [operations/debs/wikistats] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8303 [08:24:50] Change merged: Dzahn; [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8303 [08:25:44] New review: Dzahn; "(no comment)" [operations/debs/wikistats] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8304 [08:25:45] Change merged: Dzahn; [operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8304 [08:26:58] New review: Dzahn; "(no comment)" [operations/debs/wikistats] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8305 [08:27:00] Change merged: Dzahn; 
[operations/debs/wikistats] (master) - https://gerrit.wikimedia.org/r/8305 [08:27:51] New review: Dzahn; "Ciscos use sdc/sdd" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8306 [08:27:53] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8306 [08:29:36] PROBLEM - Puppet freshness on storage3 is CRITICAL: Puppet has not run in the last 10 hours [08:29:55] New patchset: ArielGlenn; "specific basedir as option to make deployment easier" [operations/dumps] (ariel) - https://gerrit.wikimedia.org/r/8309 [08:30:32] New review: ArielGlenn; "(no comment)" [operations/dumps] (ariel); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8309 [08:30:34] Change merged: ArielGlenn; [operations/dumps] (ariel) - https://gerrit.wikimedia.org/r/8309 [08:31:36] Change abandoned: Dzahn; "there should be better ways and meanwhile it is running on all anyways" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/3510 [08:33:24] New review: Dzahn; "puppetize logrotate config from brewster and change to size based rotation" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/7167 [08:33:27] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/7167 [08:56:36] PROBLEM - Puppet freshness on ocg3 is CRITICAL: Puppet has not run in the last 10 hours [09:09:37] New review: QChris; "Is the documentation of --basedir consistent with the way CONFIGFILE is used?" [operations/dumps] (ariel) - https://gerrit.wikimedia.org/r/8309 [10:05:17] New patchset: Jeroen De Dauw; "add irc notifications for the #wikimedia-wikidata and #semantic-mediawiki channels" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8043 [10:05:38] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8043 [11:36:37] PROBLEM - Host analytics1002 is DOWN: PING CRITICAL - Packet loss = 100% [11:38:31] ACKNOWLEDGEMENT - Host analytics1002 is DOWN: PING CRITICAL - Packet loss = 100% daniel_zahn reinstall [11:40:40] RECOVERY - Host analytics1002 is UP: PING OK - Packet loss = 0%, RTA = 26.43 ms [12:26:31] New review: Nikerabbit; "(no comment)" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/8043 [12:29:14] PROBLEM - SSH on virt2 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [12:30:35] RECOVERY - SSH on virt2 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [12:39:23] * jeremyb scrolls up [12:39:40] no labs? [12:39:56] hashar: could you boot gerrit-wm ? (post #mediawiki banning) [12:40:07] ? [12:40:15] wtf are people banning gerrit-wm ? [12:40:30] talk to Joan or see logs [12:40:34] hashar: yes [12:40:57] who ever banned it deserve to get ops right removed [12:41:02] unless it suddenly became wild [12:41:02] 21 02:51:43 < jeremyb> damnit joan [12:41:03] 21 02:52:02 < jeremyb> 20 19:53:35 -!- gerrit-wm was kicked from #mediawiki by Joan [gerrit-wm] [12:41:07] ... 
[12:41:11] it became wild [12:41:15] k k [12:41:32] * aude hears [12:41:32] anyway [12:41:32] I am pretty sure I don't have ops access there [12:41:34] i think maybe it was someone using that script to do a mass approave or something [12:41:49] oh I do [12:41:49] hashar: i'm saying boot the process on manganese [12:43:45] hashar: it's currently a +q (p858 changed from Joan's +b) [12:43:59] I don't have access to manganese [12:44:02] oh [12:44:04] ;-( [12:45:51] !log Started ircecho on manganese [12:45:56] Logged the message, Master [12:46:05] our hero :-D thanks mark [12:47:33] New patchset: Thehelpfulone; "flood group to enwikisource, bug 36863" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/8326 [12:47:33] New patchset: Thehelpfulone; "flood group to enwikisource, bug 36863" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/8326 [12:47:42] New review: jenkins-bot; "Build Successful " [operations/mediawiki-config] (master); V: 1 C: 0; - https://gerrit.wikimedia.org/r/8326 [12:47:42] New review: jenkins-bot; "Build Successful " [operations/mediawiki-config] (master); V: 1 C: 0; - https://gerrit.wikimedia.org/r/8326 [12:47:43] hmm double bot? [12:48:28] mark: could we have just one copy? ;-) [12:48:50] the more the merrier? [12:48:51] ;) [12:48:57] :P [12:49:04] mark: ask Krinkle about that [12:49:19] * jeremyb incorporates dbbot-wm's history by reference [12:50:29] jeremyb: Ask me about what? Confused dbbot-wm with gerrit-wm ? [12:50:42] Krinkle: "the more the merrier" [12:51:10] heh :) [12:59:10] New review: Nikerabbit; "(no comment)" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/8037 [13:06:28] Dunno if anyone is around that knows labs stuff, but labs seems to have deadful I/O issues today to the point of not being able to login to servers. If you can do anything about it, awesome, if not Ryan will get badgered later. [13:06:44] Damianz: i think people know [13:07:11] jeremyb: Never assume! 
Or I just like annoying people.... not sure which [13:07:22] and have done some diagnosis. haven't caught up yet on what exactly [13:07:34] Damianz: sure. i'm just advising you that you should assume in this case [13:08:23] Considering people have been moaning since this morning and there's numerous bz tickets open I'd hope so - looking at how long a review takes, who knows though :P [13:09:59] Damianz: I'm here and have access but I'm not really sure what to do :-) [13:10:05] trying to get the hang of it still [13:10:26] ahhhh [13:10:29] GIT [13:10:36] I'd sugest gluster is being a tool but apart from that who knows without poking, Ryan still seems to be the person who knows the most sadly [13:10:40] I am on a project still using svn [13:10:41] noo [13:11:15] * jeremyb waves a ^demon [13:11:20] just in time [13:11:26] <^demon> uh oh [13:11:31] * hashar opens a bug about migrating all /trunk/tools project from svn to git [13:11:34] \o/ [13:11:36] good morning Chad [13:11:52] * jeremyb annexes richmond to DC [13:12:11] * hashar uses `git svn clone` [13:12:11] <^demon> annexing doesn't make it closer. [13:12:25] <^demon> hashar: We can just write rules for svn2git and do them all at once. [13:13:32] don't you already have rules to do that ? [13:13:34] New review: Nikerabbit; "(no comment)" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/5778 [13:13:38] oh forget me [13:13:46] <^demon> For projects we've already converted ;-) [13:13:56] yeah rules for svn2git will be great, isn't it all about adding a new path in some conf file ? [13:15:35] New review: Demon; "(no comment)" [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/8120 [13:17:23] ^demon: isinstance? what's the question? [13:17:51] <^demon> You typo'd it? [13:17:52] * jeremyb points to http://docs.python.org/library/functions.html#isinstance [13:17:59] nope [13:18:00] <^demon> if isinstnace [13:18:06] <^demon> instnace? 
[13:18:08] no [13:19:23] <^demon> On line 138 you used "isinstance". On line 134 its "isinstanace" [13:19:36] <^demon> *isistnace [13:19:44] ohhh [13:20:02] well i typed it right just now [13:20:04] ;P [13:20:06] <^demon> :) [13:20:07] * jeremyb rereads [13:20:24] but otherwise? [13:20:26] <^demon> hashar: Here's a start http://etherpad.wikimedia.org/Git-migration-tools [13:20:40] <^demon> jeremyb: *looks* ok, but I haven't cherry picked it and tested yet. [13:21:31] ^demon: that is all cryptic to me ;-D [13:21:39] ohh [13:21:50] <^demon> The only part you'd need to add is more create repository lines [13:21:58] <^demon> The match is a regex :) [13:22:03] well I am not sure we want to do a brute force migration of /trunk/tools/* to some tools/$1 git repository [13:22:15] some projects there are obsolete [13:22:35] <^demon> Well it's harmless to dump them all. If we don't want them in gerrit just don't make the projects there. [13:22:50] also I was wondering this morning if we really want a new namespace such as tools/* [13:23:00] which is not very meaningful [13:23:00] <^demon> I already added mediawiki/tools/* [13:23:04] kk [13:23:04] <^demon> which some of that will go into [13:23:49] that was my question on some bug requesting to migrate the multiversion script [13:24:15] my proposal is operations/hetdeploy or operations/mediawiki/multiversion [13:24:25] or could be some platform/mwmultiversion [13:24:37] I have really no idea and can't make a choice :-( [13:25:04] <^demon> operations/* makes sense for it. We can bikeshed over the actual repo name later. [13:25:44] sure [13:25:58] wanna review the various tools ? [13:26:22] <^demon> I can this afternoon. I've got class in an hour and need to leave in ~30m. [13:27:57] New patchset: Jeremyb; "cleanup/refactor gerrit logging" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8120 [13:28:16] ok [13:28:18] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8120 [13:28:28] thanks gerrit2 ;) [13:32:16] huh. `git reset --hard origin/production` and then `git pull -a` fails with [13:32:19] Unable to find common commit with 1700114759097cb2237aa9e6e332093a90021926 [13:32:36] which is a commit from december [13:41:29] this is getting annoying [13:41:57] i deleted my local production and then did `git checkout -t origin/production` [13:42:08] then git pull -a gives same error again [13:43:57] PROBLEM - Host analytics1002 is DOWN: PING CRITICAL - Packet loss = 100% [13:43:59] huh, i don't have a clue how i fixed it. ;-( [13:44:01] $ git pull -a [13:44:01] Already up-to-date. Yeeah! [13:47:52] New review: Jeremyb; "(no comment)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8120 [13:49:54] New review: Demon; "Testing claimed commenting bug." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8120 [14:11:41] RECOVERY - Host analytics1002 is UP: PING OK - Packet loss = 0%, RTA = 26.38 ms [14:37:04] cmjohnson1: ping [14:37:10] would you have time to look at the netapp? [14:46:56] PROBLEM - MySQL Slave Delay on db1018 is CRITICAL: CRIT replication delay 184 seconds [14:47:32] PROBLEM - MySQL Replication Heartbeat on db1018 is CRITICAL: CRIT replication delay 192 seconds [14:51:44] RECOVERY - MySQL Replication Heartbeat on db1018 is OK: OK replication delay 16 seconds [14:52:38] RECOVERY - MySQL Slave Delay on db1018 is OK: OK replication delay 0 seconds [14:53:19] hi mutante, did CT talk to you about getting us sudo on the new analytics box(es)? 
[14:55:52] aaannd mark, if you are around and have a sec sometime soon [14:55:54] https://gerrit.wikimedia.org/r/#/c/7988/ [14:56:12] would love to review that with you (or someone), so that if you have questions I can answer [14:56:32] you seem to be the most appropriate review, since you reviewed my git stuff too [15:04:06] mark: heya [15:04:10] hi [15:04:22] I can't get one of the two netapp controllers to respond [15:04:24] I am in eqiad w/ rob this week [15:04:27] oh [15:04:28] right [15:04:30] meh [15:04:57] how do you like eqiad? [15:05:03] much nicer than tampa! [15:05:10] no shit ;) [15:06:38] put a ticket in and I will get to it when I get back [15:06:46] yeah [15:11:23] ottomata: coming up within the next hour or so. 8/10 installed ..;) [15:12:24] oh amazing! [15:12:26] thanks! [15:12:49] ottomata: speak of the devil ;) [15:13:18] mutante is the devil!? [15:13:27] * mutante hides [15:13:32] dzahn == DEVIL ZAHN!? [15:13:40] I was going to say woosters :P [15:13:45] ohhhh [15:13:58] but sure, mutante sounds like a devilish nick too [15:14:25] it's just biology :p [15:55:52] New patchset: Dzahn; "full sudo privs for diederik,dsc and otto on analytics hosts" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8336 [15:56:12] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8336 [15:57:17] Q about sudo privs [15:57:23] why do we do NO PASSWORD? 
[15:57:25] New review: Dzahn; "per CT" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8336 [15:57:27] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8336 [15:57:59] ottomata: because you seemed to want it on others and copying [15:58:37] ottomata: happy to change it to regular [15:59:40] you had it on all existing ones though [16:00:07] hmm, [16:00:11] * jeremyb personally prefers NOPASSWD [16:00:17] i don't mind, but I think it is safer to have to enter our sudo password when we sudo [16:00:49] we can do another commit to change all at once then if you want to change it, staying consistent for the moment [16:01:04] what do other ops people do? is everyone usually NOPASSWD? [16:01:35] well, they are root [16:01:57] ? [16:02:04] they sign in as root? [16:03:13] you have all 3 groups, people who can root, people who have sudo ALL, and people with selected sudo commands [16:03:28] where i see ALL i also see NOPASSWD combined with it though [16:03:34] hmm, ok [16:03:39] well i guess let's just be consistent, i was just wondering [16:04:41] PROBLEM - Puppet freshness on db29 is CRITICAL: Puppet has not run in the last 10 hours [16:07:07] ottomata: feel free to test on analytics1006 [16:07:43] it already created sudoers.d files for each user there [16:14:24] it works! [16:14:33] :) [16:15:25] ottomata: expect all to work except 1005 (1001 to 1010) [16:15:39] that's the last one? [16:15:48] yes, 1005 is rebooting [16:16:59] you got 2 software raid 1, one for / and one for swap, across first 2 disks (which are sdc and sdd) [16:17:32] RobH: do we have any spare misc servers in pmtpa? we may need one for labs [16:17:35] /dev/md0 being mounted as / [16:17:59] perfect, thanks so much! 
[16:19:11] ok, let me sign 1005 on puppetmaster and we should be fine for now [16:22:29] !log analytics1001 to 1010 installed and up in puppet [16:22:31] paravoid: labs cannot run on normal misc stuff [16:22:33] Logged the message, Master [16:22:42] paravoid: the ciscos will be shipping there this week for install next and the following [16:24:11] RobH: we were talking about using that as a temporary NFS server, so there's no need for a Cisco for that [16:24:28] RobH: a server with somewhat acceptable I/O would be fine [16:24:49] nfs server as opposed to the gluster servers dedicated to that kinda thing? [16:25:13] i may have a couple of spare misc servers, i would have to do a review to find out [16:25:17] but those servers dont have a lot of storage [16:25:25] misc servers tend to be dual 250gb [16:25:25] nfs server as opposed to an instance that runs an nfs server that in turn uses gluster [16:25:41] we have 20g right now iirc, so it should be fine [16:26:25] yep, 18g allocated, 12g used [16:27:11] ok, if ya need it go ahead and drop a ticket in procurement listing what you need and why you need it pls [16:27:17] that way i have a trail of whats goin on ;] [16:27:25] sure [16:27:45] should I file it in procurement even though we don't actually want to procure anything? [16:27:59] if we don't have any spares, we shouldn't buy one, since this should be a temporary measure [16:31:08] Ryan_Lane: are you okay with that plan? should I go ahead and file the ticket? 
[16:35:50] gimme a sec to read backscroll [16:36:03] I'm in the middle of an IRC onslaught [16:36:11] heh [16:37:11] buying one will take longer than using the ciscos [16:37:25] if we have an available misc, that'll work [16:37:34] otherwise let's plan on riding out the storm [16:37:48] agreed [16:47:35] New patchset: Dzahn; "add monitoring group analytics-eqiad and set in analytics role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8338 [16:47:54] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8338 [16:50:35] New patchset: Hashar; "puppetize wikibugs (irc bot for bugzilla)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8339 [16:50:55] New review: Dzahn; "this time with a DC suffix, ack" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8338 [16:50:55] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8338 [16:50:56] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8339 [16:58:36] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/7884 [16:58:39] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/7884 [17:01:16] thanks ben! [17:16:17] ottomata: you're welcome! but it's not live yet, because puppet takes frikkin hours on spence. [17:16:26] ha, ok [17:17:11] maplebed: did you do anything special to the sqlite dbs in swift? [17:18:07] AaronSchulz: no. [17:18:15] why, are they special? [17:18:27] it easily takes 15 seconds to list some thumbnails [17:18:37] in fact that's where all the slowness is [17:18:49] the concurrent deletes are very fast, even the 99th percentile [17:18:57] fantastic! 
[17:19:12] * AaronSchulz looks at db.py [17:19:14] CREATE INDEX ix_object_deleted_name ON object (deleted, name); [17:19:37] maplebed: just for sanity can you check if that index is there? [17:19:42] I never expected listings to be super quick, but 15s is a little absurd. [17:20:07] I don't know how to use sqlite very well. I'll try and find out. [17:20:29] and no we can't switch the DBMS, so says launchpad ;) [17:20:43] * AaronSchulz wonders what version of sqlite is running [17:21:04] 3.6.22-1 [17:21:15] ^^^ the package version for libsqlite3-0. [17:21:36] New patchset: Ottomata; "Creating generic::mysql::server class that installs packages and sets up my.cnf and starts mysqld." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/7988 [17:21:38] burrito time. [17:21:55] Change abandoned: Ottomata; "Including this change in this commit:" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/7887 [17:21:56] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/7988 [17:22:42] hiyaaaaaa, mark, are you still around? [17:23:00] or maybe someone else who's got puppet / mysql opinions? [17:48:11] !log ran authdns-update on dobson to pick up virt1002-1008 changes [17:48:16] Logged the message, Master [17:51:44] New patchset: Jeremyb; "make ircecho config sane (not just very long strings)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8344 [17:52:04] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8344 [17:52:04] New review: Jeremyb; "(no comment)" [operations/puppet] (production) C: 0; - https://gerrit.wikimedia.org/r/8043 [18:04:45] !log dist-upgrade and reboot loudon [18:04:48] Logged the message, Master [18:23:16] ottomata: 3 diffs between your method and ^demon's. that i can think of off the top of my head. 
1) name translation varies (no @users.mediawiki.org in yours) 2) svn's not made read only so people don't accidentally commit there and 3) ^demon maybe doesn't know it has happened [18:24:06] prolly other diffs with the commit msgs [18:27:17] uhh, wait, what are we talking about? [18:27:20] <^demon> You miss the refs/notes/commits that has svn commit data. [18:27:23] <^demon> jeremyb: ^ [18:27:40] New patchset: Reedy; "Added php-htmlpurifier to go with https://gerrit.wikimedia.org/r/#/c/7826/" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8348 [18:28:00] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8348 [18:30:21] PROBLEM - Puppet freshness on storage3 is CRITICAL: Puppet has not run in the last 10 hours [18:31:02] ottomata: your svn -> git mail [18:31:24] PROBLEM - Puppet freshness on brewster is CRITICAL: Puppet has not run in the last 10 hours [18:31:37] ottomata: want to review 8344? ;) [18:32:02] oh about the move from svn to udplog [18:32:02] ? [18:32:07] sorry [18:32:08] yeah [18:32:11] svn udplog to git udplog [18:32:20] still yearh [18:32:20] ^daemon == chad? [18:32:21] yeah* [18:32:25] yes! [18:32:37] how did you go so long without knowing that? [18:32:45] i actually know another techy chadh [18:32:50] it's kinda confusing sometimes [18:32:55] when I did the conversion, I provided an authors file so that emails and user names were preserved in the commit log [18:33:16] what is chad's method? [18:34:05] ottomata: 21 13:20:26 < ^demon> hashar: Here's a start http://etherpad.wikimedia.org/Git-migration-tools [18:34:06] <^demon> Same thing, but I normalize everyone's e-mails to the fake @users.mediawiki.org (which may exist as a redirect one day) [18:34:26] <^demon> Also, the tool I uses retains the svn commit info as a ref/notes/* [18:34:38] ohhh, why the fakes? [18:34:41] o we don't want the emails [18:34:44] but the usernames are ok? 
[18:34:46] hmm [18:34:59] ^demon: so refs/notes/* is only old stuff not for new commits? [18:35:01] hmm, would it be better to let you convert this then? [18:35:09] i haven't made any changes to it since converting, we can start over [18:35:30] <^demon> jeremyb: refs/notes/* are a feature of git generally. The tool we use just stores the old commit info as refs/notes/commits [18:35:58] commit info == commit message? [18:37:19] what happens to stuff like eol-style? [18:37:37] is it recorded anywhere? [18:40:40] <^demon> ottomata: No, the commit messages are kept the same like they had been done in git initially. This just stores a link to the revision in code review, so you can know what SVN revision it referred to. [18:40:57] <^demon> jeremyb: eol-style commits aren't needed in git, and are discarded when converting. [18:41:13] ohhhh, cool [18:41:15] <^demon> Pretend they never happened ;-) [18:41:29] hmm, in that case should we redo that conversion? [18:41:50] <^demon> Yes we should :) I'm working on a set of rules to do a bunch of project conversions this week. [18:42:55] ok [18:43:03] rules as in a script? or explicit instructions? [18:43:34] ottomata: see the link i pasted [18:44:21] <^demon> We should triage /trunk/tools today :) [18:44:35] i see it [18:44:37] New patchset: Asher; "update package changelog" [operations/software] (master) - https://gerrit.wikimedia.org/r/8351 [18:44:39] not sure what that is [18:45:04] is that a script or just pseudo code? [18:45:13] New review: Asher; "(no comment)" [operations/software] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8351 [18:45:15] Change merged: Asher; [operations/software] (master) - https://gerrit.wikimedia.org/r/8351 [18:45:26] <^demon> It's the configuration to map svn paths to git repos/branches on those repos. [18:45:31] oh [18:45:38] configuration to what? 
[18:46:00] <^demon> https://gerrit.wikimedia.org/r/gitweb?p=operations/software.git;a=tree;f=svn2git;hb=HEAD [18:46:04] <^demon> Magic tools :) [18:46:17] {{fact}} [18:47:18] hmm, oooook [18:47:23] <^demon> Chad says so [18:47:31] should I try to figure out how to use that, or should I let ^demon re-convert this repo for me? [18:47:41] :) [18:47:43] <^demon> You should let me do it with the others I'll be doing this week :) [18:47:47] ok perfect [18:47:59] i won't need to commit new changes until some other stuff gets deployed [18:48:00] so that's fine [18:48:15] I don't care about the name of the git repo, but it should be in the analytics/ project group [18:48:22] and you can delete the analytics/udplog one if you like [18:48:36] yes! [18:48:39] oops, wrong chat [18:48:45] <^demon> We can keep analytics/udplog, I'll just rewrite the history :) [18:50:58] ok cool [18:51:01] thanks! [18:56:27] ottomata: http://nagios.wikimedia.org/nagios/cgi-bin/status.cgi?hostgroup=analytics-eqiad [18:56:43] CooooooooOOOOl [18:56:54] eh actually, this one is better for bookmark: http://nagios.wikimedia.org/nagios/cgi-bin/status.cgi?hostgroup=analytics-eqiad&style=detail [18:57:21] PROBLEM - Puppet freshness on ocg3 is CRITICAL: Puppet has not run in the last 10 hours [19:03:35] mutante: as long as you're doing monitoring, search seems to be unmonitored in pmtpa? is that intentional? 
(works in eqiad i think) [19:03:39] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/6798 [19:03:41] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/6798 [19:04:04] * jeremyb asked about it a few times, just repeating because i saw you touching nagios [19:04:19] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/7751 [19:04:21] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/7751 [19:09:43] jeremyb: stor [19:09:56] jeremyb: sorry, still in meeting, feel free to open RT [19:10:41] mutante: i think last time i tried to file an RT it turned me down (i don't exist) [19:11:03] jeremyb: email to ops-requests@ should work [19:11:16] @rt.wm.o ? [19:11:22] yea [19:15:23] looks like the last one i tried was to core-operations@ 1.5 yrs ago [19:15:36] * jeremyb will try again ;) [19:17:50] RobH: what would it take to get lucid working on the db61/62 hardware? [19:28:05] New review: Demon; "Wouldn't it make more sense to put this in bugzilla.pp?" [operations/puppet] (production) C: 0; - https://gerrit.wikimedia.org/r/8339 [19:31:13] ctwoo: can you forward me whatever you got from nagios this weekend re: rsearch? [19:33:02] notpeter: i just forwarded you the first page [19:33:11] it flapped 3 or 4 times [19:33:14] I didn't get a page either [19:33:21] i think it happened the week before as well [19:33:27] search-pool4 in all cases [19:33:59] huh, ok [19:34:01] thanks [19:35:35] notpeter: sometimes it's search1015, sometimes search1016 [19:35:59] notpeter: most is in SAL i think but IIRC not the most recent boot [19:36:58] (just guessing based on what asher/faidon's responses were ;) ) [19:37:16] heya [19:37:22] can someone review this with me today sometime? 
[19:37:22] https://gerrit.wikimedia.org/r/#/c/7988/ [19:39:04] jeremyb: gotcha [19:39:12] I mean, those are the two in that cluster [19:39:21] it might be time for more sharding... [19:39:47] notpeter: well i don't understand how it works. there's more than that in pool4 right? [19:39:57] but only those 2 break [19:40:26] ottomata: can you put a comment at the top of mysql.pp directing people towards the generic classes at the bottom for non-prod+core dbs? [19:41:00] yup [19:41:08] jeremyb: those are the ones that actually take front-end requests [19:41:28] the other ones have spell-correction indexes and highlighting indexes [19:41:49] so the others are used by search101[56] but not directly from outside the pool? [19:41:57] yep [19:42:00] ok [19:42:26] New patchset: Ottomata; "Creating generic::mysql::server class that installs packages and sets up my.cnf and starts mysqld." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/7988 [19:42:46] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/7988 [19:43:02] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/7988 [19:48:12] thanks binasher [19:48:20] does that mean someone else needs to review too to get it merged? [19:48:39] nope [19:48:47] do you have merge permissions? [19:48:58] i can merge it for you if not [19:49:15] isn't merge just ops? [19:49:23] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/7988 [19:50:19] yeah, forgot about that.. it's merged and on the puppetmaster [19:50:41] *afk [19:55:53] New patchset: Hashar; "puppetize wikibugs (irc bot for bugzilla)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8339 [19:56:13] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8339 [20:00:55] binasher: it's a chipset for the nic [20:01:04] we would have to install the drivers for it during install [20:01:07] ie: painful [20:08:08] New review: Hashar; "Patchset2:" [operations/puppet] (production) C: 0; - https://gerrit.wikimedia.org/r/8339 [20:08:56] New review: Hashar; "libemail-mime-perl" [operations/puppet] (production) C: 0; - https://gerrit.wikimedia.org/r/8339 [20:10:25] thanks binasher! [20:12:01] New patchset: Andrew Bogott; "Added entries for virt1001-virt1008." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8356 [20:12:21] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8356 [20:13:43] New patchset: Ottomata; "Not configuring user with generic_my.cnf.erb" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8357 [20:14:04] binasher, ergh, booboo in my .erb file, had something left over that I had meant to remove [20:14:04] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8357 [20:14:08] can we try that? [20:29:38] so, more questions for ops [20:29:44] I need to puppetize the full version of the GeoIP .dat files [20:29:55] right now only the Lite versions are puppetized [20:30:04] we have license keys for the full version [20:30:23] right now, they are being downloaded by a wget cronjob on puppetmaster [20:30:26] into the volatile source [20:30:36] I think I can change the wget command to download the correct files [20:30:51] but then I'd have to put the license key in the wget URL [20:31:00] which then would be in the public puppet repo…which I assume is not what I should do :) [20:31:17] ottomata: the license key would go in the private repo [20:31:18] You can variablise it, and then put the key in the private repo. somehow [20:31:26] how do I do that?
[20:31:29] ottomata: and use a variable in the public one [20:31:29] i don't have access to private repo [20:31:43] Reedy: ha ha [20:33:17] ottomata: at worst, you get someone else in ops to put it in that repo [20:33:36] ok, lemme make sure I know exactly what needs done, then I will poke again here [20:33:40] thanks! [20:33:51] there should be enough examples around, somewhere.. [20:34:25] ottomata: could you just paste the URL with %s in place of where the key goes? [20:34:57] maybe…reading geoip instructions right now: [20:34:58] http://www.maxmind.com/app/update?license_key=LICENSE_KEY_HERE&md5=MD5_SUM_OF_EXISTING_GEOIP_DAT_FILE [20:35:09] that URL will work for the country [20:35:26] there seems to be a script to do this too... [20:35:30] hang on, lemme figure this out [20:35:46] ottomata: there are a lot of examples of private repo variables in puppet [20:35:51] ok [20:36:04] ottomata: MD5?! [20:36:14] not sure if I need that [20:36:29] https://rt.wikimedia.org/Ticket/Display.html?id=2385 [20:36:39] The md5 parameter is optional, if it is passed, the server will return the message "No new updates available" if the MD5 of the uncompressed GeoIP.dat file on your server matches the MD5 of the latest GeoIP Country file on our server. [20:37:47] ottomata: for example in my.cnf.erb:password=<%= scope.lookupvar('openstack::nova_config::controller_mysql_root_pass') %> in puppet/templates/openstack [20:38:07] hmmk, thanks [20:40:05] preilly: that's just referring back to a variable in the public class not to a private password [20:40:48] and it's set here: [20:40:48] $ git grep -n controller_mysql_root_pass manifests [20:40:48] manifests/openstack.pp:815: $controller_mysql_root_pass = $passwords::openstack::nova::controller_mysql_root_pass [20:41:40] ok, actually, I just need to save a .conf file in the private repo [20:42:57] ottomata: for one key? 
[20:43:13] there is another way to update these, [20:43:18] once GeoIP C lib is installed [20:43:27] it comes with a geoipupdate script [20:43:34] that reads a config file with the key and some other stuff [20:43:41] and automatically downloads and installs [20:43:44] huh, ok. i hope it doesn't really need C for that ;P [20:43:49] the latest .dat file [20:43:50] ha, no [20:43:56] but you need the C lib to use the .dat files anyway [20:44:18] right, but the download will happen in a place where the files are not used. right? [20:44:39] ? [20:45:16] no it goes to default location of file [20:45:16] ah [20:45:17] rats [20:45:18] right now the download happens on sockpuppet, right? [20:45:20] and it only downloads the city [20:45:25] argh [20:45:29] so I need to do two things [20:45:49] 1. wget URL with license_key => download country db [20:46:03] 2. run geoipupdate script with config file in place with license_key => updates city db [20:46:18] you can't just wget both of them? [20:46:39] i don't think so….., i might be able to, but the URL will change for the city [20:46:46] sure [20:46:47] but [20:46:52] i need login and pw to find that url [20:46:56] and i don't have it [20:47:01] dartar? [20:47:04] trying to get it from diederik, not sure if he has it [20:47:12] oh, diederik [20:47:25] i dunno who has it [20:47:33] it looks like nimish registered [20:47:38] the md5sum is what form? hex? [20:47:42] # UserId, LicenseKey, ProductIds from ngautam@wikimedia.org account [20:47:47] lowercase/uppercase? [20:47:56] ha, good Q! [20:47:59] why can't it just use etags or something?
[20:48:04] iunnoooo [20:55:00] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/8357 [20:55:04] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8357 [20:55:29] ottomata: merged that change [20:56:44] thanks [21:00:44] New review: Ryan Lane; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/8356 [21:00:46] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8356 [21:01:27] New patchset: Andrew Bogott; "First stab at a partman recipe for the cisco boxes." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8408 [21:01:47] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8408 [21:03:37] thanks binasher, agh, I see what happened [21:03:42] I had committed the wrong version of that file [21:03:52] I have two clones of puppet right now [21:03:54] one of test [21:03:58] and one of production [21:04:04] I use the test one to try out changes on my local vm [21:04:21] and I had edited the template file there, and forgot to bring my changes over to my production clone [21:05:17] i have another commit coming, one sec [21:05:27] New review: Ryan Lane; "Should rename this to virt-raid10.cfg" [operations/puppet] (production); V: 0 C: 0; - https://gerrit.wikimedia.org/r/8408 [21:09:00] New patchset: Ottomata; "Fixing generic_my.cnf again." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8409 [21:09:20] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8409 [21:10:08] New patchset: Andrew Bogott; "Partman recipe for the cisco boxes." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8408 [21:10:28] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8408 [21:22:28] New patchset: Asher; "revoking nimish's cluster access" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8411 [21:22:48] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8411 [21:23:43] New review: Asher; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8411 [21:23:45] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8411 [21:28:30] 20 21:41:51 < jeremyb> do we have a policy on following PEP 8 or not? (style guide) [21:28:43] parting words before i take off. bbl ;) [21:29:05] (i think no one answered that) [21:32:35] jeremyb: generally no [21:32:52] can someone help me put a config file in the private puppet repo? [21:33:02] Ryan_Lane: that was in the context of hookhelper.py.erb (sp?) [21:33:08] Ryan_Lane: but also wondering in general [21:33:13] in general [21:33:14] there are two ways to do what I want to do, want to ask which is best [21:33:16] * jeremyb wonders why not [21:33:20] you can if you want [21:33:23] but no one will force you [21:33:34] well, for one, we hate the idea of 80 character lines [21:33:46] well forget about that part [21:33:59] i mean spacing around function calls kinda of stuff [21:34:04] kinda stuff* [21:34:06] * Ryan_Lane shrugs [21:34:15] anyway, will people complain if i switch existing stuff? [21:34:24] probably not [21:34:32] ok [21:34:47] danke [21:36:18] New patchset: Andrew Bogott; "Preseed virt1001-virt1008 with lucid and raid-10." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8413 [21:36:39] New patchset: Andrew Bogott; "Partman recipe for the cisco boxes." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8408 [21:36:57] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8413 [21:36:57] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8408 [21:38:06] New review: Hashar; "(no comment)" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/7877 [21:38:19] preilly or Reedy, can one of you help me get a config file into the private puppet repo? [21:38:30] You'll need someone from ops [21:38:35] I know I don't have access [21:39:28] ohhm, jeremyb i think is gone [21:39:31] Ryan_Lane? [21:39:41] New review: Ryan Lane; "(no comment)" [operations/puppet] (production); V: 0 C: 0; - https://gerrit.wikimedia.org/r/8413 [21:39:51] which file? [21:40:46] it's a GeoIP.conf file [21:40:58] that contains license keys for the paid versions of the geoip .dat files [21:41:28] there is a script, geoipupdate, that comes with the GeoIP package [21:41:32] that reads this config file [21:41:39] and can update the .dat files when they change [21:41:57] we can either [21:41:57] 1. install the script on puppetmaster only, and symlink the files into the volatile repo [21:42:06] and have puppet ensure they are up to date in the other hosts [21:42:28] or [21:42:28] 2. install the GeoIP.conf file on all hosts that need up to date .dat files, and run geoipupdate as a weekly cron [21:42:53] (sorry: 1. s/script/config file/ ) [21:43:32] why do we have a paid version? [21:43:47] is there no open content version of this stuff? [21:44:18] iunno! [21:44:24] i don't think there are any good ones, no [21:44:59] IIRC the paid version tends to get outdated and inaccurate / incomplete [21:45:09] vs. what?
[21:45:13] they update it every week [21:45:20] sorry [21:45:24] I mean the "free" versions [21:45:25] ;-D [21:45:27] ah ok [21:45:28] aye [21:45:49] RECOVERY - Packetloss_Average on oxygen is OK: OK: packet_loss_average is -0.0341156198347 [21:47:21] hey Ryan_Lane: about paid version maxmind; because the paid version has a much better coverage and is more accurate and is not expensive [21:48:12] or we could develop an extension and build a free / open source / peer-built version :-D [21:49:04] heh [21:49:38] hashar: who is going to do the geo lookup without relying on maxmind? [21:49:48] end user ? [21:50:42] Hello [User:$1|$1], Would you want to help us making the world better? Here some anonymously submitted questions for ya: Country? Region? City? ISP? Connection speed? [21:50:44] ;-D [21:50:56] haha [21:51:03] that's kinda cool actually [21:51:05] that would ** a$$ [21:51:07] show that to every wikipedia user? [21:51:18] and then we could even offer paid support ;-D [21:51:24] yeah to everyone [21:51:33] et voila [21:51:36] free and open source base [21:51:49] pretty cool idea, ha [21:51:59] in the meantime, help meeeeeeeeee use maxmind's til we get that all sorted out [21:52:00] eh? [21:52:01] but that is definitely going to cost a lot more than the thousand bucks the GeoIP company is charging the WMF [21:52:08] i think people would be pretty freaked out from a privacy POV :) [21:52:20] ottomata: I am not part of ops btw :-( [21:52:27] New patchset: Andrew Bogott; "Preseed virt1001-virt1008 with lucid and raid-10." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8413 [21:52:45] haha, ok [21:52:48] New patchset: Andrew Bogott; "Partman recipe for the cisco boxes." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8408 [21:52:50] Ryan_Laaaannnnnne [21:53:07] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8413 [21:53:07] New review: gerrit2; "Lint check passed."
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8408 [21:53:22] New review: Ryan Lane; "(no comment)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8408 [21:53:33] drdee: for one person scared about privacy in a /24, there are ~253 ready to give their information (ex: facebook) [21:54:07] yeah totally, i think it would be amazing [21:54:28] but wikipedians seem to be a little more upsettable than the average fb user [21:54:31] it is all about perception right? people who use facebook have mostly no clue what is tracked, while if you ask this is your ip, please tell me where you live raises all kinds of alarm bells [21:54:42] we could even manage to do that just by doing analytics on the squid logs [21:55:01] naw, we'd need to get it from the sources themselves [21:55:04] either the users or the ISPs [21:55:07] someone editing [Some obscure little city article], might just be living there [21:55:13] ohhh [21:55:22] yeah but *might* is the key word there [21:55:29] New patchset: Andrew Bogott; "Partman recipe for the cisco boxes." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8408 [21:55:49] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8408 [21:56:18] New review: Ryan Lane; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/8408 [21:56:22] ottomata: anyway for geoip either bug an ops here, eventually open a RT ticket or bug vvoosters (replace vv by w ) [21:56:38] voooooooosters [21:56:41] hehe [21:56:48] um, yup, bugging ops here and right now [21:56:51] hewooooo ops! [21:56:55] OR [21:56:58] get a plane to SF ;-D [21:57:20] heh [21:58:03] New patchset: Bhartshorne; "it takes longer than a minute to run the ganglia_parser. slowing it down to once every 2." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8417 [21:58:23] New review: gerrit2; "Lint check passed."
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8417 [21:58:34] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8417 [21:58:36] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8417 [22:01:57] maplebed: did you get a chance to look at containers more? [22:02:08] not yet. [22:03:51] New patchset: Bhartshorne; "the data this check uses is only refreshed every 2 minutes; no need to check every 1. also whitespace changes." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8418 [22:04:11] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8418 [22:04:19] New review: Ryan Lane; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/8408 [22:05:16] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8418 [22:05:18] * AaronSchulz was confusing sqlite2 and 3 [22:05:19] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8418 [22:05:30] it's 2 that had the writer starving [22:09:37] New review: Ryan Lane; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/8413 [22:09:40] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8408 [22:09:41] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8413 [22:10:42] New patchset: Bhartshorne; "third try to get nagios to do what we want for the logging check interval" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8420 [22:11:01] New review: gerrit2; "Lint check passed."
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8420 [22:11:07] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8420 [22:11:10] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8420 [22:11:27] hey Ryan_Lane I caught your changes in sockpuppet's diff. [22:11:29] can I merge them? [22:11:37] yes, please [22:11:39] I just submitted them [22:11:55] so I see! [22:11:59] merged. [22:17:15] it already finished. takes about 10min for the english (by far the largest). [22:17:37] the thing is that i can't see the lag info [22:17:37] http://noc.wikimedia.org/dbtree/ [22:18:14] no it is 0 again. [22:18:39] hmm, maybe i looked in the wrong spot before also, db1003 doesn't have a lag [22:19:53] New patchset: Ottomata; "Fixing generic_my.cnf again." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8409 [22:20:13] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8409 [22:21:18] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/8409 [22:21:20] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8409 [22:40:07] Jeff_Green: what happens with your user stuff when a user exists in more than one group that's included on a host? does puppet barf at defining the same thing twice? [22:48:52] Ok, cleaning up eqiad for the evening. [22:51:56] New patchset: Andrew Bogott; "Added some needed backslashes." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8423 [22:52:16] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/8423 [22:56:46] New review: Ryan Lane; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/8423 [22:56:49] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/8423 [23:32:31] AaronSchulz: 3.177 seconds to "select * from object where name like '8/87/Obj%' and deleted = 0 limit 10 ;" from a container db on the live system. [23:32:49] did I format that query right to use the index? [23:34:56] I think so [23:35:41] the container nodes are also storage nodes atm right? [23:36:50] yes, that's right. [23:38:08] it's only on the first request that it takes a long time [23:38:19] presumably subsequent requests have the db already in memory. [23:38:56] how much of a difference is there? [23:39:32] 0.06s for subsequent requests vs. 3.5s for the first request. [23:40:05] I tried several times on different large dbs [23:40:19] got 3.2, 4.5, and 3.1s response times. [23:42:33] this only happens on large dbs; the small ones are fast even on the first query. [23:43:07] how big are typical commons shard dbs? [23:43:14] rows/bytes [23:44:00] the largest are around 90MB. [23:44:44] the whole dbs or the object table? [23:44:56] the db file. [23:46:02] yeah, makes sense, that's what matters, since it's all one file [23:46:13] and that has around 270k rows in the object table. [23:46:36] small potatoes :) [23:48:01] here's something puzzling. [23:48:15] there are 18 containers that are about 90MB (87-93), [23:48:31] then the next largest is 4MB [23:49:03] there are 10 that are between 1 and 4MB, then it goes down more evenly, to 924k, 873k, etc. [23:49:11] oh. [23:49:13] nevermind. [23:49:16] I'm only looking at one backend. [23:49:20] and one disk. [23:49:30] it makes sense that it should only have a small portion of the commons shards. 
[23:50:51] with 60 disks and three copies of each container file, there should be ~12.8 commons shards per disk. 18 is within range, especially since there are a few other big containers. [23:53:40] sure
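Annotation on the shards-per-disk estimate at 23:50:51: the ~12.8 figure appears to be total replicas divided by disks. A minimal sketch of that arithmetic follows, assuming 256 commons shards (not stated in the log; inferred from 12.8 × 60 / 3). The spread estimate additionally assumes replicas land on disks independently and uniformly, which is only a rough stand-in for real placement, so it gives a ballpark for how unusual 18 on one disk would be rather than an exact figure.

```python
import math

# Figures quoted in the log.
DISKS = 60
REPLICAS = 3
# Assumption: not stated in the log, inferred from 12.8 * 60 / 3.
COMMONS_SHARDS = 256


def expected_replicas_per_disk(shards: int, replicas: int, disks: int) -> float:
    """Average number of shard replicas per disk if spread evenly."""
    return shards * replicas / disks


def spread_if_random(shards: int, replicas: int, disks: int) -> float:
    """Standard deviation of the per-disk count under a binomial model.

    Hypothetical model: each replica picks a disk independently and
    uniformly at random. Real placement is likely more balanced.
    """
    n = shards * replicas
    p = 1.0 / disks
    return math.sqrt(n * p * (1.0 - p))


if __name__ == "__main__":
    mean = expected_replicas_per_disk(COMMONS_SHARDS, REPLICAS, DISKS)
    sd = spread_if_random(COMMONS_SHARDS, REPLICAS, DISKS)
    print(f"expected per disk: {mean:.1f}, rough std dev: {sd:.2f}")
```

Under these assumptions 18 large containers on one disk sits within about two standard deviations of the mean, consistent with the "within range" remark, especially since a few of the 18 are said to be other big containers.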