[00:00:44] !tools-help | Cyberpower678 [00:00:44] Cyberpower678: https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help [00:01:33] Thanks [00:07:35] Looks like the RfX scripts aren't entirely continuous supported. [00:07:51] That bad RfX that was transcluded today, crashed my script. [00:24:33] scfc_de, I'm impressed with myself. My PC tagging script has been in continuous operation for 11 days now. :) [00:47:50] Any particular reason labs is being toolserver? [00:47:51] Current Load Avg (15, 5, 1m): [00:47:51] 202%, 285%, 339% [00:47:51] Avg Utilization (last hour): [00:47:51] 91% [00:48:12] Load is high again.\ [00:49:07] hmmm [00:49:13] this is annoying [00:49:30] I moved to labs to avoid this. [00:50:02] And I'm trying to do some tests for my new script. [00:51:09] Tools' load spiked incredibly twice in the last half hour. [00:51:26] What all is involved in creating a labs account? [00:51:45] Go to wikitech and signup. There. [00:52:14] Load back down to 42 [00:52:20] for now. [00:54:51] WHO DELETED MY CRONTAB? [01:04:27] LABS IS DOWN AGAIN> :| [01:04:44] arrrrrg [01:05:13] I'm trying out my new i18n code [01:06:16] And it's up again. [01:07:02] :| [01:59:14] Cyberpower678: What in blazes are you talking about? [01:59:16] 01:58:58 up 14 days, 7:22, 26 users, load average: 0.47, 0.68, 1.29 [01:59:54] Coren: you really expect Cyberpower678 to be able to answer that? :p [02:00:47] Coren, tool labs was going up and down. Kind of annoying when I'm trying to do stuff. I actually am calling those moments Toolserver v2 from now on. [02:01:23] What I pasted up there was straight from Ganglia Tools section [02:04:06] Coren, when I notice my connection slowing down, then failing, I look at the tools in Ganglia. As my connection was failing, load was higher than 300 at that moment and then dropped off again. [02:04:22] Then it happened again a few minutes later and dropped off again. [02:04:46] Not sure what's going on there but something is not right. [02:06:23] I'm seeing a period of ~ 2.5 minutes during which idle went down. Hardly "down". OTOH, there are regular spikes as some users have overly busy crontab needlessly, or do things like reload their crontab every minute. [02:07:13] Coren, can't blame for that anymore. :p [02:07:24] And, interestingly enough, the only place where stuff is supposed to be running -- the grid -- is fine. So any properly running job wouldn't have been affected. [02:07:53] Mysteriously enough though, my crontab vanished somehow. [02:08:38] Coren, jsub sends to the grid right? [02:08:45] Well, given that you had an automated process that replaced it at interval if anything went wrong with it that might happen. Yet another reason why running cron every minute to replace your crontab wouldn't have been a good idea. [02:08:50] Yes, it does. [02:09:37] Then I'm running my scripts correctly. :D [02:10:10] They're all jsubbed. 10 of the previously cron initialized scripts are now continuous. [02:12:08] Coren, you should enforce a new rule and broadcast it globally. [02:12:54] Any user not running jobs correctly will have a wiped crontab. Second offence, the crontab will be locked out. If there are still problems, the account will be disabled. [02:13:23] Coren, does that sound reasonable? [02:13:29] No, it doesn't. [02:13:43] Why? [02:14:06] The other users aren't being fair when running the stuff they want to run. [02:14:58] Because you're missing the point of collaborative environment. (Including, I should add, the fact that shared resources are _shared_ and that occasional dips in performance art part and parcel). There are finite resources; if you need all the processeing power for your task, you need a VPS. :-) [02:17:26] Coren, let's not forget that WMF is providing us with these resources free of charge. [02:17:39] Yes, that too. [02:17:40] :-) [02:18:07] The least you guys could expect from us is that we follow the rules and run our crap in the correct place, right? :) [09:11:44] heh [09:11:48] tools are dying again [09:11:58] nfs fail [09:58:38] http://ganglia.wikimedia.org/latest/?c=Labs%20NFS%20cluster%20pmtpa&h=labstore3.pmtpa.wmnet&m=cpu_report&r=hour&s=descending&hc=4&mc=2 [09:58:40] petan: ^ [09:58:59] some project doing heavy I/O [10:00:34] petan: maybe tools-webserver-01 http://ganglia.wmflabs.org/latest/?c=tools&h=tools-webserver-01&m=load_one&r=hour&s=by%20name&hc=4&mc=2 [10:00:44] which got a nice 100% CPU usage hehe [10:01:25] started around 7:40 UTC (aka a bit more than 2 hours ago) [10:45:57] its been flakey for the last 12-18 hours [11:12:17] [05:11:59] nfs fail [11:12:26] What are you talking about? The filesystem is fine. [11:19:58] * Cyberpower678 thinks Coren is having somewhat of an off day. ;) [11:21:04] * YuviPanda_ logs in to check [11:21:07] Cyberpower678: is this on -login? [11:21:11] or what? [11:22:01] Cyberpower678: wfm too [11:22:32] YuviPanda_, ? [11:22:38] as in [11:22:40] works fine for me [11:22:41] toollabs [11:23:58] I see a burst of IO some hours ago. This would have made performace laggy for a bit, not broken anything. [11:24:10] (And, indeed, logs show nothing broken) [11:24:36] Coren: what's yoru TZ again? [11:24:41] *your [11:24:49] EDT (atm, UTC-4) [11:24:55] Yes, it's early. [11:24:58] hmm, you're up early! :) [11:25:12] Coren: is using icinga to monitor processes a good idea? [11:25:23] Coren: I'm trying to figure out a way to alert me if my processes are not running [11:25:23] !coffee Coren [11:25:23] * Helpmebot pours Coren a nice hot cup of coffee. [11:25:30] Ol.o [11:25:32] o.O [11:25:32] Coren: or my web services are down [11:25:42] Cyberpower678: I thought we kicked that bot off here into the -offtopic [11:25:43] channel [11:25:52] I thought that too. [11:25:54] YuviPanda_: Well, it's done to monitor some critical processes (like mysql daemons), but it's generally considered overkill for most things. [11:26:10] @kick Helpmebot [11:26:10] Permission denied [11:26:14] ah [11:26:29] Coren: right. not on the regular icinga but perhaps on somtehing else? [11:26:35] Coren: or do you have some other suggestions [11:26:36] "or my web services are down"? [11:26:37] I see no reason to kick it now unless it's being abused here. [11:26:46] Oh, you mean monitor them. [11:26:59] * Coren ponders. [11:27:03] Coren: yeah. 'check wp-signpost/cgi-bin/api.py, make sure it is not 500' [11:27:07] and things like that [11:27:15] anyone know d3.js here ? [11:27:18] we can have our own icinga instance maybe... [11:27:28] OrenBochman, no [11:27:43] OrenBochman: ask in -analytics when more of the US is up :) [11:27:47] It might be worthwhile to have a tool maintainer icinga; we certainly don't want enduser tools to page opsen or spam -operations [11:28:00] Coren: yeah, agreed. -operations icinga should be ignored :P [11:28:06] Coren: how hard would it be? [11:28:13] * YuviPanda_ opens docs [11:28:15] And you'd want maintainers to be able to acknowledge outages, etc. [11:28:27] yeah. Email, I suppose? [11:28:49] YuviPanda_: Open a feature request bz? It's a fair idea, but will require thought. [11:28:56] I guess I't will have to wait [11:28:58] Coren: yeah. let me do that. [11:29:20] OrenBochman: I have no knowledge of that thing. [11:29:24] Sowwy. :-( [11:30:44] [bz] (8NEW - created by: 2Yuvi Panda, priority: 4Unprioritized - 6normal) [Bug 51434] Setup an icinga instance to monitor tools on tool-labs - https://bugzilla.wikimedia.org/show_bug.cgi?id=51434 [11:31:11] Coren: done. [11:31:33] Coren: also how is uwsgi coming along? I guess you've been distracted with making NFS not suck [11:31:48] You guess properly. [11:32:19] Strictly speaking, it only sucked once every other week. :-) [11:32:52] sure, but if you average that over the month, the spikes bring up the average :) [11:32:55] WSGI is my todo today. [11:32:59] nice! [11:44:28] Coren: for like 40 minutes servers were unresposible with load about 100 [11:44:37] I think it was nfs [11:44:53] hmm, http://tools.wmflabs.org/?status isn't loading for me right now [11:45:05] and my ssh hung [11:45:06] it was like 5 hours ago [11:45:08] is that my internet? [11:45:15] (this is happening right now) [11:45:28] nah, it was 3 hours [11:45:53] no [11:45:53] i mean [11:45:57] my ssh is hung right now [11:46:02] and ?status is hugn [11:47:47] <_acs_> hi guys, I am trying to access as always the instance "ssh -v acs@korma.wmflabs.org" [11:47:58] <_acs_> but it is blocked for some minutes now in [11:48:04] <_acs_> debug1: Offering RSA public key: /home/acs/.ssh/id_rsa [11:48:17] <_acs_> do you have any clues? [11:48:51] petan: ^ [11:49:13] yes labs are fucked now [11:49:22] I don't have powers to do anything, you need to talk to Coren [11:49:24] yeah [11:49:28] it seems fucked. [11:49:47] !ping [11:49:47] pong [11:49:47] pong [11:49:48] Just veeery slow [11:49:57] @q Helpmebot [11:50:05] at least something works [11:50:06] Someone is kicking the filesystem's behind. [11:50:09] !ping [11:50:09] pong [11:50:13] *everything* works. [11:50:29] Coren: -login has load over 20 are you sure? [11:50:39] Yes. [11:50:44] is it manybubbles again? [11:50:50] can we please please kick him off labs? [11:50:51] * petan eats many bubble [11:51:01] :D [11:51:06] Not only am I on it, but I've also just ssh'ed again to it. It's sluggish, but it works fine. [11:51:07] or the NFS server, I mean. [11:51:17] can we please punch him in eye [11:51:20] yeah, I can ssh again but it was down for a minute or so [11:51:33] petan: no, we can punch solr. No violence against people! :) [11:51:51] But I think I'll have to crack down on people running stuff on -login. *sigh* [11:51:51] Coren: there is difference between sshing with root key and sshing like regular user who'm /home is mounted on nfs [11:52:00] Coren: if it works fine to you doesn't really mean anything [11:52:18] petan: I know that; and that's why I didn't. [11:52:31] I don't see anything running on -login [11:52:37] most expensive process is gmond [11:52:49] which IMHO should have priority lowered by default [11:52:54] eg nice+ [11:52:59] petan: Really? You don't see all the screens? And tmux? and crons? [11:53:06] oh man, wonderful weather. I should go out [11:53:08] that is supposed to live there or not? [11:53:17] I thought that -login is for screens and shit [11:53:22] just not bots [11:53:25] and heavy jobs [11:53:30] (and mosh?) [11:53:42] One should not run stuff in a detached screen. [11:53:55] If it's long enough to run in a screen noninteractively, it should be sent to the grid. [11:54:21] well but what about stuff like shell :P [11:54:24] or ssh session [11:54:29] that doesn't eat almost anything [11:54:40] or some tail -f [11:54:43] in screen [11:54:52] or vim [11:55:07] these things are IMHO ok, (except for emacs in screen) because emacs suck [11:55:12] petan: Then it shouldn't be in a *detached* session, shouldn't it? [11:55:18] why not [11:55:26] what if you need to go away and want to return to it later [11:55:33] like if you move from work to home etc [11:55:42] I do that a lot [11:55:54] basically I work always in a screen just to make sure if my net die it doesn't break [11:56:34] proper idle processes, like vim or bash do not eat resources [11:56:42] I see no reason why these couldn't run in background [11:56:49] gmond seems more evil to me :P [12:00:04] Back to the primary point; I see nothing but occasional spikes in NFS useage that lead to somewhat laggy disk use; in particular, when " yes labs are fucked now" I've seen a delay of all of 5-6 seconds to log in. [12:00:28] I had delay like 20 minutes that time [12:00:50] basically all my open terminals stuck (I could just move cursor) [12:01:06] OH [12:01:07] nvm [12:01:15] ... the use spike lasted all of 3 minutes. Check your network connection. [12:01:18] I thought you talked about that message from morning [12:01:24] not that one [12:01:30] this one was like 5 seconds lag [12:01:50] Hardly justifies "labs is fucked" then. [12:10:37] !part [12:11:56] @unq Helpmebot [12:11:56] Sorry but I don't see this user in a channel [12:13:05] petan: do you want helpmebot superuser to be able to make it join/part channel? Is there a reason you don't already? [12:30:20] T13|needsCoffee probably I am not trusted enough :D [12:30:22] Bleh. I need to find who's hammering on the disk this much. [12:30:27] Coren: is there no problem now? [12:30:32] because I can't do anything now [12:30:35] !pin [12:30:35] pong [12:30:42] at least this bot works :D [12:30:42] It's laggy. That's called a *shared resource* [12:30:48] load 28 [12:31:00] wait, since when shared resource is called laggy :P [12:31:09] petan: Will you *please* stop obsessing about load? [12:31:13] okok [12:31:37] that's usualy indicator that something is wrong :P [12:31:50] No, it's not. It's an indicator of the number of processes that are ready to run. [12:32:09] it's number of processes that are ready to run but do not run because they are waiting for something [12:32:16] well in fact it's neither of that [12:32:19] but it's close :P [12:32:20] And given the severe misuse of cron on -login, a one minute lag causes that. [12:33:19] (plus the time to catch up) [12:35:11] good morning! I can't ssh to my solr-mw2 machine this morning [12:35:39] or not. it let me in eventually I think [12:35:53] manybubbles: Are you hammering on NFS again? [12:36:23] Coren: no. I haven't been running it since yesterday [12:37:11] I'd really like to get back to it at some point but I won't do it until I can no break everyone else [12:38:19] * Coren disables crontabs. [12:43:06] petan: You have Helpmebot superuser now :P Now you can make it join/part channels [12:43:19] aha cool [12:43:34] Coren disabling where? [12:46:25] Hm. ContinuityBotLabs.jar [12:46:35] Seems to be the evul disk hog [13:12:35] I think we should buy like 2tb of RAM and create a huge ramdisk for labs :P [13:12:46] that would be finally good IO [13:21:52] Huh. [13:22:02] * Coren wonders why puppet hammers on the NFS. [13:23:20] no wonder :0 puppet is evil [13:23:28] @search evil [13:23:28] No results were found, remember, the bot is searching through content of keys and their names [13:23:34] !evil is puppet [13:23:34] Key was added [13:23:47] @search scary [13:23:47] Results (Found 1): puppet, [13:23:50] here we go [13:23:51] !puppet [13:23:51] http://scary.hostei.com/ascarywaterorg-/img/scary+puppet1.jpg learn: http://docs.puppetlabs.com/learning/ troubleshoot: http://docs.puppetlabs.com/guides/troubleshooting.html [13:24:13] Worse is ua31's bot on -login though. [13:24:13] retsreklawts: you there? [13:25:43] Coren: can't we kill bots on -login? [13:25:55] That's what I'm looking into atm. [13:26:00] sweet. [13:26:05] * YuviPanda_ gets out of his way. [13:29:10] I think we should do killall python :D [13:29:31] that would solve a lot of troubles :o [13:30:24] ... [14:25:55] petan: FYI, if you like graphs, the gangllia for labstore3 are easy to interpret: red spikes were the Bug of Doom. Peach/organge peaks are actual IO traffic. [14:26:14] !ganglia [14:26:18] linky [14:26:22] !labstore3 [14:26:26] @search ganglia [14:26:26] Results (Found 2): load, load-all, [14:26:32] !load-all [14:26:32] http://ganglia.wikimedia.org/2.2.0/?c=Virtualization%20cluster%20pmtpa&m=load_one&r=hour&s=by%20name&hc=4&mc=2 [14:27:07] http://ganglia.wikimedia.org/latest/graph_all_periods.php?h=labstore3.pmtpa.wmnet&m=cpu_report&r=hour&s=descending&hc=4&mc=2&st=1373975819&g=cpu_report&z=large&c=Labs%20NFS%20cluster%20pmtpa [14:27:15] !labstore3 is http://ganglia.wikimedia.org/2.2.0/?c=Labs%20NFS%20cluster%20pmtpa&h=labstore3.pmtpa.wmnet&m=cpu_report&r=hour&s=by%20name&hc=4&mc=2 [14:27:16] Key was added [14:27:27] sorry but nfs is slowing down again :/ [14:30:42] liangent: I'd expect the ones who should be sorry are the ones doing lots and lots of IO. :-) That said, that seems to be already over. [14:30:55] That was a comparatively small bump. [14:32:09] Coren: maybe we should write e-mail with instructions how to write IO effective software [14:32:14] like use /tmp for temporary stuff etc :D [14:32:45] btw where are dumps? [14:32:49] is it nfs as well? [14:33:00] I think people running dump scans may eat a lot of IO [14:41:31] What's a 'lot' of IO on the labs cluster? Is dumping 50MB user database into a backup file an issue? [14:43:07] tb4: Almost certainly not. [14:43:44] tb4: The bursts I've seen were several hundred megs of writes. [14:43:56] Okaydokay ta - IO lag may have coincided with my backup script running was all. [14:45:06] I'm thinking one of the bigger culprit may be creating lots of small files. [14:45:28] As opposed to writing a large contiguous file at one; which NFS handles fairly well. [14:49:14] !ping [14:49:14] pong [14:52:23] !ping [14:52:23] pong [14:52:25] works [14:52:25] nice [14:54:00] AzaToth ping [14:54:07] pong [14:54:39] AzaToth I have some packaging questions [14:54:55] shout [14:55:27] how do I turn that debian folder into .deb - doesn't it require the sources to be present somewhere? how does it know where the produced binaries should be installed to? where I define that [14:56:40] you usually uses "git buildpackage" (when building from a git repo, otherwize, debuild) to build it [14:57:13] !log deployment-prep restored /data/project/apache/common-local/php-master/extensions/Diff/Diff.php got deleted somehow by git [14:57:15] Logged the message, Master [14:57:53] !log deployment-prep Retriggering the database updating job manually https://integration.wikimedia.org/ci/job/beta-update-databases/ [14:57:53] Logged the message, Master [14:58:02] AzaToth: debuild complains about missing orig.tar file [14:58:11] which is what makes perfect sense [14:58:21] where should that tar be? dh_make removed it [14:58:36] usually in the parent dir [14:58:48] but I started debuild from root dir [14:58:54] there is no parent [14:59:41] should I start debuild from debian and copy tar to root [15:00:14] the orig tar is usually located in the directory above the directory containing the package [15:00:23] o.O [15:00:48] ok so I have root, then I should have root/debian and then I should have root/debian/blah/tarfile ?? [15:01:10] this is really not well documented anywhere [15:01:25] most of debian documentations expect you already knows everything [15:03:56] no [15:04:19] if you have $HOME/builds/package/debian then the orig tar should be in $HOME/builds [15:04:32] a.k.a. the parent dir [15:04:55] no I don't have that [15:05:19] I have $HOME/Documents/root_of_git/debian [15:05:40] and root_of_git is supposed to be root, which I just don't want to move behind [15:06:11] basically what I want to is being able to do [15:06:15] a) git clone url [15:06:21] b) cd [15:06:27] c) ./makepackage.sh [15:06:38] d) be happy & have fully working blabla.deb [15:07:10] that's why everything needs to be in git [15:09:20] btw AzaToth why that .tar needs to have so shitty structure? why is that required? like not having stuff in a subfolder [15:09:51] if I do tar -zxf .orig.tar.gz it unpack all files into current directory instead of / [15:10:01] which is what most of people probably expect from tar [15:10:24] you probably failed then, as 99.999999999% of all orig.tars have a dir named the package [15:10:49] no I didn't failed, I was told by someone in that debian channel that it must be that [15:11:06] my original tar was creating subfolder [15:11:08] .999999999? [15:11:13] ? [15:11:37] petan: your original tar should be the orig tar [15:11:47] right [15:14:11] AzaToth so if it unpack all stuff to .orig/ [15:14:20] would it work with debian packaging [15:14:25] no need to unpack [15:14:49] I think the problem is that you are trying to make a non-native package in a native environment [15:14:50] I suppose that .tar is going to be unpacked at some point, otherwise I fail to see why it needs it [15:14:58] ? [15:15:04] what these 2 things mean [15:15:32] a native package (lacking -N at the end of the version) has no orig tar [15:15:42] this has version [15:15:45] i.e. a package without a separate upstream [15:15:50] it's _.orig [15:15:59] I am just lazy to type all that [15:19:50] petan: create a file debian/gbp.conf with following data: http://paste.debian.net/16385/ [15:20:10] upstream-tree=branch is needed as long you aint build the debian package from distinct tags [15:20:31] defualt debian branch is master and default upstream branch is upstream, so we need to override this [15:20:52] now when you run git buildpackage, you should get "gbp:info: pidgeon_1.2.5.0.orig.tar.gz does not exist, creating from 'master'" [15:29:14] petan: copy over? [17:37:01] !ping [17:37:02] pong [17:37:55] !pong [17:37:55] don't play with me [17:38:55] <^demon> !pang [17:39:05] <^demon> Aww :( [17:40:18] @labs-info korma [17:40:18] I don't know this instance, sorry, try browsing the list by hand, but I can guarantee there is no such instance matching this name, host or Nova ID unless it was created less than 23 seconds ago [17:40:30] @labs-info korma.wmflabs.org [17:40:30] I don't know this instance, sorry, try browsing the list by hand, but I can guarantee there is no such instance matching this name, host or Nova ID unless it was created less than 35 seconds ago [17:40:39] @labs-host korma.wmflabs.org [17:40:45] I am running http://meta.wikimedia.org/wiki/WM-Bot version wikimedia bot v. 1.20.0.0 my source code is licensed under GPL and located at https://github.com/benapetr/wikimedia-bot I will be very happy if you fix my bugs or implement new features [17:41:09] @labs-resolve korma.wmflabs.org [17:41:10] I don't know this instance, sorry, try browsing the list by hand, but I can guarantee there is no such instance matching this name, host or Nova ID unless it was created less than 14 seconds ago [17:41:13] * Ryan_Lane sighs [17:41:25] @labs-resolve korma [17:41:25] I don't know this instance, sorry, try browsing the list by hand, but I can guarantee there is no such instance matching this name, host or Nova ID unless it was created less than 5 seconds ago [17:41:33] * Ryan_Lane rolls his eyes [17:41:34] <_acs_> korma.wmflabs.org exists, for sure! :) [17:42:01] what project is this in? [17:42:23] _acs_: ^^ [17:42:39] ah. contributors [17:42:43] <^demon> Ryaaannnnnnnn! I'm baaaaaacccckkkkkk [17:42:53] ^demon: yep ;) [17:43:11] <^demon> Your days of not getting bugged about my one-line puppet changes are over ;-) [17:43:23] _acs_: I can't ssh into it from inside of labs either [17:43:29] I bet you guys OOM'd the instance [17:43:30] <^demon> aka: vacation [17:43:34] @labs-info [17:43:43] <_acs_> OOM'd? [17:43:52] <_acs_> Ryan_Lane: the web is working correctly [17:43:57] <_acs_> http://korma.wmflabs.org/browser/ [17:44:00] ran it our of memory [17:44:04] *out [17:44:25] <_acs_> Ryan_Lane: pretty strange ... we are doing pretty basic things there [17:44:34] <_acs_> just copying files [17:44:40] <_acs_> and serving the files using http [17:45:39] <_acs_> Ryan_Lane: maybe some issue in the auth service? [17:45:48] no [17:45:52] unlikely [17:46:01] let's reboot it [17:46:02] <_acs_> Ryan_Lane: maybe we can reboot the instance [17:46:05] <_acs_> yeah! [17:46:06] since we can't ssh into it [17:47:36] <_acs_> Ryan_Lane: I should leave now ... I will contact you later. Thank youuuu! [17:47:50] _acs_: you can reboot the instance through wikitech [17:47:55] via Manage instances [17:48:57] petan: Eh, why did you approve Madihakanwal222's shell request as well (as Clairemarcc's)? [17:49:21] scfc_de: something wrong with them? [17:49:24] non-existent? [17:50:00] andrewbogott: it looks good to me [17:50:16] Ryan_Lane: Look like spammers to me. [17:50:40] yep [17:50:42] they are [17:50:45] revoke shell [17:50:45] Ah, okay, Madihakanwal222's request is not approved. [17:50:46] Ryan_Lane, ok, thanks. Will also check in with otto since he wrote the original class. [17:50:50] I'm going to deploy an LDAP fix [17:50:58] and we can block them [17:51:08] petan: I take it all back. [17:52:14] deployed a fix and purged memcache [17:52:22] memcache purge was needed for the fix [17:53:37] now to test blocking [18:02:04] Ryan_Lane: Could you block Clairemarcc and Rosshouser as well, please? [18:02:39] * Coren cringes, hoping the block actually will apply to the targets. :-) [18:03:27] Coren: Ryan seems to be confident with https://gerrit.wikimedia.org/r/#/c/73530/ :-). [18:03:55] getCanonicalName is evil and should die in a fire [18:03:59] Ryan_Lane: Does labs use nova-network or quantum? Can't remember what release prod is running atm [18:04:05] nova-network for now [18:04:12] hehe, everyone keeps trying to set things on fire on IRC :D [18:04:13] quantum when we upgrade to grizzly [18:04:20] Ah - I won't ask you why quantum is the biggest pile of shit ever then [18:04:29] s/quantum/nuetron/ [18:04:37] Damianz: are you using it in grizzly? [18:04:44] Yeah [18:04:57] hearing that it's terribly is not reassuring :( [18:05:09] it looked to be a bit more flexible than nova-network [18:05:32] So far I get a loop of trying to add interfaces for gre based and a whole load of python exceptions for vlan based tenant networks :( [18:05:48] Might just be me though - first time trying grizzly on a single node [18:05:49] https://wikitech.wikimedia.org/wiki/Special:BlockList <— all that's needed? [18:06:06] * Damianz might try re-installing this box with devstack since he needs it fast and can come back later if people like it [18:06:28] https://bugzilla.wikimedia.org/show_bug.cgi?id=40024 <— I have a bug with my feelings of getCanonicalName :) [18:06:51] Damianz: I haven't tried it yet [18:06:52] I need to [18:07:00] On a brighter note - horizon is really nice looking these days and the labs dns stuff is awesome, I had pain trying to get auto dns working on instance creation. [18:07:41] hi! I'm having trouble running pywikibot. I wrote up what I did and the error I got--not sure if I'm doing something wrong, or if there is a problem beyond my paths/setup: http://www.mediawiki.org/wiki/User:Kmenger/Tool_Labs#My_trouble_running_pywikibot.C2.A0on_Tool_Labs. Does anyone have a moment to help? [18:07:41] labs dns stuff? [18:07:43] awesome? [18:08:03] Damianz: which auto-dns stuff are you using? [18:08:23] Damianz: I hope you're using designate, and not the crap built into nova/nuetron [18:08:36] Neither have I - it looks pretty neat though and mucher simpler than dell's production thing [18:08:44] well, s/crap/mostly non-working code/ [18:09:10] the code isn't crap, but it's been deprecated and is broken in grizzly as far as I know [18:09:28] I looked at that - actually I was using some pdnsopenstack thing that basically pulls stuff out of rabbit, that's hacky... but I need this for like tomorrow [18:10:07] check out designate [18:10:12] (was moniker) [18:10:21] it's going in for incubation [18:10:21] One thing I do like the look of and would like to play with in quantum is the LBASS stuff with haproxy [18:10:24] I wonder if that was approved [18:11:22] Oooh this is like full dns management [18:11:26] it is [18:11:29] https://designate.readthedocs.org/en/latest/ <-- [18:11:37] I helped push it into incubation [18:11:57] One thing I also hate about openstack, it's really hard to find the right docs for projects - the wiki is... lacking [18:12:11] the wiki isn't used for documentation, for the most part [18:12:21] and incubated projects aren't listed on docs.openstack.org [18:12:44] or are they? [18:13:02] nope. they aren't [18:13:36] http://docs.openstack.org/incubation/ [18:13:46] seems they are, but integrated are not [18:14:03] Ryan_Lane: Re blocking, there's also Uzmamunirr (https://bugzilla.wikimedia.org/49947) and Neehaali82 (https://bugzilla.wikimedia.org/50048). I think that's all, and if blocking is working now, https://bugzilla.wikimedia.org/49811 could probably be closed. [18:14:21] kma500: I'll take a look at it. [18:14:56] Striking difference: http://ganglia.wikimedia.org/latest/graph.php?r=day&z=xlarge&h=labstore3.pmtpa.wmnet&m=cpu_report&s=descending&mc=2&g=cpu_report&c=Labs+NFS+cluster+pmtpa [18:15:01] the default being to block the IP is fucking stupid [18:15:06] The orange spikes are NFS stalling. [18:15:08] * Damianz -> hunting for food [18:15:08] Thanks, Coren! [18:15:13] I wonder how many legitimate users we're blocking [18:15:20] on wikimedia projects [18:15:35] the default for ISPs is dhcp [18:15:42] Ryan_Lane: Wouldn't life be simpler if everyone had to login and validate their email :P [18:15:59] Damianz: once the IP is blocked, they can't anymore [18:16:18] Oooh hard block, not the you've been a little bad block [18:17:43] so, I figured out why we're running out of disk space [18:18:00] the ephemeral base images are being created by nova as raw disks, rather than qcow [18:18:19] so, a 160 or 320G ephemeral disk is actually eating 160/320G [18:18:29] in the base images [18:18:45] the base image directories are eating like 300G+ of disk space [18:18:47] for nothing [18:19:19] [bz] (8RESOLVED - created by: 2Peter Bena, priority: 4Normal - 6normal) [Bug 50048] block Neehaali82 - https://bugzilla.wikimedia.org/show_bug.cgi?id=50048 [18:19:20] [bz] (8RESOLVED - created by: 2Peter Bena, priority: 4High - 6critical) [Bug 49811] Block user is broken on wikitech - https://bugzilla.wikimedia.org/show_bug.cgi?id=49811 [18:19:34] [bz] (8RESOLVED - created by: 2Peter Bena, priority: 4Normal - 6normal) [Bug 49947] Block User Uzmamunirr on Labs - https://bugzilla.wikimedia.org/show_bug.cgi?id=49947 [18:19:50] My getCanonicalName fix also should have fixed RenameUser :) [18:19:54] which is an added bonus [18:20:42] kma500: I'm no Pywikibot expert, but I think what you miss is the PYTHONPATH export. An important note you might not be aware of is that if you export an environment variable like this, it doesn't survive from session to session unless you also add it to your .bash_profile [18:21:25] (I.e.: if you did the export while setting up, then logged off and came back to test, the variable would have been gone) [18:21:45] kma500: You can check by typing "echo $PYHONPATH" and see if the value is right. [18:21:58] Erm, "echo $PYTHONPATH" even [18:24:06] Thanks for taking a look! I had set a pythonpath in my ./bash_profile. When I type echo $PYTHONPATH, I get the following: /shared/pywikipedia/rewrite:/shared/pywikipedia/rewrite/externals/httplib2:/shared/pywikipedia/rewrite/scripts [18:27:26] kma500: Then, AFAICT, that should work since /shared/pywikipedia/rewrite/generate_user_files.py is right there in the path. [18:27:58] kma500: Could you try now? [18:28:12] okay [18:28:28] same error [18:29:32] execfile('generate_user_files.py') [18:29:33] IOError: [Errno 2] No such file or directory: 'generate_user_files.py' [18:30:08] Hm. [18:30:12] legoktm: ^ [18:30:27] kma500: also try giving full path to execfile? [18:30:29] Oh [18:30:36] I need to read up on execfile(), but I'd have expected it to obey PYTHONPATH? [18:30:42] legoktm: does execfile use PYTHONPATH? [18:30:52] Not sure [18:31:18] kma500: On another topic, I see a few questions in your notes. Want inline answers? [18:31:39] Hi, YuviPanda. I'm not sure what you mean. [18:31:59] kma500: It's probably easier if you set up a virtualenv and install pwb to that [18:32:06] At least, that's what i've experienced [18:32:27] Coren: That would be great! Or we could talk if it's easier. [18:32:59] +1 to virtualenv. global things are evil. [18:34:24] kma500: by 'full path', I mean execfile('/shared/pywikipedia/rewrite/generate_user_files.py') [18:34:29] reather than just generate_user_files [18:34:46] but still, I agree with legoktm. use virtualenv. we aren't that short of space :) [18:37:14] Oh, I see. No. I didn't do that, YuviPanda. I was trying to run pwb.by basic -dry User:KBot500. [18:37:35] kma500: ah, right. Sorry, haven't used pywikipediabot, but do try using virtualenv. [18:37:42] never have to worry about PYTHONPATH again :) [18:37:49] : ) [18:38:12] I haven't used virtualenv before, but I'll look into it. Sounds like maybe this is the set up to rec in the docs? [18:38:59] kma500: no, our docs don't have enough python help, sadly. I can help with some virtualenv trouble, and google has plenty of help too [18:40:15] Thank you! I'll start with google and see how far I can get. I appreciate your help! [18:41:07] kma500: yw :) I should spend some time writing up Python docs (virtualenv and wsgi), but waiting for Coren to finish up uwsgi so that can be properly done. [18:41:09] eek CGI [19:03:30] I turned my thing on again at <1/4 the old speed. I'll look at the ganglia graphs from time to time to make sure I'm not killing everyone. If I am, please let me know. [19:14:46] manybubbles: It's certainly noticable. [19:15:10] manybubbles: I really really need to get you off NFS with the physical DB. [19:15:11] Coren: Does this mean nfs is not dog slow now? [19:15:23] Coren: I can terminate it if you'd like. [19:15:31] Coren: I think I'll just move it to my laptop.... [19:15:36] manybubbles: It remains usable. [19:16:05] manybubbles: The problem is that how fast you do the queries does not affect mysql's access pattern all that much. [19:16:20] Databases and networked filesystems are not friends. [19:16:35] Coren: mostly.... [19:16:41] Coren: but yeah, this is pretty horrible [19:17:05] manybubbles: I have a physical mysql server in the queue that will be dedicated to labs. [19:17:20] manybubbles: Depending on whether you can hold off a few days. [19:17:56] Coren: I can wait if I have to. [19:18:39] manybubbles: Lemme see if I can talk to Ken and get some DBA to help me set this up faster. [19:19:19] Coren: thanks! I can work on other things but I'd really really like to have this at some point in the near future.... [19:21:13] manybubbles: Because I'm pretty sure, seeing what it does to the server, that we'll have a "no database on NFS" policy. :-) [19:22:59] Coren: prolly a good call. Right now we'll be running solr/elasticsearch on local storage but I'm not sure how that'll scale either. Getting this loaded is part of the point. [19:23:39] manybubbles: If I can manage to grab a DBA, I'll have that DB up in no time. I could just do it, but I want to do it in a way the db-aware opsen recognize. :-) [19:23:55] (We already have the hardware ready) [19:24:02] Coren: does the tools webserver support jsonp? [19:24:27] Coren: funny story, for most of my previous jobs I was the closest thing we had to a DBA. Not saying I should do it, just irony for me to be waiting on the DBA. [19:26:19] JohannesK_WMDE: Not unless a (mostly) stock Apache can. I know little of jsonp except that it's tricky to get right and avoid CSRF and XSRF vulnerabilities; if you tell me what you need to support it I'll be in a better position to answer you. [19:27:20] manybubbles: The issue isn't so much "knows DBs" as it is "knows how the WMF likes its DBs" :-) [19:27:31] Coren: totally [19:28:00] Coren: i don't need the cross-site stuff, just the callback feature. i want to put some JSON for the limes map in a separate file on the server, then load it from javascript on the client. [19:29:33] JohannesK_WMDE: I don't think you need server support if you follow same origin policy, do you? [19:29:43] JohannesK_WMDE: ist silke z.Z. in urlaub oder warum antwortet Sie nicht? [19:30:00] Steinsplitter: ja, soweit ich weiß ist sie im urlaub [19:30:09] JohannesK_WMDE: danke^^ [19:31:30] Coren: i just want "somescript.src = 'tools.wmflabs.org/foo/bar.json?callback=myCallback'" call my callback. :) i don't know if that needs any special server support or not [19:32:50] JohannesK_WMDE: That'll work provided bar is written in a supported language. (e.g. 'bar.php' or 'bar.py' for instance) [19:33:03] JohannesK_WMDE: It'd also work with any executable, if it's in /cgi-bin/ [19:36:02] scfc_de: Are you planning on working some more on https://gerrit.wikimedia.org/r/#/c/71112/ ? [19:36:04] Coren: hmmm... the .json is not really a script, it's just a bunch of config variables basically. as far as i understand it (and i don't usually do web stuff) it works like this... you request "http://.../something?callback=myCB" and the server gives you the contents of "http://.../something" wrapped inside "myCB(...data...)" [19:36:38] so the server has to support it. i don't know if stock apache can do it, that's kind of why i'm asking [19:36:39] Ah, normally the json would be generated dynamically from a script that'd parse the query 'callback' [19:36:49] * Coren ponders. [19:37:53] yes, it looks like a CGI parameter but AFAIK the server parses it [19:37:56] You could do it with a RewriteRule in your .htaccess, with some effort. But the point is you just want to grab that as a static file? [19:38:00] or so i thought :) [19:38:10] I've never seen that setup, personally. It's doable. [19:38:49] But it's a little odd to be serving static values through this mechanism, no? Hm. No matter, I can probably whip something up for you. [19:41:12] well, i just want to have some kind of project-specific config file in javascript, and this sounded like a good way to do it. but if this isn't supported by the server, maybe i should look for something else. [19:41:19] JohannesK_WMDE: Coren I think a simple php / python script is the way to go here, since the .json need not be present. [19:41:30] JohannesK_WMDE: and yeah, this sounds a bit complex :D [19:41:54] JohannesK_WMDE: Well, actually, it's as easy as a three-line PHP script about. [19:42:27] JohannesK_WMDE: I can make a sample for you if you want; that'd take me 2 minutes. :-) [19:42:38] hmm, with the htaccess? [19:42:47] Coren, YuviPanda: yep, the .htaccess mod sounds unnecessarily complex. [19:42:54] YuviPanda: No need for htacces if it's in php. :-) [19:43:02] Coren: well, if it needs to be .json [19:43:15] YuviPanda: No it doesn't. :-) [19:43:25] Coren: thanks; but i think i can manage the php script that calls 'print' or something :p [19:43:33] Coren: well, unless you want something.php/something.json [19:43:49] JohannesK_WMDE: Remember to set the mime type to "application/json" [19:43:56] YuviPanda: Nothing that complex. Lemme show you. [19:44:00] but still... createing a script just for printing some config values. there must be a simpler way? [19:44:04] Coren: interesting. do show me! [19:44:08] Coren: err [19:44:19] Coren: are you going to do a #!/bin/php and +x it and run it as cgi-bin? [19:44:29] or something like that? [19:44:48] because that's just evil :PO [19:46:45] YuviPanda: http://tools.wmflabs.org/csbot/foo.php?callback=TheFoo [19:46:55] Coren: it has a .php! [19:47:05] YuviPanda: Matches /data/project/csbot/public_html/foo.php [19:47:07] Coren: yep, that's what the result should look like... [19:47:25] YuviPanda: Yeah, so? There is no need for the name to end in .json [19:47:32] tch tch, that's what I'm talking about [19:47:45] .json would need .htaccess, etc [19:47:51] without that yes, it is simple :) [19:48:11] YuviPanda: Yeah, I know you're taking about it. Why in blazes would one care what the URI is? [19:48:21] hehe, indeed. you won't [19:48:36] unless you're ashamed to be writing PHP. or something like that [19:50:03] ... a three-line program is a three line program pretty much regardless of what it's written in. Seriously, if you worry about what language something like this is written in, your priorities are off. :-P [19:50:31] Coren: [19:50:45] Coren: and no, I was genuinely curious about how you would run a php script without it being .php [19:50:51] perhaps .htacess can override, unsure. [19:50:55] but yes, point is moot now [19:51:04] Yeah, you could sethandler on .json for instance. [19:51:13] Why you'd /bother/ is beyond me. :-) [19:51:15] hmmm. the .php would have to open & read the json file though. because i don't like if users have to edit the .php to change the settings. and then, i think it starts getting too much overhead. this must be a solved problem for web developers, right? [19:51:19] I like neat URLs, y'know. [19:51:49] JohannesK_WMDE: yeah, you'd want to have .htaccess rewrite /hello.json?callback=cb into /return.php?file=hello.json&callback=cb [19:51:57] JohannesK_WMDE: It's even simpler then. [19:52:01] to have pretty URLs and for people to have arbitrary json files [19:52:15] (if you care about pretty URLs) [19:52:16] JohannesK_WMDE: Simply use readfile() [19:53:28] YuviPanda: ok, that would work. still, is there no way to "just recv() the file" and have some way of knowing when it's done? without having a script fire off on the server? maybe i'm too pedantic. :) [19:54:06] JohannesK_WMDE: You need a secript of /some/ sort to add the function name around the json. [19:54:10] JohannesK_WMDE: not for arbitrary values of callback= [19:54:12] JohannesK_WMDE: No way around /that/ [19:54:57] JohannesK_WMDE: You can hide the script with rewriterules and sethandlers in your .htaccess, but just having the script outright is much less complicated and works the same for the client. :-) [19:55:28] Look at the modified /data/project/csbot/public_html/foo.php [19:55:34] That one reads the json from a file instead. [19:55:52] Coren: btw, we have mod_rewrite enabled, right? [19:55:59] Coren: and overridable from .htaccess? [19:56:07] * YuviPanda needs to do some work on those... [19:56:10] YuviPanda: Yes on one, I /think/ so for the other. [19:56:16] If not, it should be. [19:56:25] Coren: because I have a .htaccess file and it is not being respected [19:56:35] Coren: wp-signpost, under cgi-bin. [19:56:38] ... or it's wrong. :-) [19:56:43] Coren: I'm unsure if that is because I suck at mod_rewrite [19:56:44] rules [19:56:46] yeah, that :) [19:56:51] or if it is not enabled [19:56:56] higher probability for the former, of course. [19:56:57] still [19:57:04] If it wasn't enable, you'd get 500s [19:57:18] * Coren goes check. [19:57:24] Coren: oh? so if it isn't enabled and I just put a .htaccess there with RewriteEngine On I get 500s? [19:57:40] hmmm. [19:57:51] YuviPanda: You should, since you'd get an error. [19:58:04] hmm, from the RewriteEngine On not being recognized [19:58:05] right [19:58:24] I just looked at your .htaccess [19:58:31] how bad is it? [19:59:13] I've only had to deal with nginx for rewrites for the last 2-3 years (boo apache!) [19:59:44] You suck at RewriteRule. :-) ^/wp-signpost(.*) is in /cgi-bin/ so it'd never match anything but ^/wp-signpost/cgi-bin(.*), it's not going to catch stuff outside of cgi-bin as you seem to intend. :-) [20:00:06] You probably want that .htaccess in public_html instead. :-) [20:00:55] So /wp-signpost/foo would rewrite to /wp-signpost/cgi-bin/foo [20:01:00] Coren: aaaah. That explains. [20:01:07] Coren: so I should have the same file but in public_html [20:01:16] Err, /wp-signpost/cgi-bin/api.py?foo [20:01:24] true [20:01:27] Actually, you probably want it /only/ under public_html [20:01:31] yeah [20:01:32] mv it [20:01:37] let me [20:01:47] Because under /cgi-bin you'll get circular. :-) [20:01:51] yeah [20:01:56] a to b to a to b to.. [20:02:13] Coren: ty! :) [20:02:29] and I'll have to create a new tool called 'signpost' after that, and move things there and delete this one... [20:02:33] because I'm an idiot. [20:02:45] So /wp-signpost/foo -> /wp-signpost/cgi-bin/api.py?foo -> /wp-signpost/cgi-bin/api.py?/cgi-bin/api.py?foo -> /wp-signpost/cgi-bin/api.py?/cgi-bin/api.py?/cgi-bin/api.py?foo ... until Apache tires of your shenanigans. :-) [20:03:31] hmm, http://tools.wmflabs.org/wp-signpost/feed is 404ing [20:03:39] oh no [20:03:40] wait [20:03:56] hmm [20:04:10] Coren: shouldn't /wp-signpost/feed to go /wp-signpost/cgi-bin/api.py/feed [20:04:10] ? [20:04:19] i'm getting a 404 tho [20:04:49] * Coren checking. [20:05:30] http://tools.wmflabs.org/wp-signpost/cgi-bin/api.py/feed works [20:09:00] YuviPanda: From the docz "When using the rewrite engine in .htaccess files the per-directory prefix (which always is the same for a specific directory) is automatically removed for the RewriteRule pattern matching and automatically added after any relative (not starting with a slash or protocol name) substitution encounters the end of a rule set." [20:09:12] hmm [20:09:16] so i need to get rid of wp-signpost [20:09:21] * Coren nods. [20:09:59] RewriteRule ^/(.*) /wp-signpost/cgi-bin/api.py/$1 [20:10:07] (Note the added /) [20:10:21] oooh [20:10:22] right [20:10:40] Actually, your way should work too. [20:11:21] Coren: neither way seems to work [20:11:21] I stillg et 400s [20:11:21] Hm. [20:11:22] err [20:11:23] 404 [20:11:26] * Coren goes see the logs. [20:13:35] Oh. From the same docs: "The removed prefix always ends with a slash, meaning the matching occurs against a string which never has a leading slash. Therefore, a Pattern with ^/ never matches in per-directory context" [20:13:38] So you need: [20:13:44] RewriteRule ^(.*) /wp-signpost/cgi-bin/api.py/$1 [20:13:47] hmmm [20:13:48] A bit of both. :-) [20:13:50] heh [20:13:56] Coren: is there any procedures to force maintainers of a toolserver thingi to move it here? [20:13:59] but the /wp-signpost/ isn't needed? [20:14:28] No, that's the part that's removed: all of /wp-signpost/ (including the trailing /) [20:14:39] [Tue Jul 16 20:14:18 2013] [error] [client 10.4.1.89] File does not exist: /data/project/wp-signpost/cgi-bin/api.pyfeed [20:14:50] You have to add the / in the target. :-) [20:14:53] yeah [20:14:54] just did that [20:14:57] http://tools.wmflabs.org/wp-signpost/feed works [20:15:03] Yeay! [20:15:08] and it's a rewrite not a redirect! yay [20:15:14] and I now suck slightly less at RewriteRules! [20:15:17] so... not bad [20:15:22] ty, Coren :) [20:15:47] AzaToth: Well, not as such. Everyone is a volunteer so we can't 'force' anything; but toolserver is being turned off eventually and anything that didn't move goes away. [20:15:48] Coren: I assume they (UTRS) will require JIRA [20:16:15] Coren: do we have any alternative to JIRA? [20:16:24] AzaToth: Bugzilla. [20:16:53] AzaToth: Although if they prefer / need something else like RT, they can ask for it. Of course, only OSS need apply. [20:17:09] OSS? [20:17:19] Open Source Software. [20:17:28] k [20:17:33] FLOSS if you prefer. :-) [20:17:36] heh [20:17:49] I thought it was a TLA of something else [20:18:12] TDM TLA [20:18:53] (I.e. Jira is right out) [20:19:27] Coren: fyi: I'm currently working on setting up intuition in tool labs (the account was already created, thanks for that). [20:19:47] Krinkle: Yeay! [20:20:09] Krinkle: Let's make sure it's well publicised so that tools can get clean i18n [20:20:24] Yep, I'll let you know as soon as its stable. [20:20:25] oh, JIRA isn't OSS? [20:20:32] Far from OSS. [20:20:53] hmm, Atlassian has nothing OSS? [20:20:59] I... don't know what I was expecting [20:20:59] ok [20:21:17] YuviPanda: I wish. It's actually a reasonably nice piece of software; and they're not Bad Guys (they have inexpensive licenses for small users, and give free license to nonprofits and open source projects) but they aren't open source themselves. [20:21:46] Toolserver has historically always had an Atlassian stack (FishEye, JIRA etc.) [20:21:57] I might be biased after dealing with Gerrit, [20:22:19] on that note I should go back to writing Android applications in Java, in an IDE built with Java, to run on a phone that is mostly Java :) [20:24:08] I'm pretty sure Atlassian would make comparable money to what they did now if they opensourced their stack and sold support instead. But hey, their business, their rules. [20:24:22] Hi I cannot push file from local to remote using scp command [20:24:55] Coren: true [20:24:57] harshkothari: It should work provided you can log in with ssh normally. [20:25:00] harshkothari: can you paste the command? [20:25:13] harshkothari: Do you have more details? (I.e. the command you use, the error message(s)) [20:25:30] YuviPanda: Coren Permission denied (publickey). error [20:25:53] harshkothari: Can you ssh to tools-login properly otherwise? [20:26:16] scp datav.html tools-login.wmflabs.org:/data/project/lcm-dashboard/public_html/visualization YuviPanda Coren is this right ?? [20:26:45] That should work, provided you have your key set up properly. [20:26:57] (and username?) [20:27:18] Ah, yes, you also need your username specified unless you have it in your settings. [20:27:31] where ?? Coren YuviPanda > [20:27:54] username@tools-login.wmflabs.org ? [20:27:58] yeah [20:27:59] scp datav.html harsh@tools-login.wmflabs.org:/data/proj... [20:28:10] ok let me try [20:29:38] Coren: YuviPanda yeah it worked thanks :) :) [20:30:41] :) [20:35:40] ping andrewbogott_afk when you are back [20:38:15] Coren: Re https://gerrit.wikimedia.org/r/#/c/71112/, yep, I'll finish that soon. [20:47:35] https://www.mediawiki.org/w/index.php?title=Accessing_labs_without_special_software_ondiscussionpage:Project:Support_desk&redirect=no [20:47:47] someone hit an issue with LQT... [20:47:54] but anyway there's a labs question [21:06:37] Coren, YuviPanda: seems that there's a new openstack project that's not in incubation yet, but does NoSQL as a service and they have a redis POC [21:06:53] hmm, so that does isolation? [21:07:00] and multitenancy? [21:07:14] it should. let me make sure this isn't some piece of crap service [21:07:18] haha :) [21:07:38] uuuuggggh [21:07:40] nevermind [21:07:43] haahahahaha :D [21:07:46] this is a renamed red dwarf [21:07:53] I forgot they had to change their name [21:07:59] no thanks [21:08:02] hmm, burnt out husk of something that was actually useful at some time? :) [21:08:04] heh :) [21:08:11] it's a DBaaS [21:08:16] meant for mysql [21:08:24] but it's really just a freaking orchestration framework [21:08:24] ow [21:08:38] that talks to nova and puts stuff onto virtual machines [21:08:40] it's worthless [21:08:43] heh. [21:09:45] Ryan_Lane: I don't like the fact that we run a patched memcached, but my patches to memcached have been met with silence (along with every other patch for the last 9 months) [21:09:53] * Ryan_Lane nods [21:09:58] Ryan_Lane: would ideally like to kill it, since redis can be used for most similar use cases [21:09:59] thoughts? [21:10:01] Coren: ^ [21:10:06] I think we should kill it [21:10:21] Ryan_Lane: and also kill petan's C# memcached rewrite deamon :) [21:10:27] +1 [21:10:27] which also runs on tools-mc [21:10:57] none of these are on puppet or anything, so someone logging in and doing a purge would be more than good enough [21:11:04] and of course, removing them from !help [21:11:20] Ryan_Lane: but does this mean that memcached as a project is now dead/too-stable? [21:11:31] maybe a little of both? [21:11:40] true, but that is sad. [21:13:04] Ryan_Lane: either you or Coren should write to labs-l about memcached, since I am pretty sure I will be stoned if I write about killing memcached [21:13:22] well, you could suggest it and I could +1 it [21:13:56] I'm already heading to a minor stoning next week when people suddenly discover changes to UploadCampaigns (despite me posting about it to all relevant places)... :P [21:14:04] if you really don't want to, I can. [21:16:15] I got it [21:16:21] Ryan_Lane: ty :) [21:19:28] done [21:20:48] Ryan_Lane: can we also rename tools-mc to be tools-redis? [21:21:07] post to the thread ;) [21:21:49] yeah [21:21:56] Ryan_Lane: can openstack do renames? [21:22:03] I remember hearing something about that being a pain. [21:22:20] only the display name, unfortunately [21:22:30] maybe when we switch to using delegate? [21:23:16] hmm, okay [21:23:28] Ryan_Lane: we can at least have a dns entry? and encourage people to use that? [21:23:50] that only really works on public IPs [21:23:55] and public IPs don't work internally [21:24:03] hmm? how is tools-mc resolved? [21:24:11] dns [21:24:19] yeah, so can't we add a new one? [21:24:37] new entry to that dns? or maybe even hack in an /etc/hosts... :) [21:24:41] why not just create tools-redis [21:24:44] and replicate the keys [21:24:47] then delete the old one [21:24:50] hmm, we could. [21:24:56] well, we could just move the aof files [21:24:58] and restart [21:24:59] should be fine [21:25:00] that too [21:25:08] I don't want to put in any manual hacks [21:25:11] true [21:25:16] we have automation for a reason ;) [21:25:22] but people will be currently using tools-mc, so that needs to be fixed. [21:25:27] yeah, anyway, we can give them a month's notice [21:25:28] and stuff [21:25:43] * Ryan_Lane nods [21:26:15] okay, +1'd. thanks! [21:26:37] yw [21:29:55] +2'ed [21:29:56] :-) [21:32:10] heh [21:32:16] consensus met! [21:32:18] :D [21:33:02] but what if someone finds out that I'm just Coren's sockpuppet?! [21:34:39] YuviPanda: I've met you irl ;) [21:35:02] Ryan_Lane: you're coming to HK, right? [21:35:21] yep [21:36:08] nice! [21:44:53] Coren: have you seen bug 51212? [21:45:15] it also screws up other scripts of mine [21:45:30] !b 51212 [21:45:30] https://bugzilla.wikimedia.org/51212 [21:45:49] giftpflanze: increase memory limit? [21:45:52] giftpflanze: to, say, 1G? [21:45:58] hm [21:46:39] [bz] (8NEW - created by: 2m.p.roppelt, priority: 4Unprioritized - 6normal) [Bug 51212] libgcc_s.so.1 must be installed for pthread_cancel to work - https://bugzilla.wikimedia.org/show_bug.cgi?id=51212 [21:48:09] giftpflanze: does that work? [21:48:20] still trying [21:48:22] ok [21:48:28] giftpflanze: is this tcl? or php? [21:48:47] tcl in sh wrapper [21:49:04] hmm, okay [21:49:20] if that doesn't work we might need a 32bit wrapper, which Coren can look into. but this should idaelly fix it [21:56:57] seems to help [21:57:27] giftpflanze: \o/ [21:57:37] :) [21:59:54] giftpflanze: can you comment on the bug to let people know that it helped you? [22:00:03] of course [22:00:15] giftpflanze: wonderful :) [23:00:15] That 'libgcc_so' missing message is, 100% of the time, "not enough memory to load the executable and all its shared libraries". The message is dumb, admitedly, but not under my control. [23:00:42] Coren: true, we should add it to help [23:00:44] or something [23:00:50] !help [23:00:51] !documentation for labs !wm-bot for bot [23:00:54] !toolshelp [23:00:58] !toolhelp [23:01:02] hmm [23:01:04] !tools [23:01:04] There are multiple keys, refine your input: tools-admin, tools-bug, tools-help, tools-request, tools-web, [23:01:08] !tools-help [23:01:09] https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help [23:01:19] Ah, I see petan has renamed his keys. [23:01:37] Coren: mind if I update? [23:01:42] add a section, rather [23:01:59] Of course not. [23:02:04] "It's a Wiki™" [23:04:41] Coren: {{done}}. Let me know if that was inaccurate. [23:05:44] "Why am I errors"? [23:05:46] * Coren chuckles. [23:06:14] Coren: ... I didn't see that at all, I swear [23:06:22] missed a 'getting' [23:06:56] Coren: fixed [23:07:06] Not just most probably, too, it's definitely that. libgcc_so cannot /not/ be installed; failing to load it can only be caused by oom. [23:07:29] (Well, or the system being completely broken, but *nothing* would work) [23:07:50] well, 'most probably' is weasely, but this isn't enwik... [23:28:35] aude: Sorry, having connection problems today. What's up? [23:55:30] !log deployment-prep applying 'role::eventlogging' to i-00000733 [23:55:48] ori-l: morebots is dead [23:56:14] * andrewbogott pokes morebots [23:56:40] it's not on the channel, is it? [23:56:56] it isn't [23:56:56] !log andrewbogott kicked morebots, but morebots started it [23:57:09] !log morebots is my imaginary friend [23:57:20] Logged the message, master. [23:57:25] !log is Do it morebots you lazy bum! [23:57:31] .. [23:57:38] T13|playingDad: heh [23:57:54] !log is Do it morebots you, lazy bum! [23:57:54] is is not a valid project. [23:57:59] haha :) [23:58:03] and no abusing the SAL! [23:58:14] Keke [23:58:22] !log deployment-prep applying 'role::eventlogging' to i-00000733 [23:58:25] Logged the message, Master [23:58:31] why thank you, labs-morebots [23:58:42] [bz] (8ASSIGNED - created by: 2Legoktm, priority: 4Unprioritized - 6normal) [Bug 49088] archive table not accessible - https://bugzilla.wikimedia.org/show_bug.cgi?id=49088 [23:58:58] That command must be on the ignore list...