[00:49:25] 06Labs, 10Tool-Labs: Can't login to tts service group in tools - https://phabricator.wikimedia.org/T149505#2754654 (10Ladsgroup) [01:01:19] 06Labs, 10Tool-Labs: New user home directory creation broken - https://phabricator.wikimedia.org/T149505#2754707 (10Krenair) [01:52:51] 8:43 PM Anyone have any idea why my bot will run on bastion but not on the grid even with trusty?? Ive asked in labs but havent really got a response [01:53:15] what was the error from the grid? [01:57:42] T148944 [01:57:43] T148944: Complete config - https://phabricator.wikimedia.org/T148944 [01:58:55] what command are you running to submit it to the grid engine Zppix? [01:59:43] because to me it looks like you tried to provide -l release=trusty as an argument to your program rather than jsub [01:59:46] Ive tried jstart ive tried jsub with -l release trusty ive tried jstart with the release arg [01:59:54] the full command, Zppix [02:00:03] What should i donthen [02:00:05] Do [02:00:12] paste the full command [02:01:08] I tried jstart -l release=trusty sopel [02:01:20] and this resulted in? [02:01:22] Jsub -l release=trusty sopel [02:01:25] Same [02:01:29] As it shows on task [02:02:30] krenair@tools-bastion-03:~$ jstart -l release=trusty lsb_release -c [02:02:31] Your job 234671 ("lsb_release") has been submitted [02:02:31] krenair@tools-bastion-03:~$ cat lsb_release.out [02:02:31] Codename: trusty [02:03:03] ?? [02:03:39] jsub does the same [02:03:52] Does the codename mean it uses trusty [02:03:55] Im confused [02:04:00] yes [02:04:03] Ok [02:04:05] Let me see [02:04:56] What is this 'sopel' anyway? [02:05:39] I just get jstart: error: argument program: Program 'sopel' not found. [02:06:44] Its in envs/bin/ [02:06:48] Its irc bt [02:06:49] Bot [02:07:05] See wikitech.wikimedia.org/wiki/tool:Zppixbot [02:09:12] still doesnt work [02:22:30] Zppix, I didn't change anything [02:22:50] so it's not surprising that doing the same command still produces the same response [02:25:42] !log tools restarted maintain-kubeusers [02:25:47] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL [02:27:33] 06Labs, 10Tool-Labs: New user home directory creation broken - https://phabricator.wikimedia.org/T149505#2754785 (10Krenair) 05Open>03Resolved a:03yuvipanda @yuvipanda restarted maintain-kubeusers on tools-k8s-master-01, it works again now: ```krenair@tools-bastion-03:~$ sudo become tts tools.tts@tools-b... [02:28:45] 06Labs, 10Tool-Labs: become should have a better error message when homedir doesn't exist - https://phabricator.wikimedia.org/T149511#2754788 (10yuvipanda) [03:39:41] 06Labs, 07Tracking: Existing Labs project quota increase requests (Tracking) - https://phabricator.wikimedia.org/T140904#2754813 (10Andrew) [04:18:50] 10Labs-project-Wikistats: W3C wiki updates broken - https://phabricator.wikimedia.org/T149000#2754818 (10Dzahn) @hashar Thank you for this, but there is a difference between different tables. The "fixit" thing is for wikis in the "mediawikis" table, while it does not apply to "farm" tables likw W3C where we dont... [04:37:37] 10Labs-project-Wikistats: W3C wiki updates broken - https://phabricator.wikimedia.org/T149000#2754824 (10Dzahn) So in this case we actually just had to switch http to https here: https://gerrit.wikimedia.org/r/#/c/318776/1/usr/lib/wikistats/update.php and that switches LXDE and Editthis Wikis to https [04:39:48] 10Labs-project-Wikistats, 13Patch-For-Review: LXDE wiki updates broken - https://phabricator.wikimedia.org/T149396#2754825 (10Dzahn) @RobiH yep, it's the protocol change. http -> https (did not change the format part) https://gerrit.wikimedia.org/r/#/c/318776 https://gerrit.wikimedia.org/r/#/c/318776/2/usr... [04:40:45] 10Labs-project-Wikistats: W3C wiki updates broken - https://phabricator.wikimedia.org/T149000#2754828 (10Dzahn) [04:41:05] 10Labs-project-Wikistats, 13Patch-For-Review: LXDE wiki updates broken - https://phabricator.wikimedia.org/T149396#2750958 (10Dzahn) http://wikistats.wmflabs.org/display.php?t=lx [06:46:52] PROBLEM - Puppet run on tools-services-01 is CRITICAL: CRITICAL: 44.44% of data above the critical threshold [0.0] [06:51:20] PROBLEM - Puppet run on tools-webgrid-lighttpd-1411 is CRITICAL: CRITICAL: 22.22% of data above the critical threshold [0.0] [07:26:52] RECOVERY - Puppet run on tools-services-01 is OK: OK: Less than 1.00% above the threshold [0.0] [07:31:21] RECOVERY - Puppet run on tools-webgrid-lighttpd-1411 is OK: OK: Less than 1.00% above the threshold [0.0] [12:24:17] Change on 12wikitech.wikimedia.org a page Nova Resource:Tools/Access Request/GNegar was created, changed by GNegar link https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Access_Request/GNegar edit summary: Created page with "{{Tools Access Request |Justification=Persian auto correction tool |Completed=false |User Name=GNegar }}" [12:28:14] does anybode here have a working piece of code to connect a java-programm to a mysql/maria-db database? [12:31:55] CreepyZombie https://dev.mysql.com/downloads/connector/j/5.1.html [12:32:35] I'll check it out later, thx [12:32:42] Your welcome :) [12:53:36] PROBLEM - Host tools-secgroup-test-102 is DOWN: CRITICAL - Host Unreachable (10.68.21.170) [12:55:13] Change on 12wikitech.wikimedia.org a page Nova Resource:Tools/Access Request/GNegar was modified, changed by Tim Landscheidt link https://wikitech.wikimedia.org/w/index.php?diff=939604 edit summary: [14:14:22] 10Striker, 05WMF-deploy-2016-10-11_(1.28.0-wmf.22), 05WMF-deploy-2016-10-25_(1.28.0-wmf.23): Check for 2FA protection and enforce validation of 2FA tokens - https://phabricator.wikimedia.org/T144712#2755023 (10Reedy) [14:17:43] PROBLEM - Host tools-secgroup-test-103 is DOWN: CRITICAL - Host Unreachable (10.68.21.22) [14:28:30] PROBLEM - Host secgroup-lag-102 is DOWN: CRITICAL - Host Unreachable (10.68.17.218) [15:48:16] halfak, can you clean up your homedir on tools-bastion-02? It's getting a bit crowded in there. (Not an emergency, but, next few days would be good.) [15:48:50] halfak: oh, wait, nevermind, I think I'm mis-identifying the problem [15:54:28] (I briefly forgot that homedirs are on NFS and so not relevant to disk warnings :) ) [16:13:13] !log tools.lolrrit-wm testing ssl connection to irc on grrrit-wm [16:13:15] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.lolrrit-wm/SAL [16:24:34] (03CR) 10Glaisher: "Only had a quick look and it looks okay code-wise but I'm not sure this the right approach. This seems like the sort of thing which should" [labs/tools/stewardbots] - 10https://gerrit.wikimedia.org/r/318229 (https://phabricator.wikimedia.org/T149265) (owner: 10Platonides) [16:43:01] (03Draft1) 10Paladox: Make grrrit-wm connection to freenode secure using ssl [labs/tools/grrrit] - 10https://gerrit.wikimedia.org/r/318790 [16:43:04] (03Draft2) 10Paladox: Make grrrit-wm connection to freenode secure using ssl [labs/tools/grrrit] - 10https://gerrit.wikimedia.org/r/318790 [16:44:08] !log tools.lolrrit-wm test is a success switching grrrit-wm to ssl and deploying https://gerrit.wikimedia.org/r/#/c/318790/ to switch it [16:44:10] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.lolrrit-wm/SAL [16:44:20] (03CR) 10Paladox: [C: 032] Make grrrit-wm connection to freenode secure using ssl [labs/tools/grrrit] - 10https://gerrit.wikimedia.org/r/318790 (owner: 10Paladox) [16:44:50] (03Merged) 10jenkins-bot: Make grrrit-wm connection to freenode secure using ssl [labs/tools/grrrit] - 10https://gerrit.wikimedia.org/r/318790 (owner: 10Paladox) [17:46:18] !log lolrrit-wm testing trying for a specifyied name instead of it comming up with random ones. [17:46:19] Unknown project "lolrrit-wm" [17:46:19] Did you mean to say "tools.lolrrit-wm" instead? [17:46:29] !log tools.lolrrit-wm testing trying for a specifyied name instead of it comming up with random ones. [17:46:31] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.lolrrit-wm/SAL [18:10:33] !nick [18:10:56] paladox: ? [18:11:25] Luke_ just trying something, i found this https://github.com/matrix-org/matrix-appservice-irc/blob/master/config.sample.yaml#L196 [18:11:33] I am deciding weather to try the libary :) [18:13:26] That's actually a good idea to support !nick. [18:14:00] I also want to get !restart-grrrit-wm supported to :) [18:14:05] Luke_ ^^ [18:14:43] why don't do it via shell? IRC-Commands make it easier to abuse, and you need more access management [18:17:20] Luke_ it is already supported via shell [18:17:48] But we need it easyer to restart it [18:17:54] without ssh into the instance [18:18:16] Oh your talking about the nick [18:20:14] about every command from IRC [18:24:03] Oh, i would only support !restart, but maybe we could do what wm-bot does and only support it for certain user [18:24:05] users [18:29:53] paladox: add a userlist for admins and trusted (wmf related cloaks) [18:29:58] ? [18:31:43] Zppix: this always exluce me :P [18:32:08] Become special then maybe just kidsin [18:32:11] Kidding [18:32:25] you're eligible for a wikimedia cloak aren't you? [18:33:42] yep, but I don't want to change :P [18:34:23] since this cloak gives me access to a bunch of channels too [18:34:34] Ok [18:34:44] Luke_ https://github.com/draggor/node-ircbot [18:34:55] permission section [18:39:52] Luke you can technically have 2 cloaks you know right [18:40:30] Zppix: you mean like @tnb/staff/wikimedia.Luke081515? [18:41:08] I dont know how it works [18:41:14] I just know you can [19:18:01] o/ madhuvishy [19:18:04] Around? [19:18:26] halfak: yeah! [19:18:38] \o [19:18:47] I saw andrewboggot's ping earlier about my homedir size on tools [19:18:56] It's 650MB. Is that too much? [19:20:28] halfak: aah, yeah that's about 16% of all of tools [19:21:20] 650MB is 16% of all available space? o.O [19:21:45] I see 4.1T of 8T in use [19:21:47] halfak: no. 8% of available space. we have 8T [19:21:48] yeah [19:22:00] "MB" [19:22:16] oh [19:22:18] sorry [19:22:21] i misread [19:22:29] halfak: not at all, you're fine :) [19:23:38] You know, back when I started grad school, we had a 250MB quota on our NFS-based home directories. [19:23:52] That was bananas. It was 2006. [19:23:59] i think andrew thought it was in the local disk of tools-bastion, which have less space. [19:24:10] oh wow [19:24:14] Oh! That makes sense. [19:25:12] Oh well. andrewbogott said he was mistaken, but I thought I'd check in anyway. Thanks :) [19:25:14] o/ [19:25:31] yw :) [19:25:37] * halfak considers puppetizing snuggle today. [19:30:25] madhuvishy, sorry one more thing. What is your puppet development workflow when you are setting up a new project? [19:31:44] halfak: mostly using self hosted puppet master on a labs instance, pushing to gerrit and testing on the labs instance. [19:32:30] you could also develop on the instance with vim, but have to setup gerrit there with ssh keys to push directly [19:32:47] Is there a good wiki page documenting how to do something like that? [19:33:05] * halfak isn't clear on self-hosting a puppet master [19:35:43] halfak: I don't think so. The way to set up own puppetmaster has changed a bit with yuvi's recent work on it too - I ran into some trouble doing this yesterday and he said he would fix the docs. https://wikitech.wikimedia.org/wiki/Standalone_puppetmaster. [19:36:12] I think at the moment, make a new instance, apply role puppetmaster::standalone through Horizon (I can help with this), and then poke yuvipanda [19:36:31] Bummer. OK, we'll, maybe today I'll just take detailed notes on my migration and use that to build up puppet some other time. [19:36:58] okay, i'll try to put some docs together meanwhile [19:37:47] madhuvishy, was thinking that a mini-puppetizing-workshop might be a fun thing to do at a hackathon [19:38:40] halfak: yeah for sure [19:38:59] halfak: is there a reason why you aren't running snuggle on tools-kubernetes? [19:39:18] madhuvishy, snuggle predates tools-kubernetes quite a bit [19:40:15] ah, I guess my question is, have you considered running it on k8s, and made a conscious decision not to, and what the reasoning was [19:41:02] Oh. Yeah. Have not yet seriously considered. I use MongoDB -- is that going to work out? [19:41:17] If I could have all the time, I'd probably spend some time on it. [19:41:19] Yes [19:41:44] Wait [19:41:47] No [19:41:52] Sql i believe [19:42:10] I get tools and mw confused [19:42:12] Lol [19:42:37] :P [19:43:06] halfak: ah hmmm mongodb might be tricky [19:43:16] I use denormalization and Mongo's efficient update patterns to make a live-update problem tractable. [19:43:25] So it might be a trick to move to something else. [19:43:32] Honestly, I think redis would work fine. [19:43:38] But it would require a lot of coding time :( [19:43:51] redis would be far superior, to be honest :) [19:44:31] * halfak starts up a Jessie instance to replace an Ubuntu 12.04 instance. [19:44:32] :D [20:01:46] halfak: you could also run mongodb from a labs instance (mongo is already puppetized i think), and have snuggle talk to it from tools [20:02:11] would require fairly less effort [20:03:30] Hi, why is loading python interpreter at toollabs taking about 4 seconds? I tried to load it at another computer, it took only 20 miliseconds. [20:06:21] Urbanecm: tools runs on top of nfs - therefore while it looks like simple disk IO, there's a a bunch of talking over the network happening underneath [20:06:53] Stats: dev.tools.wmflabs.org: 4 seconds; bastion.wmflabs.org: 20 miliseconds, masaryk.wikimedia.cz (this computers is under control of Wikimedia Czech Republic) 20 ms, Raspberry Pi: 191 miliseconds, login.tools.wmflabs.org: 2 seconds [20:07:15] so performance on tools is not really comparable to local disk [20:07:20] I think bastion.wmflabs.org use NFS too. Am I right? [20:08:06] Urbanecm: no, that would be local too i think [20:09:13] home directories on tools comes from nfs. not for the rest of labs [20:09:20] Okay, thanks. [20:09:28] So let's call it normal. [20:09:40] But is it okay too that jsub pwd runs for almost 11 seconds? [20:12:13] Urbanecm: it's possible - a bunch of things need to happen for a job submitted to the grid to complete. It needs to gets scheduled on an instance first and then run. Plus, all of this is happening over nfs [20:12:33] Okay. Could you tell me which things exactly? [20:13:32] Urbanecm: https://wikitech.wikimedia.org/wiki/Help:Tool_Labs/Grid [20:13:37] Thans. [20:13:39] *Thanks [20:15:36] I'm getting a weird error when setting up the /srv/ on a brand new instance (snuggle-enwiki-01.eqiad.wmflabs). See https://gist.github.com/halfak/3b132a307445990295b99dd71ccc0e5f [20:15:58] /usr/local/sbin/nfs-mount-manager mount /home returned 32 instead of one of [0] [20:16:29] halfak: does it persist if you rerun puppet? [20:17:17] Will try again in a moment and report [20:21:03] madhuvishy, error persists [20:24:36] Looks like I was using apache for snuggle O.O [20:24:41] yuck [20:24:54] halfak: looking [20:25:38] madhuvishy, looks like I have something mounted there anyway. [20:25:41] "/dev/mapper/vd-second--local--disk 21G 44M 19G 1% /srv" [20:25:51] ^ from "df -h" [20:26:07] halfak: that's just your local disk [20:26:21] Hmm... I kind of just want local disk :) [20:26:36] ah, yeah - but the snuggle project has nfs mounted [20:26:39] was only worried about the /srv mount having some space for me to put stuff :D [20:26:47] the nfs server hasn't picked up on the new instance yet [20:26:49] I want to turn that off. [20:26:53] so puppet is failing [20:27:12] ah you don't want it at all? there are 2 instances i think [20:27:59] halfak: if you'd like it turned off, that's easy to do - can you file a task I can take care of it [20:28:05] Yeah. I don't use it at all. [20:28:05] probably tomorrow though [20:28:30] madhuvishy, gotcha. I'll do that as soon as I have the migration working so that we can make sure I don't accidentally break anythign first :D [20:34:05] halfak: run puppet now? [20:34:11] should be fixed. [20:34:32] Thanks madhuvishy [20:39:59] madhuvishy, I can't "sudo -u www-data ..." [20:40:10] I know there's a security setting to change for this, but I can't find it. [20:50:49] halfak: lojjik [20:50:52] uhh [20:50:54] looking [20:51:01] sorry for the wrong ping! [20:51:49] ha! [20:52:04] Thanks for helping me today :) [20:52:52] * halfak is trying to cram some volunteer hours in and appreciates madhuvishy doing the same. [20:53:54] halfak: iirc - it's supposed to be here https://wikitech.wikimedia.org/wiki/Special:NovaSudoer [20:54:04] the page is really slow though [20:54:11] \o/ [20:54:14] That looks familiar [20:55:34] halfak: :) ya it doesn't load for me for some reason [20:55:52] but wikilabels/wikimetrics should have settings for www-data if you wanna check [21:02:03] Got it working \o/ [21:09:52] yay [21:22:11] is an admin online? [21:22:26] madhuvishy, ^ [21:22:40] Terminator: sup? [21:27:49] madhuvishy: handling with krenair now, thanks :) [21:28:23] okay :) [21:43:31] yuvipanda / valhallasw`vecto : Mind giving http://wdq.wmflabs.org/ a nudge? [21:47:51] PROBLEM - Puppet run on tools-services-02 is CRITICAL: CRITICAL: 33.33% of data above the critical threshold [0.0] [21:48:10] !log wdq-mm restart wdq-01.wdq-mm.eqiad.wmflabs, instance was unsshable [21:48:10] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Wdq-mm/SAL [21:49:27] multichill: it seems to be performing a lot of IO, I guess it's trying to start up? [21:55:04] multichill: do you know how long it's been down? [21:55:38] multichill: logs say it might've been down since... june?! [21:55:46] that sounds wrong [21:55:49] multichill: anyway, is back up now [21:56:16] yuvipanda: I would say since last round of reboots? [21:57:48] multichill: yeah, not entirely surprised I guess [21:57:58] multichill: do you have any idea what % of tools are still using it vs wdqs? [21:58:03] I just noticed because I forgot one of my bots still used it [21:58:16] Switched it to the other system right away [21:58:48] ah nice :D [22:03:59] I'm not sure why www-data seems to not have permission to bind to port 80 on jessie. [22:04:07] Anyone know what might be up with that. [22:05:20] hey halfak [22:05:29] Hey yuvipanda [22:05:37] www-data usually doesn't have permission to bind to port 80 anywhere. usually nginx / apache spawn as root, bind, then fork [22:06:11] Gotcha. Hmm. For this system, I don't have nginx or apache. I was hoping to just use uwsgi directly. [22:06:35] Maybe I could do my routing to 8080 in the dns [22:06:51] halfak: yeah [22:06:58] uwsgi has the --http thing [22:07:05] easiest is to make it listen on 8080 yes [22:07:36] I've fiddled with it before and didn't find an easy way for uwsgi to bind to 80 as root and then fork [22:07:42] but I didn't spend more than 30min looking [22:08:45] Yeah. That's what a quick googling showed me too [22:08:49] Thanks this will work [22:10:34] halfak: cool. there are ways to hack around it, but not worth it imo [22:27:49] RECOVERY - Puppet run on tools-services-02 is OK: OK: Less than 1.00% above the threshold [0.0] [23:09:39] OK. Half-way through the migration. Calling it quits for today. Sync server was started and that will be a good test. [23:10:01] !log snuggle started up snuggle-enwiki-01. syncd running [23:10:03] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Snuggle/SAL