[09:12:57] !log tools.admin restarted service (T213147) [09:13:00] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.admin/SAL [09:13:00] T213147: Toolforge Home Page is CRITICAL - https://phabricator.wikimedia.org/T213147 [09:59:44] !log tools rebooted tools-checker-01 (T213252) [09:59:48] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL [09:59:48] T213252: toolscheckerctl fails to start checks - https://phabricator.wikimedia.org/T213252 [14:42:04] !log tools experimentally moving tools-paws-worker-1019 to eqiad1 [14:42:06] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL [16:27:22] https://paws.wmflabs.org/ -> 502 Bad Gateway [16:27:36] fuzheado: that's my fault, we're working on it [16:27:50] Cool, thanks! [16:33:12] Hi! Are there problems with the enwiki databases? [16:33:59] select count(*) from revision join page on rev_page = page_id where p [16:34:34] age_title = 'Castle_Conway' and page_namespace = 0 [16:34:47] SQL queries like this ^ do not work [16:35:36] chasemp andrewbogott anyone here? [16:47:29] in enwiki? checking [16:48:31] There's a problem with s5 replication, but I'm not aware of much on enwiki...looking around [16:51:02] Definitely looking slow... [16:52:34] Ah I see it [16:53:31] doctaxon: looks like someone dropped a column in the page table. I'll check if they are rebuilding the views yet or if that is something I can jump on. [16:56:25] bstorm_ jynus: is it running now [16:56:36] now it's okay [16:56:42] Ah ok :) [16:56:52] Whatever it was, I think someone was in the process of resolving it [16:57:30] bstorm_: also being talked about in -databases channel fyi [16:57:43] There's a databases channel?!?!?! [16:57:55] :) [16:58:07] T212254 [16:58:07] T212254: Drop valid_tag table - https://phabricator.wikimedia.org/T212254 [16:58:57] * bstorm_ joined [17:00:46] ya, thank you, it is working fine now [17:12:15] the new bigbrother doesn't work with symlinks, right? [17:17:01] !log tools moving paws-worker-1017 and paws-worker-1016 to eqiad1 [17:17:04] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL [17:17:59] revi: it submits the command just as you'd do manually. i'd suggest using the full path to your command just in case [17:18:10] fuzheado: paws should be back now :) [17:18:23] seems someone else from our team fixed it [17:18:33] andrewbogott: thanks much! [17:18:42] just wondering if jlocal is something different from jsub or qsub like thing? [17:18:49] fuzheado: gtirloni and chicocvenancio fixed it, I just broke it :) [17:18:52] so I can just put -quiet on bigbrother jobs [17:20:13] !log tools.stewardbots Disable jlocal crontab commands [17:20:14] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.stewardbots/SAL [17:20:30] revi: you could edit bigbrother.sh and add -quiet there, the bigbrother.sh is totally yours :) [17:21:16] lemme seeā€¦ [17:21:29] this is first time I ssh to box after the bigbrother migration [17:22:11] I actually thought it was something shared lol [17:22:51] Hauskatze: lol you do that while I was trying to fix spambot [17:23:45] revi: jlocal is for very short lived processes that actually run directly on the cron instance rather than on the grid. Its needed for the bigbrother.sh script because the normal grid nodes are not marked as job submission hosts, so a job launched on the grid would not be able to start the monitored job if it is not found [17:25:26] I guess submit_job function is where I need to put [17:25:29] -quiet? [17:25:34] (mobile sucks) [17:27:07] revi: yes [17:27:18] thanks! [17:28:26] !log tools.stewardbots fix bigbrother.sh to do their job quietly & re-enable the bigbrother via crontab [17:28:27] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.stewardbots/SAL [17:29:45] thanks gtirloni and bd808! [17:29:52] probably fixed by that [17:30:36] nice! [17:31:12] revi: oops, but glad you fixed it -- methinks we lack coordination :) [17:31:39] well that was fine in case there was floodquit while I was fixing it [17:31:58] that would've prevented my inbox being spammed every 5 minutes again [17:32:21] * Hauskatze doesn't have the email open, floods revi's [17:33:01] >_< [17:33:12] RIP my mailbox [17:45:59] didn't work quite well [17:46:02] > Usage: /data/project/stewardbots/bigbrother.sh [17:49:06] !log tools.stewardbots disable bigbrother again [17:49:07] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.stewardbots/SAL [17:51:10] can someone (maybe Hauskatz) make sure crontab is gone? I got a crontab mail [17:51:20] I need to sleep so [17:51:47] revi: what kind of emails are you receiving? is bigbrother.sh restarting your jobs constantly? [17:51:58] looks like so [17:52:15] this is what I got https://usercontent.irccloud-cdn.com/file/TdI9rkq8/IMG_7182.PNG [17:53:05] "/usr/bin/python /data/project/stewardbots/StewardBot/StewardBot.py" [17:53:08] this needs to be a single command [17:53:11] was it like that before? [17:53:17] IIRC it was [17:53:48] or it was somehow converted to that while bigbrother migration? [17:54:07] https://www.irccloud.com/pastebin/PwGGxHuz/ [17:54:10] this should work now [17:54:32] thx [17:54:37] rxy: ^ in case [17:54:52] (you're still working on this) [17:54:57] anyway 3am wow [17:54:58] nini [17:55:07] g'night :) [17:55:11] g'night [17:55:22] (tho I recall saying this lol) [18:34:46] gtirloni: im getting your test emails :P [18:42:59] addshore: thanks :) i was testing the wrong thing lol, sorry [18:52:14] gtirloni: hi, around? [18:53:16] https://wikitech.wikimedia.org/wiki/Help:Toolforge/Grid#Bigbrother_(Deprecated) isn't work for my python script. [19:00:23] rxy: what's your tool? what's the error? [19:00:42] at tools.stewardbots [19:00:47] no error output [19:01:11] but it seems can't found python env [19:02:38] how do you usually start it? [19:03:16] we usually using ` /data/project/stewardbots/StewardBot/restart_stewardbot.sh ` [19:04:13] looks like there's another wrapper script that talks directly with the grid [19:04:16] the old bigbrother had this: [19:04:18] #jstart -N stewardbot -mem 2G python /data/project/stewardbots/StewardBot/StewardBot.py [19:04:18] #jstart -N sulwatcher -mem 2G python /data/project/stewardbots/SULWatcher/SULWatcher.py [19:05:16] I guess "jlocal" is system user without any path? [19:05:18] -quiet is not a valid option for jstart [19:05:50] i've removed it from bigbrother.sh [19:07:30] thx [19:07:44] the bot still can't start though, even manually without using bigbrother.sh [19:08:53] ah ok, /mnt/nfs/labstore-secondary-tools-project/stewardbots/public_html/StewardBot/StewardBot.py wasn't executable (`chmod +x`) [19:08:57] should be fine now [19:10:12] ah, thanks [19:16:59] work correctly! Thanks a lot :) [19:24:52] :) [20:16:56] !log tools moving tools-paws-worker-1013 and tools-paws-worker-1007 to eqiad1 [20:16:59] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL [22:21:08] !log admin neutron quota-update --tenant-id tools --port 256 [22:21:10] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Admin/SAL