[00:00:05] (03CR) 10Krinkle: [C: 032] Remove unused 'p[class=statistics]' [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/434117 (owner: 10Krinkle) [00:00:08] (03CR) 10Krinkle: [C: 032] Move "Limited to 20 results per wiki" message from bottom to top [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/434118 (https://phabricator.wikimedia.org/T167524) (owner: 10Krinkle) [00:00:45] (03Merged) 10jenkins-bot: Remove unused 'p[class=statistics]' [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/434117 (owner: 10Krinkle) [00:00:47] (03Merged) 10jenkins-bot: Move "Limited to 20 results per wiki" message from bottom to top [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/434118 (https://phabricator.wikimedia.org/T167524) (owner: 10Krinkle) [06:36:28] (03PS1) 10Legoktm: Support upgrading phan-taint-check-plugin [labs/libraryupgrader] - 10https://gerrit.wikimedia.org/r/434124 [06:36:45] (03CR) 10jerkins-bot: [V: 04-1] Support upgrading phan-taint-check-plugin [labs/libraryupgrader] - 10https://gerrit.wikimedia.org/r/434124 (owner: 10Legoktm) [06:37:24] (03PS2) 10Legoktm: Support upgrading phan-taint-check-plugin [labs/libraryupgrader] - 10https://gerrit.wikimedia.org/r/434124 [11:06:24] (03CR) 10Lokal Profil: [C: 032] Allow to override replicas database credentials [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/433949 (owner: 10Jean-Frédéric) [11:07:51] (03Merged) 10jenkins-bot: Allow to override replicas database credentials [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/433949 (owner: 10Jean-Frédéric) [11:08:46] (03CR) 10jenkins-bot: Allow to override replicas database credentials [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/433949 (owner: 10Jean-Frédéric) [14:04:40] (03Draft1) 10Paladox: Increase delay for reporting a second time about failures [labs/icinga2] - 10https://gerrit.wikimedia.org/r/434180 [14:04:42] (03PS2) 10Paladox: Increase delay for reporting a second time about failures [labs/icinga2] - 10https://gerrit.wikimedia.org/r/434180 [14:04:45] (03CR) 10Paladox: [V: 032 C: 032] Increase delay for reporting a second time about failures [labs/icinga2] - 10https://gerrit.wikimedia.org/r/434180 (owner: 10Paladox) [17:18:39] is there any reason why sulinfo is down? [22:29:56] (03PS1) 10Rosalieper: First set of pictures [labs/tools/Commons-twitter-bot] - 10https://gerrit.wikimedia.org/r/434272 [22:31:30] !help Question about Toolforge Grid. How does the -once flag work? [22:31:30] valerio-bozzolan: If you don't get a response in 15-30 minutes, please create a phabricator task -- https://phabricator.wikimedia.org/maniphest/task/edit/form/1/?projects=wmcs-team [22:31:57] I mean, it should prevent a task (with the same name?) to be runned twice. Isn't it? [22:33:14] yes [22:33:15] But I created a stupid script with "sleep 5; echo 1" and called multiple time using "jsub -once -N asd.sh", but I can run it twice [22:33:16] valerio-bozzolan: ^ [22:33:38] it probably is not running by the time the second one gets to grid [22:34:12] grid submission is not a cheap or quick operation [22:35:22] valerio-bozzolan: is that the exact command you sent? `jsub -once -N asd.sh` [22:35:46] (03PS2) 10Rosalieper: First set of pictures [labs/tools/Commons-twitter-bot] - 10https://gerrit.wikimedia.org/r/434272 [22:36:29] OK you are completly right. [22:37:03] I was calling the jsub calls asyncronously to try [22:37:14] that command seems to be missing the script itself [22:37:52] Yes there were a typo but now I'm in. [22:38:42] So the jsub need some seconds to be registered, after that, the system can handle the -once [22:39:03] it might be a lot more than seconds [22:39:15] But anyway less than 5 minutes e.g. [22:44:54] !help Hi! On Toolforge, is it possible to call "webservice restart" from a job submitted by jsub? Seems like /usr/local/bin/webservice is not available on the execution host. [22:44:54] danmichaelo: If you don't get a response in 15-30 minutes, please create a phabricator task -- https://phabricator.wikimedia.org/maniphest/task/edit/form/1/?projects=wmcs-team [22:45:20] danmichaelo: shouldn't be possible, not [22:46:28] if `webservice` didn't register the job it won't attempt to stop the job [22:46:42] do you need help stopping the job? [22:46:48] danmichaelo: ^ [22:49:02] Not sure if I understand, but I'm trying to make a script that restarts the webserver whenever it hangs (times out), which happens every now and then due to https://phabricator.wikimedia.org/T104799 [22:51:18] I think I misunderstood your original question [22:51:32] reading for in place of from [22:51:52] yeah, it's perhaps a somewhat unusual thing to do :) [22:54:51] yeah, I see it is indeed not copied to grid nodes [22:56:04] you might have some luck copying it to your tool's folder, which is mounted on all nodes through NFS, but I'm not sure this is a stable configuration [22:57:18] quick restarts of either grid engine jobs or kubernetes pods will not go over well [23:10:47] danmichaelo: grid non-web exec hosts are not configured as submit hosts [23:10:57] idk the reasoning [23:11:24] ha, true, thanks zhuyifei1999_ [23:12:16] chicocvenancio: k, thanks for checking! For now, I made the script just ping me instead since it only seems to happen every few weeks [23:12:19] https://phabricator.wikimedia.org/T123270 [23:12:29] (np) [23:12:44] but, is https://phabricator.wikimedia.org/T104799 grid or k8s? [23:13:08] k8s job control should work from grid... [23:13:26] (a weird concept as that may be) [23:14:02] kubectl is not there, but the credentials are in NFS [23:14:08] chicocvenancio: the task looks like grid, but it should apply to both afaict [23:15:00] no webservice or kubectl, but kubernetes doesn't really care were you send the commands from, as long as authed [23:15:24] * zhuyifei1999_ tries that on toolsbeta [23:21:01] indeed seems to work [23:21:25] at least no exceptions, except when webservice shell tries to invoke kubectl [23:22:18] it is a strange use-case, however, not sure we want to make it generally acessible as it is extremely error prone [23:23:48] eg. trying `webservice --backend=grid restart` or `webservice --backend=kubernetes shell` will fail, maybe in not so fun way