[02:18:15] 10Toolforge: Make it less cumbersome to bootstrap and update python webservices - https://phabricator.wikimedia.org/T174769#3599322 (10Legoktm) [02:19:46] PROBLEM - Puppet errors on tools-exec-1416 is CRITICAL: CRITICAL: 50.00% of data above the critical threshold [0.0] [02:54:45] RECOVERY - Puppet errors on tools-exec-1416 is OK: OK: Less than 1.00% above the threshold [0.0] [03:39:43] PROBLEM - Puppet errors on tools-exec-1408 is CRITICAL: CRITICAL: 30.00% of data above the critical threshold [0.0] [04:34:44] RECOVERY - Puppet errors on tools-exec-1408 is OK: OK: Less than 1.00% above the threshold [0.0] [05:02:18] PROBLEM - Puppet errors on tools-redis-1001 is CRITICAL: CRITICAL: 50.00% of data above the critical threshold [0.0] [05:42:19] RECOVERY - Puppet errors on tools-redis-1001 is OK: OK: Less than 1.00% above the threshold [0.0] [06:30:41] PROBLEM - Puppet errors on tools-exec-1434 is CRITICAL: CRITICAL: 30.00% of data above the critical threshold [0.0] [06:32:40] PROBLEM - Puppet errors on tools-exec-1440 is CRITICAL: CRITICAL: 22.22% of data above the critical threshold [0.0] [06:39:07] 10PAWS, 10cloud-services-team (Kanban), 10User-bd808: Not able to edit user-config.py file in PAWS - https://phabricator.wikimedia.org/T175167#3599457 (10Amishas157) @bd808 , I think problem is following: There are two dirs: /home /paws The dir where you created `user-config.py` is `/paws` which works for... [07:10:42] RECOVERY - Puppet errors on tools-exec-1434 is OK: OK: Less than 1.00% above the threshold [0.0] [07:12:38] RECOVERY - Puppet errors on tools-exec-1440 is OK: OK: Less than 1.00% above the threshold [0.0] [07:21:50] 10Tools, 10Wiki-Loves-Monuments: Redirect toolserver.org/~erfgoed/stream/ - https://phabricator.wikimedia.org/T175671#3599469 (10Nemo_bis) [07:54:11] 10Tools, 10Wiki-Loves-Monuments: Redirect toolserver.org/~erfgoed/stream/ - https://phabricator.wikimedia.org/T175671#3599469 (10JeanFred) Yeah, it’s not reaaaally a successor but it’s the closest thing I could find in the rTHER repository. :-( Shame that this tool was not migrated (although it does not sound... [08:07:16] Hi, Could someone please reintroduce this deleted page? https://wikitech.wikimedia.org/wiki/Nova_Resource:Wikitrust ? [08:07:38] we want to revive the project and need that page to reboostrap the project [08:09:15] Reedy: ? [08:33:52] 10Cloud-Services, 10Tools, 10Community-Tech-Tool-Labs, 10Developer-Relations, and 3 others: Create an authoritative and well promoted catalog of Wikimedia tools - https://phabricator.wikimedia.org/T115650#3599549 (10Qgil) In relation to {T158149}, I dare to ask: what is the current status? :) [09:10:34] In the beta cluster where is the centralnotice infastructure wiki. Meta or deployment? [12:37:40] PROBLEM - Puppet errors on tools-exec-1406 is CRITICAL: CRITICAL: 66.67% of data above the critical threshold [0.0] [12:38:16] (03PS2) 10Krinkle: api: Remove hardcoded shard list, prep for new wikidata shard (s8) [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/377447 [12:38:19] (03CR) 10Krinkle: [C: 032] api: Remove hardcoded shard list, prep for new wikidata shard (s8) [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/377447 (owner: 10Krinkle) [12:38:51] (03Merged) 10jenkins-bot: api: Remove hardcoded shard list, prep for new wikidata shard (s8) [labs/tools/guc] - 10https://gerrit.wikimedia.org/r/377447 (owner: 10Krinkle) [13:07:39] RECOVERY - Puppet errors on tools-exec-1406 is OK: OK: Less than 1.00% above the threshold [0.0] [13:38:11] (03PS1) 10Giuseppe Lavagetto: Add missing secrets [labs/private] - 10https://gerrit.wikimedia.org/r/377465 [13:38:46] (03CR) 10Giuseppe Lavagetto: [V: 032 C: 032] Add missing secrets [labs/private] - 10https://gerrit.wikimedia.org/r/377465 (owner: 10Giuseppe Lavagetto) [14:29:50] Kelson: You actually need the Cloud VPS project to be recreated. See https://phabricator.wikimedia.org/project/view/2875/ for the process of requesting a project. [14:33:38] bd808: ok, thx [14:44:18] yoooo doodlees [14:44:27] when making a new instance, i see a c1.m2.s80 option [14:44:32] in the Public column [14:44:34] it says 'no' [14:44:41] what does the public column mean when selecting an image? [14:46:04] ottomata: I think that column just means that the image flavor is not available to all projects in Cloud VPS. [14:46:10] ah ok [14:46:14] so its just in deployment prep [14:46:14] ok [14:47:15] thanks [14:56:21] 10Toolforge: Slow performance on toolforge project - https://phabricator.wikimedia.org/T175703#3600662 (10Fnielsen) [15:05:27] (03PS1) 10Jean-Frédéric: Split categorization out of daily update job [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377483 (https://phabricator.wikimedia.org/T174871) [15:10:55] (03CR) 10Zppix: [C: 031] Update Wikimedia-AI's irc config [labs/tools/wikibugs2] - 10https://gerrit.wikimedia.org/r/377364 (owner: 10Zppix) [15:11:21] 10Cloud-Services, 10Tools, 10Community-Tech-Tool-Labs, 10Developer-Relations, and 3 others: Create an authoritative and well promoted catalog of Wikimedia tools - https://phabricator.wikimedia.org/T115650#3600742 (10bd808) I would be happy to assist in some attempt at this problem, but I do not have the fr... [15:11:36] is andrewbogott around? [15:12:15] jynus: I'm here, what's up? [15:12:45] I will merge "soon" something like https://gerrit.wikimedia.org/r/377460 [15:13:00] that enables the firewall for m5 [15:13:35] normally that takes some downtime of the network, a few seconds (around 30) [15:13:56] ok — I think that should be fine [15:13:56] what's the damage for cloud services? [15:14:09] This won't work for a bit, I don't think anything will crash [15:14:17] and is there a best time [15:14:18] but if you notify me when you merge I can keep an eye out [15:14:27] sorry, I mean 'Things wont work for a bit' [15:14:27] I could do it very early in my morning [15:14:33] but maybe you prefer to be around [15:14:44] tell me what you prefer [15:14:48] We could do it in 45 minutes, otherwise about this time tomorrow would suit me as well [15:14:52] I would like to be here when you do it. [15:14:57] me too if you don't mind [15:14:58] let's do it tomorrow [15:15:02] tomorrow works for me [15:15:06] m1 is also affected [15:15:17] ok, so 15:00 UTC tomorrow? [15:15:24] yeah [15:15:33] andrewbogott: I'll add it to the team cal :) [15:15:39] actually, can we do 15:30? I might be in transit at 15:00 [15:15:59] wait [15:16:07] actually, tomorow is wednesday [15:16:20] lots of backups running [15:16:28] thu? [15:16:31] I may wait until next week on a monday or tuesday [15:16:35] k [15:16:42] I'm out Friday and Monday, otherwise don't care [15:17:01] 19th 15:30 utc? [15:17:12] 15:00, we'll be in a meeting at 15:30 :) [15:17:30] 19 is tuesday [15:17:45] cloud has a meeting on tuesdays jynus [15:17:47] yep, we have our meeting at 15:20 every Tuesday, starting in a few minutes [15:18:11] monday? [15:19:18] (03PS1) 10Jean-Frédéric: Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) [15:19:27] Tuesday 15:00 is best I think [15:19:41] ok [15:19:57] (03CR) 10jerkins-bot: [V: 04-1] Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) (owner: 10Jean-Frédéric) [15:19:59] I will prepare an m1 failover at that time, too [15:20:56] (03PS2) 10Jean-Frédéric: Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) [15:21:30] (03CR) 10jerkins-bot: [V: 04-1] Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) (owner: 10Jean-Frédéric) [15:22:02] jynus: sounds good, thanks for the warning :) [15:28:56] 10cloud-services-team (Kanban), 10DC-Ops, 10Operations, 10ops-eqiad: labvirt1015 crashes - https://phabricator.wikimedia.org/T171473#3600824 (10Cmjohnson) The CPU in slot 2 has been replaced and racadm log cleared. Please let me know if additional problems pop up. Return shipping info of old part USPS 92... [15:30:47] (03PS3) 10Jean-Frédéric: Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) [15:31:41] (03CR) 10jerkins-bot: [V: 04-1] Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) (owner: 10Jean-Frédéric) [15:36:04] (03PS4) 10Jean-Frédéric: Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) [15:36:35] (03CR) 10Merlijn van Deen: [C: 032] Update Wikimedia-AI's irc config [labs/tools/wikibugs2] - 10https://gerrit.wikimedia.org/r/377364 (owner: 10Zppix) [15:36:58] (03Merged) 10jenkins-bot: Update Wikimedia-AI's irc config [labs/tools/wikibugs2] - 10https://gerrit.wikimedia.org/r/377364 (owner: 10Zppix) [15:37:07] (03CR) 10jenkins-bot: Update Wikimedia-AI's irc config [labs/tools/wikibugs2] - 10https://gerrit.wikimedia.org/r/377364 (owner: 10Zppix) [15:52:46] (03PS5) 10Jean-Frédéric: Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) [15:53:40] 10Cloud-VPS (Project-requests), 10cloud-services-team: Create a project for Wikimedia Armenia - https://phabricator.wikimedia.org/T175567#3600882 (10bd808) Approved [15:55:03] 10Cloud-VPS (Quota-requests), 10Discovery, 10Wikidata, 10Wikidata-Query-Service: Request increased quota for wikidata-query labs project - https://phabricator.wikimedia.org/T175196#3600883 (10bd808) Approved [15:57:30] PROBLEM - Puppet errors on tools-proxy-02 is CRITICAL: CRITICAL: 44.44% of data above the critical threshold [0.0] [15:58:19] PROBLEM - Puppet errors on tools-bastion-05 is CRITICAL: CRITICAL: 44.44% of data above the critical threshold [0.0] [16:09:32] PROBLEM - Puppet errors on tools-bastion-02 is CRITICAL: CRITICAL: 44.44% of data above the critical threshold [0.0] [16:11:57] https://blog.wikimedia.org/2017/09/11/introducing-wikimedia-cloud-services/ [16:11:59] COOL! [16:12:10] Also, is there a new domain name to replace wmflabs.org? [16:13:04] PROBLEM - Puppet errors on tools-bastion-03 is CRITICAL: CRITICAL: 44.44% of data above the critical threshold [0.0] [16:13:19] (03PS6) 10Jean-Frédéric: Add Egypt in Arabic eg_ar [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377488 (https://phabricator.wikimedia.org/T174261) [16:13:29] halfak: AFAIK theres no plan to change the ssh stuff [16:14:35] I'm struggling with talking about "ores.wmflabs.org" as anything other than "The WMFLabs version" because of the domain. [16:14:53] I'm trying to be a good citizen and stop referring to stuff as "labs" [16:15:33] halfak: why not refer to it as pre-prod? [16:15:48] We call it "experimental" [16:15:55] It's not really part of a deploy train. [16:16:16] But it's the one that has "wmflabs" in the domain, while the other has "wikimedia" in the domain. [16:16:36] halfak: the Cloud VPS testing site [16:16:57] there is no WMFLabs despite what the URL says :) [16:19:07] :P So when is the new domain coming? [16:19:53] halfak: in January when we can get free wildcard certs [16:21:10] both mwcloud.org & toolforge.org will be a thing and there will be a really long phase out of the current domains [16:21:18] Oh cool. I'll be very happy to change our documentation at that time. I'll file a task now so I can keep track of the mess I'm making. [16:21:32] mwcloud? [16:21:36] mediawiki cloud? [16:21:44] wmcloud (sorry) [16:21:51] \o/ OK perfect [16:21:53] typing is hard :) [16:21:56] :D [16:22:00] mwmwmwmwmwmwwwmmwmwm [16:22:34] the dreaded "f" will be dropped too \o/ [16:23:10] it may actually be Q4 before we pull the trigger because of other changes that are coming [16:23:14] but it will happen [16:23:26] PROBLEM - Puppet errors on tools-proxy-01 is CRITICAL: CRITICAL: 55.56% of data above the critical threshold [0.0] [16:40:06] 10Tools: Need easier tool for working on redundancy than "Inhalte übernommen" Template Tool (german WP) - https://phabricator.wikimedia.org/T175698#3600535 (10Aklapper) [16:43:04] RECOVERY - Puppet errors on tools-bastion-03 is OK: OK: Less than 1.00% above the threshold [0.0] [16:43:28] RECOVERY - Puppet errors on tools-proxy-01 is OK: OK: Less than 1.00% above the threshold [0.0] [16:47:31] RECOVERY - Puppet errors on tools-proxy-02 is OK: OK: Less than 1.00% above the threshold [0.0] [16:48:38] (03PS1) 10Jean-Frédéric: Add Irak in Arabic (iq_ar) [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377508 (https://phabricator.wikimedia.org/T174340) [16:49:22] (03CR) 10Jean-Frédéric: "Tested locally in Docker, works." [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/377508 (https://phabricator.wikimedia.org/T174340) (owner: 10Jean-Frédéric) [16:49:30] RECOVERY - Puppet errors on tools-bastion-02 is OK: OK: Less than 1.00% above the threshold [0.0] [17:08:18] RECOVERY - Puppet errors on tools-bastion-05 is OK: OK: Less than 1.00% above the threshold [0.0] [17:14:39] 10Cloud-VPS, 10Operations-Software-Development: Install cumin in the WMCS infrastructure - https://phabricator.wikimedia.org/T175712#3601154 (10bd808) [17:14:57] 10Cloud-VPS, 10Operations-Software-Development: Cumin: create backend for OpenStack - https://phabricator.wikimedia.org/T175711#3601155 (10bd808) [17:36:20] 10PAWS, 10cloud-services-team (Kanban), 10User-bd808: Not able to edit user-config.py file in PAWS - https://phabricator.wikimedia.org/T175167#3601284 (10bd808) >>! In T175167#3599457, @Amishas157 wrote: > The dir where you created `user-config.py` is `/paws` which works for me as well. But creating it in `/... [17:41:33] 10PAWS, 10cloud-services-team (Kanban), 10User-bd808: Not able to edit user-config.py file in PAWS - https://phabricator.wikimedia.org/T175167#3601312 (10bd808) 05stalled>03Resolved Documentation updated: https://www.mediawiki.org/w/index.php?title=Manual%3APywikibot%2FPAWS&type=revision&diff=2561666&old... [18:22:59] PROBLEM - Puppet errors on tools-exec-1425 is CRITICAL: CRITICAL: 33.33% of data above the critical threshold [0.0] [18:48:07] 10Cloud-VPS, 10Operations-Software-Development: Install cumin in the WMCS infrastructure - https://phabricator.wikimedia.org/T175712#3601071 (10hashar) As a side effect, #beta-cluster-infrastructure and #continuous-integration-infrastructure would need a way to have a per project cumin master. We don't have a... [19:03:01] RECOVERY - Puppet errors on tools-exec-1425 is OK: OK: Less than 1.00% above the threshold [0.0] [19:11:21] home directory (~) vs /home directory; root directory (/) vs /root lol [19:12:03] labs vs labs vs labs [19:25:55] 10PAWS, 10cloud-services-team (Kanban), 10User-bd808: Not able to edit user-config.py file in PAWS - https://phabricator.wikimedia.org/T175167#3584797 (10zhuyifei1999) We should clarify somewhere (FAQ? Glossary? Terminology?) that: * by `home directory` we almost always mean your home directory, where your p... [19:31:33] !log rcm Neon: Installing security updates [19:31:35] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Rcm/SAL [19:33:54] (03PS1) 10Awight: Catch all scoring-platform tags. [labs/tools/wikibugs2] - 10https://gerrit.wikimedia.org/r/377539 [19:34:36] (03CR) 10Merlijn van Deen: [C: 032] Catch all scoring-platform tags. [labs/tools/wikibugs2] - 10https://gerrit.wikimedia.org/r/377539 (owner: 10Awight) [19:34:50] :D That was alarmingly fast. [19:35:54] (03Merged) 10jenkins-bot: Catch all scoring-platform tags. [labs/tools/wikibugs2] - 10https://gerrit.wikimedia.org/r/377539 (owner: 10Awight) [19:36:03] (03CR) 10jenkins-bot: Catch all scoring-platform tags. [labs/tools/wikibugs2] - 10https://gerrit.wikimedia.org/r/377539 (owner: 10Awight) [19:38:50] !log tools.wikibugs Updated channels.yaml to: 2867f29064269aa03542206d84bde09070551c9e Catch all scoring-platform tags. [20:00:26] zhuyifei1999_: yeah. this was a good reminder of the "command-line tax" of using unix things [20:01:19] lol [20:22:06] 10Cloud-VPS, 10VPS-Projects, 10Recommendation-API: Grant Bmansurov access to "Recommendation-api" Cloud VPS Project - https://phabricator.wikimedia.org/T175643#3602092 (10bmansurov) @bd808 thanks for chiming in. I can't get hold of either @schana or @ellery. Would one of the Cloud Services team members be ab... [20:23:08] (03CR) 10Hashar: "recheck" [labs/tools/ZppixBot] - 10https://gerrit.wikimedia.org/r/374710 (owner: 10Reception123) [20:26:05] 10Cloud-VPS, 10VPS-Projects, 10Recommendation-API: Grant Bmansurov access to "Recommendation-api" Cloud VPS Project - https://phabricator.wikimedia.org/T175643#3602124 (10Krenair) schana was on Phabricator under a week ago talking about a request they made in relation to this project. Is this urgent? [20:29:56] 10Cloud-VPS, 10VPS-Projects, 10Recommendation-API: Grant Bmansurov access to "Recommendation-api" Cloud VPS Project - https://phabricator.wikimedia.org/T175643#3602151 (10bmansurov) Kind of urgent because {T174739} is due the end of September, beginning of October. I also need to catch up on the project as I... [20:33:55] (03CR) 10Hashar: "recheck" [labs/tools/ZppixBot] - 10https://gerrit.wikimedia.org/r/374710 (owner: 10Reception123) [20:40:11] !log recommendation-api Added DarTar as project admin [20:40:13] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Recommendation-api/SAL [20:41:53] 10Cloud-VPS, 10VPS-Projects, 10Recommendation-API: Grant Bmansurov access to "Recommendation-api" Cloud VPS Project - https://phabricator.wikimedia.org/T175643#3602208 (10DarTar) 05Open>03Resolved a:03DarTar This has been taken care of, thanks @bd808 @Krenair. [20:43:04] 10Cloud-VPS, 10VPS-Projects, 10Recommendation-API: Grant Bmansurov access to "Recommendation-api" Cloud VPS Project - https://phabricator.wikimedia.org/T175643#3602226 (10Krenair) I did some searching around and it looks to me like DarTar's approval is good for this project [20:43:53] 10Cloud-VPS, 10VPS-Projects, 10Recommendation-API: Grant Bmansurov access to "Recommendation-api" Cloud VPS Project - https://phabricator.wikimedia.org/T175643#3598818 (10Krenair) *mumbles something about phabricator's conflict detection or lack thereof* [20:57:51] valhallasw`cloud: do changes auto deploy to wikibugs when merged? [22:03:55] bd808: I think what i was missing from yesterday is adding one entry here: https://wikitech.wikimedia.org/wiki/Hiera:Dashiki/host/dashiki-01 [22:05:00] nuria_: ah. role magic [22:05:09] bd808: indeed [22:05:35] bd808: how do those hiera chnages become effective? [22:05:44] bd808: puppet needs to run right? [22:05:48] correct [22:05:52] bd808: ok [22:06:01] bd808: does puppet agent -tv still work? [22:06:09] bd808: I am from teh past [22:06:11] *the [22:06:22] yes, that's the magic command :) [22:32:51] PROBLEM - Puppet staleness on tools-mail is CRITICAL: CRITICAL: 60.00% of data above the critical threshold [43200.0] [23:15:21] 10Toolforge, 10Tools: Response time from https://tools.wmflabs.org/scholia/ varies - https://phabricator.wikimedia.org/T175703#3602619 (10bd808) [23:29:45] 10Toolforge, 10Tools: Response time from https://tools.wmflabs.org/scholia/ varies - https://phabricator.wikimedia.org/T175703#3600662 (10bd808) Based on the contents of `/data/project/scholia/service.manifest`, this tool is running on Grid Engine. The first thing I would personally try is migrating it to run... [23:30:33] 10Tool-Article-request, 10Toolforge, 10User-Matthewrbowker: Articlerequest tool goes up and down often - https://phabricator.wikimedia.org/T175623#3602669 (10bd808) [23:43:11] 10Tool-Article-request, 10Toolforge, 10User-Matthewrbowker: Articlerequest tool goes up and down often - https://phabricator.wikimedia.org/T175623#3598142 (10bd808) 98% (9,838 of the last 10,000) of requests to the articlerequest tool are from http://www.uptimerobot.com/. Each hit by uptimerobot is actually... [23:43:46] 10Tool-Article-request, 10Toolforge, 10User-Matthewrbowker: Uptimerobot monitoring for the Articlerequest tool flaps - https://phabricator.wikimedia.org/T175623#3602721 (10bd808) [23:45:44] 10Tool-Article-request, 10Toolforge, 10User-Matthewrbowker: Uptimerobot monitoring for the Articlerequest tool flaps - https://phabricator.wikimedia.org/T175623#3598142 (10yuvipanda) From my experience with uptimerobot, it's quite flaky - we got one or two false positive alerts each day from it. I've switche...