[00:41:35] serviceops, Beta-Cluster-Infrastructure, Release Pipeline, Core Platform Team Backlog (Next), and 2 others: Migrate Beta cluster services to use Kubernetes - https://phabricator.wikimedia.org/T220235 (mobrovac) p:Normal→High Raising the prio and moving to //next// since we'll have to atta...
[00:50:16] serviceops, Operations, cloud-services-team: Change /r/p/ to /r/ on all hosts (where https://gerrit.wikimedia.org/r/p/ exists) - https://phabricator.wikimedia.org/T222093 (Dzahn)
[01:35:08] serviceops, Operations, cloud-services-team: Change /r/p/ to /r/ on all hosts (where https://gerrit.wikimedia.org/r/p/ exists) - https://phabricator.wikimedia.org/T222093 (Dzahn)
[01:35:32] serviceops, Operations, cloud-services-team: Change /r/p/ to /r/ on all hosts (where https://gerrit.wikimedia.org/r/p/ exists) - https://phabricator.wikimedia.org/T222093 (Dzahn)
[08:18:30] serviceops, Operations, Release Pipeline, Release-Engineering-Team, and 5 others: Introduce wikidata termbox SSR to kubernetes - https://phabricator.wikimedia.org/T220402 (Pablo-WMDE) @mobrovac During T221755 & T221754 we tended to [[ https://ssr-termbox.wmflabs.org/?spec | `/?spec` ]] and [[ htt...
[10:19:30] serviceops, Operations, User-Elukey: Renew certs for mcrouter on all application servers. - https://phabricator.wikimedia.org/T221346 (elukey) Looks sane to me!
[11:01:01] serviceops, Operations, Prod-Kubernetes, Kubernetes, and 2 others: migrate endpoint from old registry instance to new one - https://phabricator.wikimedia.org/T221101 (fsero)
[11:01:18] serviceops, Operations, Prod-Kubernetes, Kubernetes, and 2 others: migrate endpoint from old registry instance to new one - https://phabricator.wikimedia.org/T221101 (fsero)
[12:56:29] hiya fsero how's https://gerrit.wikimedia.org/r/c/operations/deployment-charts/+/506166 looking?
[14:11:54] serviceops, Operations, cloud-services-team: Change /r/p/ to /r/ on all hosts (where https://gerrit.wikimedia.org/r/p/ exists) - https://phabricator.wikimedia.org/T222093 (Paladox)
[14:12:12] serviceops, Operations, cloud-services-team: Change /r/p/ to /r/ on all hosts (where https://gerrit.wikimedia.org/r/p/ exists) - https://phabricator.wikimedia.org/T222093 (Paladox)
[14:39:58] ottomata: let me take a look and I'll get back to you
[14:43:08] thanks
[15:33:04] serviceops, Operations, Patch-For-Review, User-jijiki: Ramp up percentage of users on php7.2 to 100% on both API and appserver clusters - https://phabricator.wikimedia.org/T219150 (Jdforrester-WMF)
[15:33:24] serviceops, Operations, User-jijiki: Ramp up percentage of users on php7.2 to 100% on both API and appserver clusters - https://phabricator.wikimedia.org/T219150 (Jdforrester-WMF)
[16:05:33] Hi, I have someone asking about container operations, who can I direct him to?
[16:05:48] or just "anyone at service ops"?
[16:07:30] Why not tell that someone to join this channel
[16:07:33] And ask?
[16:07:48] I will
[16:08:03] Great :)
[16:08:39] serviceops, Operations: TEC3:Q4 Tracking task - https://phabricator.wikimedia.org/T220403 (thcipriani)
[16:12:06] serviceops, Operations, Release Pipeline, Release-Engineering-Team: Migrate ORES to kubernetes - https://phabricator.wikimedia.org/T220400 (thcipriani)
[16:12:42] serviceops, Gerrit, Operations, cloud-services-team: Change /r/p/ to /r/ on all hosts (where https://gerrit.wikimedia.org/r/p/ exists) - https://phabricator.wikimedia.org/T222093 (Paladox)
[16:12:49] serviceops, Operations, Release Pipeline, Release-Engineering-Team, and 2 others: TEC3:O3:O3.1:Q4 Goal - Move cpjobqueue, Wikidata Termbox SSR (new service), Kask (session storage service) and ORES (partially) through the production CD Pipeline - https://phabricator.wikimedia.org/T220398 (thcipria...
[16:14:20] serviceops, Release Pipeline, Release-Engineering-Team, Core Platform Team Backlog (Watching / External), Services (watching): TEC3:O3:O3.1:Q3 Goal - Move cxserver, citoid, changeprop, eventgate (new service) and ORES (partially) through the production ... - https://phabricator.wikimedia.org/T212801
[16:35:55] fsero: do you think it'll be possible for me to work on the eventgate chart stuff more today? :)
[16:36:07] https://www.irccloud.com/pastebin/hiJtZ0TK/
[16:36:27] the metrics-config ConfigMap i think is missing wmf.releasename
[16:36:43] while it is unlikely there are two of them i think it is more descriptive
[16:36:55] ottomata: you need to be patient with me :P
[17:18:49] fsero: ok i can add that
[17:18:59] wasn't sure, it wasn't really descriptive before (metrics-config-production?)
[17:19:05] but ya i agree more descriptive is better
[17:19:22] serviceops, Operations, Prod-Kubernetes, Kubernetes, User-fsero: placeholder task for migration problems - https://phabricator.wikimedia.org/T222210 (fsero)
[17:19:22] sorry for the impatience, I only get to overlap with you early in my day :)
[19:05:10] fsero: does
[19:05:11] CLUSTER=staging scap-helm eventgate-analytics install -n analytics stable/eventgate
[19:05:15] seem right to you?
[19:05:57] It does
[19:06:00] k
[19:06:06] oh with values files
[19:10:18] fsero: i'm going to deploy new chart service to staging, ok?
[19:10:59] Ok
[19:35:18] fsero: is there any worry that doing scap-helm eventgate-analytics
[19:35:26] will overlap namespaces with the currently deployed one?
[19:35:34] will that cause any weirdness?
[19:45:21] So… Node 6 is EOL as of the end of today, so we're dropping all the production services that haven't bumped to node 10 on the pipeline yet, right? ;-)
[19:58:52] btw volans, you fixed puppet6 support in your check_puppet run script today :)
[19:59:17] paladox: did I?
[19:59:21] yup
[19:59:29] puppet6 has different behaviour
[19:59:43] in terms of when it hits a failure, it no longer appears to set "failed"
[19:59:54] i filed that bug upstream somewhere.
[20:00:12] apparently it was already doing that, just in some rare case :)
[20:01:01] oh, it's worse in puppet6.
[20:01:30] volans https://tickets.puppetlabs.com/browse/PUP-9396
[20:02:55] mmmh I'm not sure my patch fixes it though, as I guess the total resources counter would still report something != 0
[20:03:17] if the failure is in the middle of the run
[20:03:28] I guess when failing to download the catalog it might just work
[20:05:30] volans it does :)
[20:05:45] if you make a syntax error, the puppet check says puppet is ok
[20:05:48] even though it failed
[20:06:31] but with your fix, it says "CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle."
[23:14:44] serviceops, Operations, Release Pipeline, Release-Engineering-Team, and 5 others: Introduce wikidata termbox SSR to kubernetes - https://phabricator.wikimedia.org/T220402 (mobrovac) >>! In T220402#5146064, @Pablo-WMDE wrote: > @mobrovac During T221755 & T221754 we tended to [[ https://ssr-termbox...
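
Editor's note on the 19:58 to 20:06 puppet6 exchange: the check logic described there might look roughly like the sketch below. This is not the actual check_puppet run script. The function name `check_puppet_run`, the dict layout (a `resources` section with `total` and `failed` counters) and the wording of the non-zero-failure message are assumptions made for illustration. Only the zero-resources message is quoted from the log.

```python
from typing import Dict, Tuple

# Nagios-style status codes (assumed convention for this sketch).
OK, CRITICAL = 0, 2


def check_puppet_run(summary: Dict[str, Dict[str, int]]) -> Tuple[int, str]:
    """Classify a puppet run from a (hypothetical) summary dict."""
    resources = summary.get("resources", {})
    total = resources.get("total", 0)
    failed = resources.get("failed", 0)

    # Case from the log: the catalog never applied (for example after a
    # syntax error), "failed" is not set, but no resources were tracked.
    if total == 0:
        return CRITICAL, (
            "CRITICAL: Failed to apply catalog, zero resources tracked "
            "by Puppet. It might be a dependency cycle."
        )

    # Ordinary failure reporting; this message text is made up here.
    if failed > 0:
        return CRITICAL, f"CRITICAL: {failed} failed resources"

    return OK, "OK: puppet run completed"


if __name__ == "__main__":
    # A run that tracked nothing is flagged even though failed == 0.
    print(check_puppet_run({"resources": {"total": 0, "failed": 0}}))
    # A normal run passes.
    print(check_puppet_run({"resources": {"total": 120, "failed": 0}}))
```

As noted at 20:02, a zero-resources test alone would not catch a failure in the middle of a run that leaves the total counter non-zero and "failed" unset.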