[00:55:16] PROBLEM - proton endpoints health on proton1002 is CRITICAL: /{domain}/v1/pdf/{title}/{format}/{type} (Respond file not found for a nonexistent title) is CRITICAL: Test Respond file not found for a nonexistent title returned the unexpected status 503 (expecting: 404)
[00:56:17] RECOVERY - proton endpoints health on proton1002 is OK: All endpoints are healthy
[03:27:36] PROBLEM - MariaDB Slave Lag: s1 on dbstore1002 is CRITICAL: CRITICAL slave_sql_lag Replication lag: 819.08 seconds
[03:32:57] RECOVERY - MariaDB Slave Lag: s1 on dbstore1002 is OK: OK slave_sql_lag Replication lag: 268.96 seconds
[07:14:46] PROBLEM - proton endpoints health on proton2002 is CRITICAL: /{domain}/v1/pdf/{title}/{format}/{type} (Print the Bar page from en.wp.org in A4 format using optimized for reading on mobile devices) is CRITICAL: Test Print the Bar page from en.wp.org in A4 format using optimized for reading on mobile devices returned the unexpected status 503 (expecting: 200)
[07:15:47] RECOVERY - proton endpoints health on proton2002 is OK: All endpoints are healthy
[08:11:46] PROBLEM - proton endpoints health on proton1001 is CRITICAL: /{domain}/v1/pdf/{title}/{format}/{type} (Print the Foo page from en.wp.org in letter format) is CRITICAL: Test Print the Foo page from en.wp.org in letter format returned the unexpected status 503 (expecting: 200)
[08:12:47] RECOVERY - proton endpoints health on proton1001 is OK: All endpoints are healthy
[11:17:43] (PS1) Urbanecm: Autoconfirmed should require 10 edits&4 days on zhwikivoyage [mediawiki-config] - https://gerrit.wikimedia.org/r/441592 (https://phabricator.wikimedia.org/T198006)
[11:22:29] (PS1) Urbanecm: Add namespace aliases to zhwikivoyage [mediawiki-config] - https://gerrit.wikimedia.org/r/441593 (https://phabricator.wikimedia.org/T198007)
[13:27:17] PROBLEM - Esams HTTP 5xx reqs/min on graphite1001 is CRITICAL: CRITICAL: 30.00% of data above the critical threshold [1000.0] https://grafana.wikimedia.org/dashboard/file/varnish-aggregate-client-status-codes.json?panelId=3&fullscreen&orgId=1&var-site=esams&var-cache_type=All&var-status_type=5
[13:27:37] PROBLEM - Misc HTTP 5xx reqs/min on graphite1001 is CRITICAL: CRITICAL: 30.00% of data above the critical threshold [1000.0] https://grafana.wikimedia.org/dashboard/file/varnish-aggregate-client-status-codes.json?panelId=3&fullscreen&orgId=1&var-site=All&var-cache_type=misc&var-status_type=5
[13:30:15] seems already resolved, query.wikidata.org --^
[13:35:57] RECOVERY - Esams HTTP 5xx reqs/min on graphite1001 is OK: OK: Less than 1.00% above the threshold [250.0] https://grafana.wikimedia.org/dashboard/file/varnish-aggregate-client-status-codes.json?panelId=3&fullscreen&orgId=1&var-site=esams&var-cache_type=All&var-status_type=5
[13:36:17] RECOVERY - Misc HTTP 5xx reqs/min on graphite1001 is OK: OK: Less than 1.00% above the threshold [250.0] https://grafana.wikimedia.org/dashboard/file/varnish-aggregate-client-status-codes.json?panelId=3&fullscreen&orgId=1&var-site=All&var-cache_type=misc&var-status_type=5
[17:14:37] PROBLEM - exim queue on mx1001 is CRITICAL: CRITICAL: 3593 mails in exim queue.
[17:22:16] herron ^^
[17:40:50] be back after vacation
[19:14:56] PROBLEM - proton endpoints health on proton2002 is CRITICAL: /{domain}/v1/pdf/{title}/{format}/{type} (Print the Foo page from en.wp.org in letter format) is CRITICAL: Test Print the Foo page from en.wp.org in letter format returned the unexpected status 503 (expecting: 200)
[19:17:06] RECOVERY - proton endpoints health on proton2002 is OK: All endpoints are healthy
[20:07:27] PROBLEM - proton endpoints health on proton1002 is CRITICAL: /{domain}/v1/pdf/{title}/{format}/{type} (Respond file not found for a nonexistent title) is CRITICAL: Test Respond file not found for a nonexistent title returned the unexpected status 503 (expecting: 404)
[20:08:36] RECOVERY - proton endpoints health on proton1002 is OK: All endpoints are healthy
[20:10:26] PROBLEM - proton endpoints health on proton2002 is CRITICAL: /{domain}/v1/pdf/{title}/{format}/{type} (Print the Foo page from en.wp.org in letter format) is CRITICAL: Test Print the Foo page from en.wp.org in letter format returned the unexpected status 503 (expecting: 200)
[20:11:36] RECOVERY - proton endpoints health on proton2002 is OK: All endpoints are healthy
[22:03:16] PROBLEM - proton endpoints health on proton1002 is CRITICAL: /{domain}/v1/pdf/{title}/{format}/{type} (Print the Foo page from en.wp.org in letter format) is CRITICAL: Test Print the Foo page from en.wp.org in letter format returned the unexpected status 503 (expecting: 200)
[22:04:17] RECOVERY - proton endpoints health on proton1002 is OK: All endpoints are healthy
[22:45:17] RECOVERY - exim queue on mx1001 is OK: OK: Less than 1000 mails in exim queue.