[00:00:39] !log rmoen Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 11s) [00:00:43] Mjbmr: Done [00:00:46] Logged the message, Master [00:00:49] Thanks! [00:00:56] Mjbmr: thank you [00:01:03] np [00:01:04] My first schema change [00:01:22] RoanKattouw: cheers [00:01:39] rmoen: Did you run the population script as well? [00:01:43] uh [00:01:44] oops [00:01:52] no [00:02:15] RoanKattouw: where to run populateShortUrlTable.php from ? [00:02:29] [16:58] RoanKattouw Then do mwscript extensions/ShortUrl/populateShortUrlTable.php --wiki=eswikibooks [00:02:32] oh [00:02:33] You can run that command from anywhere [00:02:40] lol [00:02:46] Done [00:02:50] Awesome [00:02:52] * RoanKattouw high-fives rmoen [00:03:26] 6operations, 6WMF-Legal, 6WMF-NDA-Requests: Add multichill to WMF-NDA group - https://phabricator.wikimedia.org/T87097#1271100 (10Dzahn) 5Open>3Resolved Thank you! It's done then. You are also in the NDA LDAP group which lets you login on icinga, graphite, etc. [00:04:01] rmoen: see RoanKattouw's comment: https://wikitech.wikimedia.org/wiki/How_to_do_a_schema_change [00:05:44] haha that's old [00:05:49] Mjbmr: I've read this just have never done it until now. [00:05:54] You can tell from the comment that was before I lived in SF [00:05:57] I've lived here for 3 years now [00:06:12] Before you became a native english speaker [00:06:14] :) [00:41:52] 6operations, 6Labs, 10Tool-Labs, 7Monitoring: Add catchall tests for toollabs to catchpoint - https://phabricator.wikimedia.org/T97321#1271185 (10yuvipanda) 5Open>3declined a:3yuvipanda Superseeded by T97748 and friends. We're having fine grained tests there rather than catchall ones. [00:56:15] (03CR) 10BryanDavis: "Over 24 hours with groups 0+1 logging to logstash and everything looks great. Monday morning seems like a good time to start logging every" [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209172 (https://phabricator.wikimedia.org/T88732) (owner: 10BryanDavis) [02:16:36] PROBLEM - puppet last run on mw2157 is CRITICAL puppet fail [02:24:25] !log l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 06m 06s) [02:24:36] Logged the message, Master [02:27:14] hmm, can anyone help get a new project-logo uploaded? It looks like in ori's change today the wikimania2015 one went missing [02:27:25] when it was just added a day or two ago and they are trying to launch Registration [02:27:36] so getting it uploaded asap would be really really nice :) [02:27:41] since they just emailed me freaked out [02:29:10] !log LocalisationUpdate completed (1.26wmf4) at 2015-05-08 02:28:07+00:00 [02:29:22] Logged the message, Master [02:34:35] RECOVERY - puppet last run on mw2157 is OK Puppet is currently enabled, last run 24 seconds ago with 0 failures [02:44:59] !log l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 47s) [02:45:13] Logged the message, Master [02:49:18] !log LocalisationUpdate completed (1.26wmf5) at 2015-05-08 02:48:15+00:00 [02:49:28] Logged the message, Master [02:51:00] 6operations, 6Labs: Investigate ways of getting off raid6 for labs store - https://phabricator.wikimedia.org/T96063#1271404 (10coren) >>! In T96063#1268232, @mark wrote: > With the current stability and performance problems of NFS with RAID6, this is definitely not a "nice to have" but something that needs to... [02:59:26] PROBLEM - puppet last run on eeden is CRITICAL puppet fail [03:17:26] RECOVERY - puppet last run on eeden is OK Puppet is currently enabled, last run 1 minute ago with 0 failures [03:24:27] jamesofur: still need help? [03:24:33] aye [03:25:36] jamesofur: what project and what should the logo be? [03:25:56] https://wikimania2015.wikimedia.org/wiki/Main_Page and https://wikimania2015.wikimedia.org/wiki/File:Wiki.svg [03:29:37] hmm [03:30:51] oh heh [03:30:52] I see [03:31:38] (03PS1) 10Legoktm: Use png for wikimania2015wiki logo [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209675 [03:32:03] (03CR) 10Legoktm: [C: 032] Use png for wikimania2015wiki logo [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209675 (owner: 10Legoktm) [03:32:08] (03Merged) 10jenkins-bot: Use png for wikimania2015wiki logo [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209675 (owner: 10Legoktm) [03:33:06] !log legoktm Synchronized w/static/images/project-logos/wikimania2015wiki.png: Use png for wikimania2015wiki logo (duration: 00m 12s) [03:33:15] Logged the message, Master [03:33:21] * jamesofur sees it [03:33:26] woot [04:03:46] legoktm, Jamesofur|cloud: thank you [04:14:15] (03PS1) 10KartikMistry: Fix wikiname: roa-rupwiki -> roa_rupwiki [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209676 [04:14:37] Can anyone quickly fix ^^ [04:15:24] (03CR) 10Ori.livneh: [C: 032] Fix wikiname: roa-rupwiki -> roa_rupwiki [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209676 (owner: 10KartikMistry) [04:15:30] (03Merged) 10jenkins-bot: Fix wikiname: roa-rupwiki -> roa_rupwiki [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209676 (owner: 10KartikMistry) [04:15:59] ori: thanks! [04:16:39] !log ori Synchronized wmf-config/InitialiseSettings.php: I4c70ce4d0: Fix wikiname: roa-rupwiki -> roa_rupwiki (duration: 00m 12s) [04:16:50] Logged the message, Master [04:17:48] kart_: your turn! https://github.com/wikimedia/jquery.uls/pull/186 [04:17:49] ;) [04:17:54] (i'm kidding, it's not urgent) [04:18:05] :) [04:30:16] (03PS4) 10MZMcBride: Rsyncing slow-parse logs from fluorine to dumps.wikimedia.org [puppet] - 10https://gerrit.wikimedia.org/r/49678 (https://phabricator.wikimedia.org/T98563) (owner: 10Ottomata) [04:30:20] (03PS5) 10MZMcBride: Rsyncing slow-parse logs from fluorine to dumps.wikimedia.org [puppet] - 10https://gerrit.wikimedia.org/r/49678 (https://phabricator.wikimedia.org/T98563) (owner: 10Ottomata) [04:32:24] (03CR) 10MZMcBride: "Huh, I got some "database connection wasn't allocated" error message from Gerrit after adding https://phabricator.wikimedia.org/T98563 to " [puppet] - 10https://gerrit.wikimedia.org/r/49678 (https://phabricator.wikimedia.org/T98563) (owner: 10Ottomata) [04:37:08] (03Abandoned) 10KartikMistry: Use dblist for contenttranslation [mediawiki-config] - 10https://gerrit.wikimedia.org/r/199576 (owner: 10KartikMistry) [04:38:08] (03PS7) 10KartikMistry: Add initial Debian packaging for apertium-dan-nor [debs/contenttranslation/apertium-dan-nor] - 10https://gerrit.wikimedia.org/r/195905 (https://phabricator.wikimedia.org/T91493) [04:41:40] (03PS1) 10BryanDavis: beta: Remove custom apache2 log config [puppet] - 10https://gerrit.wikimedia.org/r/209680 (https://phabricator.wikimedia.org/T98289) [04:41:46] yuvipanda: ^ [04:42:00] looking [04:42:17] bd808: are you cherry picking or should I just merge? [04:42:21] it’s clearly beta only so I don’t mind [04:42:41] hmm... maybe I should cherry-pick first [04:42:50] easier to undo if needed [04:43:02] bd808: the only other commit I made today required two follow up commits to fix, so maybe yeah do cherry pick :) [04:43:39] git to keep that commit count high! I'm only the #17 committer to ops/puppet [04:43:47] according to github anyway [04:44:56] :P [04:54:59] (03CR) 10Yuvipanda: [C: 032] beta: Remove custom apache2 log config [puppet] - 10https://gerrit.wikimedia.org/r/209680 (https://phabricator.wikimedia.org/T98289) (owner: 10BryanDavis) [04:55:44] bd808: done [04:56:13] cool [05:04:41] (03PS1) 10Andrew Bogott: Rename CONF.libvirt_images_type to CONF.libvirt.images_type [puppet] - 10https://gerrit.wikimedia.org/r/209681 [05:05:31] (03PS1) 10BryanDavis: beta: fix symlinks in docroot/bits/static/master [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209682 [05:05:46] (03CR) 10Andrew Bogott: [C: 032] Rename CONF.libvirt_images_type to CONF.libvirt.images_type [puppet] - 10https://gerrit.wikimedia.org/r/209681 (owner: 10Andrew Bogott) [05:09:18] (03PS1) 10OliverKeyes: Add fluorine rsync connector [puppet] - 10https://gerrit.wikimedia.org/r/209684 [05:09:56] PROBLEM - puppet last run on palladium is CRITICAL puppet fail [05:10:37] (03PS2) 10OliverKeyes: Add fluorine rsync connector [puppet] - 10https://gerrit.wikimedia.org/r/209684 [05:14:27] !log LocalisationUpdate ResourceLoader cache refresh completed at Fri May 8 05:13:23 UTC 2015 (duration 13m 22s) [05:14:40] Logged the message, Master [05:16:26] RECOVERY - puppet last run on palladium is OK Puppet is currently enabled, last run 9 seconds ago with 0 failures [06:03:12] (03PS6) 10KartikMistry: CX: Use RESTBase API for page fetch [puppet] - 10https://gerrit.wikimedia.org/r/207378 [06:13:52] (03PS8) 10Yuvipanda: [WIP]mesos: Add simple mesos module [puppet] - 10https://gerrit.wikimedia.org/r/208483 [06:21:09] (03PS9) 10Yuvipanda: [WIP]mesos: Add simple mesos module [puppet] - 10https://gerrit.wikimedia.org/r/208483 [06:28:35] (03PS10) 10Yuvipanda: [WIP]mesos: Add simple mesos module [puppet] - 10https://gerrit.wikimedia.org/r/208483 [06:29:07] PROBLEM - puppet last run on db1034 is CRITICAL puppet fail [06:29:14] (03CR) 10jenkins-bot: [V: 04-1] [WIP]mesos: Add simple mesos module [puppet] - 10https://gerrit.wikimedia.org/r/208483 (owner: 10Yuvipanda) [06:32:15] PROBLEM - puppet last run on ms-fe2001 is CRITICAL Puppet has 1 failures [06:32:16] PROBLEM - puppet last run on mw2134 is CRITICAL Puppet has 1 failures [06:32:25] PROBLEM - puppet last run on mw2016 is CRITICAL Puppet has 1 failures [06:32:26] PROBLEM - puppet last run on cp4014 is CRITICAL Puppet has 1 failures [06:33:36] PROBLEM - puppet last run on mw1170 is CRITICAL Puppet has 1 failures [06:33:55] PROBLEM - puppet last run on mw2212 is CRITICAL Puppet has 1 failures [06:33:55] PROBLEM - puppet last run on mw2184 is CRITICAL Puppet has 1 failures [06:34:05] PROBLEM - puppet last run on mw2045 is CRITICAL Puppet has 1 failures [06:34:06] PROBLEM - puppet last run on mw1251 is CRITICAL Puppet has 2 failures [06:34:45] PROBLEM - puppet last run on mw2003 is CRITICAL Puppet has 1 failures [06:35:06] PROBLEM - puppet last run on mw2136 is CRITICAL Puppet has 1 failures [06:42:05] 6operations, 5Patch-For-Review: Scale up and out our puppetmaster infrastructure - https://phabricator.wikimedia.org/T98128#1271725 (10Joe) @chasemp we do exported resources, which don't really work in a masterless config [06:46:45] RECOVERY - puppet last run on mw1170 is OK Puppet is currently enabled, last run 33 seconds ago with 0 failures [06:46:56] RECOVERY - puppet last run on ms-fe2001 is OK Puppet is currently enabled, last run 1 minute ago with 0 failures [06:46:56] RECOVERY - puppet last run on mw2212 is OK Puppet is currently enabled, last run 32 seconds ago with 0 failures [06:46:57] RECOVERY - puppet last run on mw2134 is OK Puppet is currently enabled, last run 21 seconds ago with 0 failures [06:47:05] RECOVERY - puppet last run on db1034 is OK Puppet is currently enabled, last run 48 seconds ago with 0 failures [06:47:06] RECOVERY - puppet last run on mw2016 is OK Puppet is currently enabled, last run 38 seconds ago with 0 failures [06:47:06] RECOVERY - puppet last run on mw2045 is OK Puppet is currently enabled, last run 8 seconds ago with 0 failures [06:47:06] RECOVERY - puppet last run on mw1251 is OK Puppet is currently enabled, last run 16 seconds ago with 0 failures [06:47:06] RECOVERY - puppet last run on cp4014 is OK Puppet is currently enabled, last run 20 seconds ago with 0 failures [06:47:46] RECOVERY - puppet last run on mw2003 is OK Puppet is currently enabled, last run 46 seconds ago with 0 failures [06:48:15] RECOVERY - puppet last run on mw2136 is OK Puppet is currently enabled, last run 56 seconds ago with 0 failures [06:48:35] RECOVERY - puppet last run on mw2184 is OK Puppet is currently enabled, last run 1 minute ago with 0 failures [06:52:09] (03PS11) 10Yuvipanda: [WIP]mesos: Add simple mesos module [puppet] - 10https://gerrit.wikimedia.org/r/208483 [07:22:51] 6operations, 6Release-Engineering, 10Wikimedia-Language-setup, 10Wikimedia-Site-requests: nb subdomain redirects - https://phabricator.wikimedia.org/T86924#1271748 (10jayvdb) [07:26:07] ACKNOWLEDGEMENT - Unmerged changes on repository puppet on rhodium is CRITICAL: There are 48 unmerged changes in puppet (dir /var/lib/git/operations/puppet). alexandros kosiaris still being onlined. Ignore [07:26:07] ACKNOWLEDGEMENT - puppetmaster backend https on rhodium is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 8141: HTTP/1.1 500 Internal Server Error alexandros kosiaris still being onlined. Ignore [07:36:05] PROBLEM - puppet last run on einsteinium is CRITICAL Puppet last ran 4 hours ago [07:42:49] 6operations: Upgrade salt to 2014.7 (investigating) - https://phabricator.wikimedia.org/T88971#1271786 (10ArielGlenn) [07:44:47] 6operations, 6Labs: upgrade salt in labs - https://phabricator.wikimedia.org/T98578#1271787 (10ArielGlenn) 3NEW a:3ArielGlenn [07:52:06] PROBLEM - DPKG on sca1002 is CRITICAL: DPKG CRITICAL dpkg reports broken packages [07:53:45] RECOVERY - DPKG on sca1002 is OK: All packages OK [08:04:09] 6operations, 6Labs: upgrade salt in labs - https://phabricator.wikimedia.org/T98578#1271817 (10ArielGlenn) [08:04:11] 6operations: Upgrade salt to 2014.7 (investigating) - https://phabricator.wikimedia.org/T88971#1271816 (10ArielGlenn) [08:05:02] 6operations: Upgrade salt to 2014.7 (investigating) - https://phabricator.wikimedia.org/T88971#1024437 (10ArielGlenn) [08:05:03] 6operations, 6Labs: upgrade salt in labs - https://phabricator.wikimedia.org/T98578#1271819 (10ArielGlenn) 5Open>3Resolved This has been done. Some instances were skipped, a full list follows. 1) Instances that were shut off at the time: Instance: i-000000fd Status: SHUTOFF hostname: mwreview-merl Insta... [08:05:56] 6operations, 6Labs: upgrade salt in labs - https://phabricator.wikimedia.org/T98578#1271822 (10ArielGlenn) [08:08:00] 6operations: Upgrade salt to 2014.7 (investigating) - https://phabricator.wikimedia.org/T88971#1271835 (10ArielGlenn) [08:13:18] 6operations: fix trebuchet-trigger (git deploy) publish.runner arguments - https://phabricator.wikimedia.org/T98581#1271844 (10ArielGlenn) 3NEW [08:14:02] 6operations: fix trebuchet-trigger (git deploy) publish.runner arguments - https://phabricator.wikimedia.org/T98581#1271851 (10ArielGlenn) a:3ArielGlenn [08:14:12] 6operations: fix trebuchet-trigger (git deploy) publish.runner arguments - https://phabricator.wikimedia.org/T98581#1271844 (10ArielGlenn) [08:16:49] 6operations: Upgrade salt to 2014.7 (investigating) - https://phabricator.wikimedia.org/T88971#1271855 (10ArielGlenn) Close to done. See the blocking tasks. [08:30:37] 6operations, 6Labs, 10Labs-Infrastructure: Investigate ways of getting off raid6 for labs store - https://phabricator.wikimedia.org/T96063#1271872 (10mark) [08:32:07] 6operations, 6Labs, 10Labs-Infrastructure: Investigate ways of getting off raid6 for labs store - https://phabricator.wikimedia.org/T96063#1207452 (10mark) @Coren: Where can I see the mapping of raid array (md125 etc) to shelf? Is this documented? [08:33:35] PROBLEM - Disk space on analytics1017 is CRITICAL: DISK CRITICAL - free space: /var/lib/hadoop/data/b 81128 MB (4% inode=99%): /var/lib/hadoop/data/c 81178 MB (4% inode=99%): /var/lib/hadoop/data/d 83091 MB (4% inode=99%): /var/lib/hadoop/data/e 74765 MB (3% inode=99%): /var/lib/hadoop/data/f 83114 MB (4% inode=99%): /var/lib/hadoop/data/g 83387 MB (4% inode=99%): /var/lib/hadoop/data/h 82980 MB (4% inode=99%): /var/lib/hadoop/data/i [08:34:04] 6operations, 6Labs, 10Labs-Infrastructure: Migrate Labs NFS storage from RAID6 to RAID10 - https://phabricator.wikimedia.org/T96063#1271874 (10mark) [08:37:29] PROBLEM - Disk space on analytics1014 is CRITICAL: DISK CRITICAL - free space: /var/lib/hadoop/data/g 78636 MB (4% inode=99%): /var/lib/hadoop/data/c 82689 MB (4% inode=99%): /var/lib/hadoop/data/e 79088 MB (4% inode=99%): /var/lib/hadoop/data/f 82575 MB (4% inode=99%): /var/lib/hadoop/data/h 80635 MB (4% inode=99%): /var/lib/hadoop/data/l 83500 MB (4% inode=99%): /var/lib/hadoop/data/b 77944 MB (4% inode=99%): /var/lib/hadoop/data/k [08:46:18] (03CR) 10Alexandros Kosiaris: [C: 032] puppetmaster: DRY on inclusion of hiera configuration [puppet] - 10https://gerrit.wikimedia.org/r/209272 (owner: 10Alexandros Kosiaris) [08:46:39] (03CR) 10Alexandros Kosiaris: [C: 032] puppetmaster: Move backups to the role class [puppet] - 10https://gerrit.wikimedia.org/r/209271 (owner: 10Alexandros Kosiaris) [08:46:54] (03CR) 10Alexandros Kosiaris: [C: 032] puppetmaster::logstash. Avoid out of module dependencies [puppet] - 10https://gerrit.wikimedia.org/r/209270 (owner: 10Alexandros Kosiaris) [08:47:23] (03CR) 10Alexandros Kosiaris: [C: 032] puppetmaster: latest to present [puppet] - 10https://gerrit.wikimedia.org/r/209269 (owner: 10Alexandros Kosiaris) [08:47:53] (03CR) 10Alexandros Kosiaris: [C: 032] puppetmaster::gitpuppet lint cleanups [puppet] - 10https://gerrit.wikimedia.org/r/209268 (owner: 10Alexandros Kosiaris) [08:52:05] (03CR) 10Alexandros Kosiaris: [C: 032] puppetmaster::config. Minor lints [puppet] - 10https://gerrit.wikimedia.org/r/209266 (owner: 10Alexandros Kosiaris) [08:54:44] (03CR) 10Alexandros Kosiaris: [C: 032 V: 032] Add initial Debian packaging for apertium-dan-nor [debs/contenttranslation/apertium-dan-nor] - 10https://gerrit.wikimedia.org/r/195905 (https://phabricator.wikimedia.org/T91493) (owner: 10KartikMistry) [08:55:57] (03CR) 10Alexandros Kosiaris: [C: 032 V: 032] Added initial Debian package for apertium-kaz [debs/contenttranslation/apertium-kaz] - 10https://gerrit.wikimedia.org/r/209463 (https://phabricator.wikimedia.org/T95876) (owner: 10KartikMistry) [09:11:53] 6operations, 10Datasets-General-or-Unknown, 10Wikidata, 3Wikidata-Sprint-2015-04-07, and 2 others: Wikidata dumps contain old-style serialization. - https://phabricator.wikimedia.org/T74348#1271902 (10daniel) The double-check didn't turn anything up either. The dump seems to be clean. [09:14:39] 6operations, 10Datasets-General-or-Unknown, 10Wikidata, 3Wikidata-Sprint-2015-04-07, and 2 others: Wikidata dumps contain old-style serialization. - https://phabricator.wikimedia.org/T74348#1271909 (10ArielGlenn) Great news! [09:18:06] 6operations: snaphot1004 running dumps very slowly, investigate - https://phabricator.wikimedia.org/T98585#1271915 (10ArielGlenn) 3NEW a:3ArielGlenn [09:20:30] 6operations: snaphot1004 running dumps very slowly, investigate - https://phabricator.wikimedia.org/T98585#1271929 (10ArielGlenn) Suspect memory leak in dumpBackup.php and/or all the stuff it uses from mw core. Running the command by hand instead of from the wrapper script shows that it is indeed somewhere in th... [09:26:46] PROBLEM - puppet last run on dbstore2001 is CRITICAL puppet fail [09:39:27] !log uploaded to apt.wikimedia.org jessie-wikimedia: apertium-dan-nor_1.0.0~r48173-1 [09:39:27] !log uploaded to apt.wikimedia.org jessie-wikimedia: apertium-kaz_0.1.0~r60155-1 [09:39:36] Logged the message, Master [09:39:45] Logged the message, Master [09:43:05] RECOVERY - puppet last run on dbstore2001 is OK Puppet is currently enabled, last run 4 seconds ago with 0 failures [09:47:36] (03PS2) 10Dereckson: Alphabetical order for groupOverrides [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209286 [09:47:43] (03CR) 10jenkins-bot: [V: 04-1] Alphabetical order for groupOverrides [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209286 (owner: 10Dereckson) [09:51:13] (03PS3) 10Dereckson: Alphabetical order for groupOverrides [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209286 [09:51:52] (03CR) 10Dereckson: [C: 031] "PS3: rebased." [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209286 (owner: 10Dereckson) [10:06:39] (03PS1) 10Dereckson: Add flood user group on ca.wikinews [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209699 (https://phabricator.wikimedia.org/T98576) [10:26:16] PROBLEM - HTTP 5xx req/min on graphite1001 is CRITICAL 35.71% of data above the critical threshold [500.0] [10:40:56] RECOVERY - HTTP 5xx req/min on graphite1001 is OK Less than 1.00% above the threshold [250.0] [10:42:39] (03PS12) 10Giuseppe Lavagetto: etcd: create puppet module [puppet] - 10https://gerrit.wikimedia.org/r/208928 (https://phabricator.wikimedia.org/T97973) [10:48:19] (03PS13) 10Giuseppe Lavagetto: etcd: create puppet module [puppet] - 10https://gerrit.wikimedia.org/r/208928 (https://phabricator.wikimedia.org/T97973) [10:52:25] PROBLEM - Varnishkafka Delivery Errors per minute on cp4017 is CRITICAL 11.11% of data above the critical threshold [20000.0] [10:54:37] Wow.. 21.3 million this morning. [10:55:36] RECOVERY - Varnishkafka Delivery Errors per minute on cp4017 is OK Less than 1.00% above the threshold [0.0] [11:06:01] (03PS2) 10Dereckson: Enable NewUserMessage on bh.wikipedia [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209146 (https://phabricator.wikimedia.org/T97920) [11:06:12] (03CR) 10Dereckson: "PS2: wmgNewUserMessageOnAutoCreate, rebased" [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209146 (https://phabricator.wikimedia.org/T97920) (owner: 10Dereckson) [11:09:16] PROBLEM - puppet last run on cp4014 is CRITICAL puppet fail [11:12:11] (03CR) 10Alexandros Kosiaris: [C: 032] puppetmaster::config Avoid out of module dependencies [puppet] - 10https://gerrit.wikimedia.org/r/209267 (owner: 10Alexandros Kosiaris) [11:17:07] 6operations, 6Labs, 10Labs-Infrastructure: Migrate Labs NFS storage from RAID6 to RAID10 - https://phabricator.wikimedia.org/T96063#1272083 (10coren) @mark: It's in the slides (https://commons.wikimedia.org/wiki/File:WMF_Labs_storage_presentation.pdf) but also ridiculously straightforward: shelves are mapped... [11:19:57] 6operations, 6Labs, 10Labs-Infrastructure: Migrate Labs NFS storage from RAID6 to RAID10 - https://phabricator.wikimedia.org/T96063#1272087 (10coren) A note: while it will probably increase the amount of necessary juggling, the entire setup would be //immensely// improved with raid10 if - rather than one she... [11:22:57] 6operations, 6Analytics-Kanban: Event Logging data is not showing up in Graphite anymore since last week - https://phabricator.wikimedia.org/T98380#1272092 (10fgiunchedi) @milimetric no problem -- I'll give a bit more context :) we've switched statsd implementation from txstatsd to statsite for performance/eff... [11:25:11] (03PS1) 10KartikMistry: CX: Hide 'crh' [puppet] - 10https://gerrit.wikimedia.org/r/209704 [11:25:53] akosiaris: when you've time, https://gerrit.wikimedia.org/r/#/c/209704/ [11:26:20] akosiaris: and kaz-tat :) [11:27:17] RECOVERY - puppet last run on cp4014 is OK Puppet is currently enabled, last run 1 minute ago with 0 failures [11:27:59] (03CR) 10Filippo Giunchedi: [C: 032 V: 032] sentry3: add Sentry Smart CDU detection [software/librenms] - 10https://gerrit.wikimedia.org/r/209443 (https://phabricator.wikimedia.org/T84416) (owner: 10Filippo Giunchedi) [11:29:04] hey all [11:30:05] one of my catgraph instances, i-00000184.eqiad.wmflabs (sylvester) is in SHUTOFF state. i tried rebooting it from the labs console, waited 10 minutes and reloaded the page no change. [11:31:25] now, when i click "reboot" i get a little popup status message "failed to reboot instance sylvester". "get console output" shows a similar thing "Failed to get console output for instance sylvester." [11:31:43] is there some maintenance going on or something? [11:31:59] Coren: maybe you know something? [11:32:59] yesterday the instance was up, today it's down for no apparent reason. [11:45:43] (03CR) 10Mjbmr: [C: 031] Enable NewUserMessage on bh.wikipedia [mediawiki-config] - 10https://gerrit.wikimedia.org/r/209146 (https://phabricator.wikimedia.org/T97920) (owner: 10Dereckson) [11:49:32] !log deploy librenms 2fa805ff [11:49:41] Logged the message, Master [11:51:55]