[00:02:58] gn8 folks [00:39:08] New patchset: Asher; "adding percona mysql checks" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1850 [00:39:25] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1850 [00:43:44] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1850 [00:43:45] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1850 [00:48:29] !log preilly synchronized php-1.18/extensions/MobileFrontend/MobileFrontend.php 'update to mobile frontend to fix random link' [00:48:30] Logged the message, Master [00:48:41] !log pushing quick fix for special random [00:48:42] Logged the message, Master [01:33:23] New patchset: Asher; "install just the new mysql check files on eqiad dbs" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1851 [01:45:42] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1851 [01:45:43] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1851 [02:04:31] !log LocalisationUpdate completed (1.18) at Thu Jan 12 02:04:31 UTC 2012 [02:04:33] Logged the message, Master [02:18:42] PROBLEM - Misc_Db_Lag on storage3 is CRITICAL: CHECK MySQL REPLICATION - lag - CRITICAL - Seconds_Behind_Master : 1455s [02:24:02] PROBLEM - MySQL replication status on storage3 is CRITICAL: CHECK MySQL REPLICATION - lag - CRITICAL - Seconds_Behind_Master : 1775s [02:28:32] RECOVERY - Misc_Db_Lag on storage3 is OK: CHECK MySQL REPLICATION - lag - OK - Seconds_Behind_Master : 0s [02:33:52] RECOVERY - MySQL replication status on storage3 is OK: CHECK MySQL REPLICATION - lag - OK - Seconds_Behind_Master : 0s [02:53:58] RECOVERY - Puppet freshness on srv272 is OK: puppet ran at Thu Jan 12 02:53:41 UTC 2012 [02:58:31] http://www.facebook.com/FinancialSTOCK like this page please [04:17:51] RECOVERY - MySQL disk space on es1004 is OK: DISK OK [04:23:31] RECOVERY - Disk space on es1004 is OK: DISK OK [04:36:40] PROBLEM - MySQL slave status on es1004 is CRITICAL: CRITICAL: Slave running: expected Yes, got No [09:09:36] PROBLEM - Puppet freshness on db22 is CRITICAL: Puppet has not run in the last 10 hours [09:47:26] siebrand, do you know how can I use http://integration.mediawiki.org/testswarm/ on mobile browsers? [09:48:21] I've tried on an Android tablet, but with the andoid browser it says that it doesn't need tests for it, with another browser I have it recognizes it as "websafari 3.2" but never gives tests to run [09:48:37] *Mobile Safari 3.2 [09:50:34] you need to let it sit there till there are tests ready [10:00:32] PROBLEM - MySQL disk space on es1004 is CRITICAL: DISK CRITICAL - free space: /a 440055 MB (3% inode=99%): [10:00:52] p858snake|l, there were tests ready, I checked with other browsers [10:01:22] did they connect at the same time, to get the same test batch? [10:03:16] p858snake|l, another browser previously connected received more tests but still the mobile browser wasn't sent any [10:07:02] PROBLEM - Disk space on es1004 is CRITICAL: DISK CRITICAL - free space: /a 408755 MB (3% inode=99%): [10:24:00] RECOVERY - MySQL slave status on es1004 is OK: OK: [11:25:44] PROBLEM - Puppet freshness on db1044 is CRITICAL: Puppet has not run in the last 10 hours [11:26:44] PROBLEM - Puppet freshness on db1006 is CRITICAL: Puppet has not run in the last 10 hours [11:27:44] PROBLEM - Puppet freshness on db1001 is CRITICAL: Puppet has not run in the last 10 hours [11:27:44] PROBLEM - Puppet freshness on db1018 is CRITICAL: Puppet has not run in the last 10 hours [11:36:47] PROBLEM - Puppet freshness on db1007 is CRITICAL: Puppet has not run in the last 10 hours [11:36:47] PROBLEM - Puppet freshness on db1005 is CRITICAL: Puppet has not run in the last 10 hours [11:36:47] PROBLEM - Puppet freshness on db1010 is CRITICAL: Puppet has not run in the last 10 hours [11:36:47] PROBLEM - Puppet freshness on db1009 is CRITICAL: Puppet has not run in the last 10 hours [11:36:47] PROBLEM - Puppet freshness on db1020 is CRITICAL: Puppet has not run in the last 10 hours [11:36:48] PROBLEM - Puppet freshness on db1021 is CRITICAL: Puppet has not run in the last 10 hours [11:36:48] PROBLEM - Puppet freshness on db1022 is CRITICAL: Puppet has not run in the last 10 hours [11:36:49] PROBLEM - Puppet freshness on db1038 is CRITICAL: Puppet has not run in the last 10 hours [11:36:49] PROBLEM - Puppet freshness on db1034 is CRITICAL: Puppet has not run in the last 10 hours [11:36:50] PROBLEM - Puppet freshness on db1047 is CRITICAL: Puppet has not run in the last 10 hours [11:36:50] PROBLEM - Puppet freshness on db1033 is CRITICAL: Puppet has not run in the last 10 hours [11:36:51] PROBLEM - Puppet freshness on db1043 is CRITICAL: Puppet has not run in the last 10 hours [11:36:51] PROBLEM - Puppet freshness on db1025 is CRITICAL: Puppet has not run in the last 10 hours [11:36:52] PROBLEM - Puppet freshness on db1048 is CRITICAL: Puppet has not run in the last 10 hours [11:40:27] PROBLEM - Puppet freshness on db1002 is CRITICAL: Puppet has not run in the last 10 hours [11:40:28] PROBLEM - Puppet freshness on db1035 is CRITICAL: Puppet has not run in the last 10 hours [11:40:28] PROBLEM - Puppet freshness on db1042 is CRITICAL: Puppet has not run in the last 10 hours [11:42:37] PROBLEM - Puppet freshness on db1004 is CRITICAL: Puppet has not run in the last 10 hours [11:43:37] PROBLEM - Puppet freshness on db1011 is CRITICAL: Puppet has not run in the last 10 hours [11:44:27] PROBLEM - Puppet freshness on db1030 is CRITICAL: Puppet has not run in the last 10 hours [11:46:37] PROBLEM - Puppet freshness on db1019 is CRITICAL: Puppet has not run in the last 10 hours [11:46:37] PROBLEM - Puppet freshness on db1041 is CRITICAL: Puppet has not run in the last 10 hours [11:46:37] PROBLEM - Puppet freshness on db1008 is CRITICAL: Puppet has not run in the last 10 hours [11:47:27] PROBLEM - Puppet freshness on db1017 is CRITICAL: Puppet has not run in the last 10 hours [11:48:27] PROBLEM - Puppet freshness on db1012 is CRITICAL: Puppet has not run in the last 10 hours [11:48:27] PROBLEM - Puppet freshness on db1028 is CRITICAL: Puppet has not run in the last 10 hours [11:49:27] PROBLEM - Puppet freshness on db1027 is CRITICAL: Puppet has not run in the last 10 hours [11:49:27] PROBLEM - Puppet freshness on db1015 is CRITICAL: Puppet has not run in the last 10 hours [11:50:37] PROBLEM - Puppet freshness on db1046 is CRITICAL: Puppet has not run in the last 10 hours [11:51:37] PROBLEM - Puppet freshness on db1003 is CRITICAL: Puppet has not run in the last 10 hours [11:51:37] PROBLEM - Puppet freshness on db1026 is CRITICAL: Puppet has not run in the last 10 hours [11:51:37] PROBLEM - Puppet freshness on db1045 is CRITICAL: Puppet has not run in the last 10 hours [11:51:37] PROBLEM - Puppet freshness on db1039 is CRITICAL: Puppet has not run in the last 10 hours [11:51:37] PROBLEM - Puppet freshness on db1013 is CRITICAL: Puppet has not run in the last 10 hours [11:52:27] PROBLEM - Puppet freshness on db1014 is CRITICAL: Puppet has not run in the last 10 hours [11:52:27] PROBLEM - Puppet freshness on db1024 is CRITICAL: Puppet has not run in the last 10 hours [11:53:37] PROBLEM - Puppet freshness on db1029 is CRITICAL: Puppet has not run in the last 10 hours [11:54:37] PROBLEM - Puppet freshness on db1031 is CRITICAL: Puppet has not run in the last 10 hours [11:54:38] PROBLEM - Puppet freshness on db1016 is CRITICAL: Puppet has not run in the last 10 hours [11:55:37] PROBLEM - Puppet freshness on db1040 is CRITICAL: Puppet has not run in the last 10 hours [12:07:05] Nemo_bis: AFAIK it's just not enabled for mobile browsers. [12:09:05] siebrand, ah ok, I thought the browser list was configured, not just showing all possible browsers :) [12:41:22] New patchset: Mark Bergsma; "Add generic::debconf::set definition for preseeding" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1852 [12:41:59] New patchset: Mark Bergsma; "Install all Mailman languages" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1853 [12:42:21] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1852 [12:42:22] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1852 [12:51:59] New patchset: Mark Bergsma; "Install all Mailman languages" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1853 [12:52:27] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1853 [12:52:28] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1853 [12:53:35] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1847 [12:53:36] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1847 [12:56:11] New patchset: Mark Bergsma; "Escape value var" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1854 [12:56:35] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1854 [12:56:36] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1854 [13:00:49] New patchset: Mark Bergsma; "Use the noninteractive debconf frontend" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1855 [13:01:26] New patchset: Mark Bergsma; "Use the noninteractive debconf frontend" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1855 [13:02:13] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1855 [13:02:14] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1855 [13:10:15] New patchset: Mark Bergsma; "Correct mailman default_server_language" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1856 [13:10:31] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1856 [13:10:31] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1856 [13:12:28] New patchset: Mark Bergsma; "Add newline in comparison string" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1857 [13:12:52] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1857 [13:12:53] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1857 [13:21:26] PROBLEM - Puppet freshness on ms1002 is CRITICAL: Puppet has not run in the last 10 hours [13:27:55] New patchset: Mark Bergsma; "Attempt to get the debconf test to work" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1858 [13:28:16] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1858 [13:28:16] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1858 [13:41:11] New patchset: Mark Bergsma; "Simplify test again" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1859 [13:41:26] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1859 [13:41:32] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1859 [13:41:32] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1859 [13:45:52] New patchset: Mark Bergsma; "Add quotes" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1860 [13:46:09] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1860 [13:46:10] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1860 [14:00:40] New patchset: Mark Bergsma; "Install a DNS recursor on sodium" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1861 [14:00:59] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1861 [14:01:00] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1861 [14:07:14] New review: Mark Bergsma; "Why does it need permissions for that dir anyway? e.g. df works without that..." [operations/puppet] (production); V: 0 C: 0; - https://gerrit.wikimedia.org/r/1845 [14:31:36] New review: Dzahn; "on sodium:" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1845 [14:31:36] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1845 [14:46:42] PROBLEM - MySQL disk space on db16 is CRITICAL: Connection refused by host [14:47:02] PROBLEM - Disk space on mw1092 is CRITICAL: Connection refused by host [14:47:32] PROBLEM - RAID on srv196 is CRITICAL: Connection refused by host [14:47:32] PROBLEM - Disk space on srv200 is CRITICAL: Connection refused by host [14:48:02] PROBLEM - Disk space on db18 is CRITICAL: Connection refused by host [14:48:22] PROBLEM - DPKG on es1001 is CRITICAL: Connection refused by host [14:49:52] PROBLEM - RAID on es1001 is CRITICAL: Connection refused by host [14:51:22] PROBLEM - mobile traffic loggers on cp1044 is CRITICAL: Connection refused by host [14:51:42] PROBLEM - DPKG on db16 is CRITICAL: Connection refused by host [14:54:32] PROBLEM - RAID on mw70 is CRITICAL: Connection refused by host [14:55:22] PROBLEM - DPKG on bast1001 is CRITICAL: Connection refused by host [14:55:22] PROBLEM - RAID on cp1043 is CRITICAL: Connection refused by host [14:56:02] PROBLEM - RAID on db25 is CRITICAL: Connection refused by host [14:56:02] PROBLEM - Disk space on db34 is CRITICAL: Connection refused by host [14:56:03] PROBLEM - RAID on db46 is CRITICAL: Connection refused by host [14:56:22] PROBLEM - MySQL disk space on db11 is CRITICAL: Connection refused by host [14:56:22] PROBLEM - DPKG on db13 is CRITICAL: Connection refused by host [14:56:22] PROBLEM - Disk space on mw1001 is CRITICAL: Connection refused by host [14:56:32] PROBLEM - DPKG on db18 is CRITICAL: Connection refused by host [14:56:52] PROBLEM - Disk space on mw46 is CRITICAL: Connection refused by host [14:57:12] RECOVERY - Disk space on sodium is OK: DISK OK [14:57:13] PROBLEM - Disk space on snapshot4 is CRITICAL: Connection refused by host [14:57:13] PROBLEM - Disk space on srv195 is CRITICAL: Connection refused by host [14:57:22] PROBLEM - Disk space on mw1080 is CRITICAL: Connection refused by host [14:57:32] PROBLEM - DPKG on srv271 is CRITICAL: Connection refused by host [14:57:32] PROBLEM - Disk space on bast1001 is CRITICAL: Connection refused by host [14:57:32] PROBLEM - DPKG on cp1041 is CRITICAL: Connection refused by host [14:57:42] PROBLEM - Disk space on db13 is CRITICAL: Connection refused by host [14:57:52] PROBLEM - DPKG on es4 is CRITICAL: Connection refused by host [14:58:02] PROBLEM - Disk space on es3 is CRITICAL: Connection refused by host [14:58:22] PROBLEM - DPKG on mw1146 is CRITICAL: Connection refused by host [14:58:22] PROBLEM - DPKG on mw1134 is CRITICAL: Connection refused by host [14:58:32] PROBLEM - RAID on mw55 is CRITICAL: Connection refused by host [14:58:32] PROBLEM - DPKG on mw1121 is CRITICAL: Connection refused by host [14:58:32] PROBLEM - DPKG on mw67 is CRITICAL: Connection refused by host [14:58:32] PROBLEM - RAID on mw69 is CRITICAL: Connection refused by host [14:58:32] PROBLEM - DPKG on mw70 is CRITICAL: Connection refused by host [14:58:33] PROBLEM - RAID on mw72 is CRITICAL: Connection refused by host [14:58:52] PROBLEM - DPKG on srv196 is CRITICAL: Connection refused by host [14:58:52] PROBLEM - RAID on srv210 is CRITICAL: Connection refused by host [14:59:22] PROBLEM - Disk space on virt3 is CRITICAL: Connection refused by host [14:59:22] PROBLEM - Disk space on mw1082 is CRITICAL: Connection refused by host [14:59:33] PROBLEM - Disk space on virt2 is CRITICAL: Connection refused by host [14:59:33] PROBLEM - MySQL disk space on db34 is CRITICAL: Connection refused by host [14:59:33] PROBLEM - DPKG on db46 is CRITICAL: Connection refused by host [14:59:52] PROBLEM - Disk space on db46 is CRITICAL: Connection refused by host [15:00:02] PROBLEM - RAID on mw1001 is CRITICAL: Connection refused by host [15:00:12] PROBLEM - RAID on mw1075 is CRITICAL: Connection refused by host [15:00:12] PROBLEM - RAID on mw1082 is CRITICAL: Connection refused by host [15:00:12] PROBLEM - Disk space on mw1121 is CRITICAL: Connection refused by host [15:00:12] PROBLEM - Disk space on mw1134 is CRITICAL: Connection refused by host [15:00:32] PROBLEM - Disk space on mw1146 is CRITICAL: Connection refused by host [15:00:32] PROBLEM - Disk space on mw67 is CRITICAL: Connection refused by host [15:00:32] PROBLEM - Disk space on mw70 is CRITICAL: Connection refused by host [15:00:42] PROBLEM - DPKG on mw55 is CRITICAL: Connection refused by host [15:00:42] PROBLEM - RAID on snapshot4 is CRITICAL: Connection refused by host [15:00:52] PROBLEM - Disk space on srv196 is CRITICAL: Connection refused by host [15:00:52] PROBLEM - RAID on srv208 is CRITICAL: Connection refused by host [15:00:52] PROBLEM - DPKG on srv210 is CRITICAL: Connection refused by host [15:01:02] PROBLEM - Disk space on srv236 is CRITICAL: Connection refused by host [15:01:12] PROBLEM - RAID on srv236 is CRITICAL: Connection refused by host [15:01:22] PROBLEM - Disk space on cp1043 is CRITICAL: Connection refused by host [15:01:22] PROBLEM - RAID on bast1001 is CRITICAL: Connection refused by host [15:01:22] PROBLEM - RAID on virt2 is CRITICAL: Connection refused by host [15:01:22] PROBLEM - RAID on virt3 is CRITICAL: Connection refused by host [15:01:32] PROBLEM - RAID on srv272 is CRITICAL: Connection refused by host [15:01:42] PROBLEM - DPKG on srv236 is CRITICAL: Connection refused by host [15:01:42] PROBLEM - MySQL disk space on db46 is CRITICAL: Connection refused by host [15:01:52] PROBLEM - MySQL disk space on db18 is CRITICAL: Connection refused by host [15:01:52] PROBLEM - DPKG on db11 is CRITICAL: Connection refused by host [15:01:52] PROBLEM - Disk space on es1 is CRITICAL: Connection refused by host [15:01:52] PROBLEM - MySQL disk space on es4 is CRITICAL: Connection refused by host [15:01:52] PROBLEM - Disk space on es1002 is CRITICAL: Connection refused by host [15:01:53] PROBLEM - MySQL disk space on es2 is CRITICAL: Connection refused by host [15:02:02] PROBLEM - RAID on db16 is CRITICAL: Connection refused by host [15:02:02] PROBLEM - RAID on es1 is CRITICAL: Connection refused by host [15:02:02] PROBLEM - RAID on mw1092 is CRITICAL: Connection refused by host [15:02:02] PROBLEM - RAID on es1002 is CRITICAL: Connection refused by host [15:02:02] PROBLEM - RAID on db11 is CRITICAL: Connection refused by host [15:02:12] PROBLEM - MySQL disk space on db13 is CRITICAL: Connection refused by host [15:02:22] PROBLEM - RAID on mw46 is CRITICAL: Connection refused by host [15:02:32] PROBLEM - Disk space on mw55 is CRITICAL: Connection refused by host [15:02:42] PROBLEM - RAID on es4 is CRITICAL: Connection refused by host [15:02:52] PROBLEM - DPKG on srv208 is CRITICAL: Connection refused by host [15:02:52] PROBLEM - Disk space on srv210 is CRITICAL: Connection refused by host [15:03:02] PROBLEM - Disk space on mw1075 is CRITICAL: Connection refused by host [15:03:12] PROBLEM - Disk space on srv272 is CRITICAL: Connection refused by host [15:03:12] PROBLEM - Disk space on es4 is CRITICAL: Connection refused by host [15:03:22] PROBLEM - DPKG on srv272 is CRITICAL: Connection refused by host [15:03:22] PROBLEM - Disk space on es1001 is CRITICAL: Connection refused by host [15:03:33] PROBLEM - RAID on db18 is CRITICAL: Connection refused by host [15:03:52] PROBLEM - DPKG on mw1048 is CRITICAL: Connection refused by host [15:04:03] PROBLEM - RAID on mw1104 is CRITICAL: Connection refused by host [15:04:03] PROBLEM - RAID on mw1121 is CRITICAL: Connection refused by host [15:04:13] PROBLEM - RAID on mw1134 is CRITICAL: Connection refused by host [15:04:32] PROBLEM - RAID on mw30 is CRITICAL: Connection refused by host [15:04:32] PROBLEM - RAID on mw67 is CRITICAL: Connection refused by host [15:04:32] PROBLEM - Disk space on mw69 is CRITICAL: Connection refused by host [15:04:42] PROBLEM - Disk space on mw72 is CRITICAL: Connection refused by host [15:04:42] RECOVERY - RAID on mw70 is OK: OK: no RAID installed [15:04:42] PROBLEM - Disk space on db11 is CRITICAL: Connection refused by host [15:04:52] PROBLEM - MySQL disk space on es1 is CRITICAL: Connection refused by host [15:05:02] PROBLEM - DPKG on srv195 is CRITICAL: Connection refused by host [15:05:02] PROBLEM - DPKG on mw1074 is CRITICAL: Connection refused by host [15:05:12] PROBLEM - DPKG on virt3 is CRITICAL: Connection refused by host [15:05:22] PROBLEM - RAID on srv271 is CRITICAL: Connection refused by host [15:05:22] PROBLEM - DPKG on mw46 is CRITICAL: Connection refused by host [15:05:22] PROBLEM - RAID on mw1146 is CRITICAL: Connection refused by host [15:05:25] New patchset: Dzahn; "revert change to check_disk nrpe command" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1862 [15:05:39] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1862 [15:05:42] PROBLEM - MySQL disk space on es1002 is CRITICAL: Connection refused by host [15:05:42] PROBLEM - DPKG on mw1075 is CRITICAL: Connection refused by host [15:05:52] PROBLEM - DPKG on mw1001 is CRITICAL: Connection refused by host [15:05:52] PROBLEM - DPKG on snapshot4 is CRITICAL: Connection refused by host [15:05:52] RECOVERY - RAID on db46 is OK: OK: State is Optimal, checked 2 logical device(s) [15:06:01] New review: Dzahn; "this worked fine on sodium, but obviously something happened, and we want to solve this in another w..." [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1862 [15:06:02] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1862 [15:06:12] PROBLEM - DPKG on mw72 is CRITICAL: Connection refused by host [15:06:12] PROBLEM - RAID on db13 is CRITICAL: Connection refused by host [15:06:22] PROBLEM - Disk space on srv208 is CRITICAL: Connection refused by host [15:06:22] RECOVERY - MySQL disk space on db16 is OK: DISK OK [15:06:32] RECOVERY - DPKG on db18 is OK: All packages OK [15:06:32] RECOVERY - Disk space on mw1092 is OK: DISK OK [15:06:42] PROBLEM - RAID on srv195 is CRITICAL: Connection refused by host [15:06:42] PROBLEM - mobile traffic loggers on cp1043 is CRITICAL: Connection refused by host [15:06:42] PROBLEM - Disk space on mw1048 is CRITICAL: Connection refused by host [15:07:02] RECOVERY - Disk space on snapshot4 is OK: DISK OK [15:07:12] RECOVERY - RAID on srv196 is OK: OK: no RAID installed [15:07:12] PROBLEM - Disk space on mw1088 is CRITICAL: Connection refused by host [15:07:22] RECOVERY - Disk space on srv200 is OK: DISK OK [15:07:42] PROBLEM - Disk space on cp1044 is CRITICAL: Connection refused by host [15:07:42] RECOVERY - Disk space on db13 is OK: DISK OK [15:07:42] RECOVERY - Disk space on db18 is OK: DISK OK [15:07:42] PROBLEM - DPKG on db25 is CRITICAL: Connection refused by host [15:07:52] PROBLEM - DPKG on es2 is CRITICAL: Connection refused by host [15:07:52] RECOVERY - DPKG on es4 is OK: All packages OK [15:07:52] PROBLEM - MySQL disk space on es3 is CRITICAL: Connection refused by host [15:08:02] PROBLEM - DPKG on mw30 is CRITICAL: Connection refused by host [15:08:12] RECOVERY - DPKG on es1001 is OK: All packages OK [15:08:12] RECOVERY - DPKG on mw1146 is OK: All packages OK [15:08:22] PROBLEM - DPKG on mw1142 is CRITICAL: Connection refused by host [15:08:22] RECOVERY - DPKG on mw1134 is OK: All packages OK [15:08:22] RECOVERY - RAID on mw55 is OK: OK: no RAID installed [15:08:22] PROBLEM - DPKG on mw33 is CRITICAL: Connection refused by host [15:08:22] RECOVERY - DPKG on mw1121 is OK: All packages OK [15:08:32] PROBLEM - RAID on mw33 is CRITICAL: Connection refused by host [15:08:33] PROBLEM - DPKG on mw1136 is CRITICAL: Connection refused by host [15:08:33] RECOVERY - DPKG on mw67 is OK: All packages OK [15:08:42] RECOVERY - DPKG on mw70 is OK: All packages OK [15:08:52] RECOVERY - RAID on srv210 is OK: OK: no RAID installed [15:08:52] RECOVERY - DPKG on srv196 is OK: All packages OK [15:09:12] RECOVERY - Disk space on virt3 is OK: DISK OK [15:09:12] RECOVERY - Disk space on mw1082 is OK: DISK OK [15:09:22] RECOVERY - DPKG on db46 is OK: All packages OK [15:09:32] PROBLEM - Disk space on db25 is CRITICAL: Connection refused by host [15:09:32] RECOVERY - RAID on es1001 is OK: OK: State is Optimal, checked 2 logical device(s) [15:09:42] PROBLEM - MySQL disk space on db45 is CRITICAL: Connection refused by host [15:09:42] PROBLEM - DPKG on srv190 is CRITICAL: Connection refused by host [15:09:42] PROBLEM - Disk space on mw1074 is CRITICAL: Connection refused by host [15:09:52] PROBLEM - Disk space on srv190 is CRITICAL: Connection refused by host [15:09:52] PROBLEM - RAID on srv229 is CRITICAL: Connection refused by host [15:09:52] PROBLEM - RAID on mw1048 is CRITICAL: Connection refused by host [15:09:52] PROBLEM - RAID on mw1037 is CRITICAL: Connection refused by host [15:09:52] RECOVERY - Disk space on virt2 is OK: DISK OK [15:10:02] PROBLEM - RAID on mw1074 is CRITICAL: Connection refused by host [15:10:02] PROBLEM - Disk space on srv276 is CRITICAL: Connection refused by host [15:10:02] RECOVERY - RAID on mw1075 is OK: OK: no RAID installed [15:10:02] RECOVERY - RAID on mw1082 is OK: OK: no RAID installed [15:10:02] RECOVERY - Disk space on mw1121 is OK: DISK OK [15:10:02] PROBLEM - Disk space on mw1136 is CRITICAL: Connection refused by host [15:10:03] RECOVERY - Disk space on mw1134 is OK: DISK OK [15:10:12] PROBLEM - Disk space on mw1141 is CRITICAL: Connection refused by host [15:10:13] PROBLEM - Disk space on mw33 is CRITICAL: Connection refused by host [15:10:13] PROBLEM - DPKG on db51 is CRITICAL: Connection refused by host [15:10:13] PROBLEM - RAID on db51 is CRITICAL: Connection refused by host [15:10:22] RECOVERY - Disk space on mw67 is OK: DISK OK [15:10:22] RECOVERY - Disk space on mw70 is OK: DISK OK [15:10:22] RECOVERY - Disk space on db46 is OK: DISK OK [15:10:22] PROBLEM - RAID on mw1080 is CRITICAL: Connection refused by host [15:10:32] RECOVERY - DPKG on mw55 is OK: All packages OK [15:10:32] RECOVERY - RAID on snapshot4 is OK: OK: no RAID installed [15:10:42] RECOVERY - Disk space on srv196 is OK: DISK OK [15:10:42] PROBLEM - Disk space on srv207 is CRITICAL: Connection refused by host [15:10:42] RECOVERY - RAID on srv208 is OK: OK: no RAID installed [15:10:42] RECOVERY - DPKG on srv210 is OK: All packages OK [15:10:42] PROBLEM - Disk space on mw1142 is CRITICAL: Connection refused by host [15:10:52] RECOVERY - Disk space on srv236 is OK: DISK OK [15:10:52] PROBLEM - RAID on mw1079 is CRITICAL: Connection refused by host [15:11:02] RECOVERY - Disk space on mw1146 is OK: DISK OK [15:11:02] RECOVERY - RAID on mw1001 is OK: OK: no RAID installed [15:11:12] PROBLEM - Disk space on cp1041 is CRITICAL: Connection refused by host [15:11:12] PROBLEM - mobile traffic loggers on cp1041 is CRITICAL: Connection refused by host [15:11:13] PROBLEM - RAID on cp1044 is CRITICAL: Connection refused by host [15:11:13] RECOVERY - RAID on bast1001 is OK: OK: no RAID installed [15:11:13] RECOVERY - RAID on virt3 is OK: OK: State is Optimal, checked 2 logical device(s) [15:11:13] RECOVERY - RAID on virt2 is OK: OK: State is Optimal, checked 2 logical device(s) [15:11:22] RECOVERY - RAID on srv272 is OK: OK: no RAID installed [15:11:22] PROBLEM - RAID on mw1088 is CRITICAL: Connection refused by host [15:11:32] RECOVERY - DPKG on db16 is OK: All packages OK [15:11:32] PROBLEM - RAID on db34 is CRITICAL: Connection refused by host [15:11:32] RECOVERY - MySQL disk space on db46 is OK: DISK OK [15:11:32] PROBLEM - Disk space on db51 is CRITICAL: Connection refused by host [15:11:32] PROBLEM - RAID on srv267 is CRITICAL: Connection refused by host [15:11:42] RECOVERY - Disk space on es1 is OK: DISK OK [15:11:42] RECOVERY - MySQL disk space on es4 is OK: DISK OK [15:11:42] PROBLEM - RAID on es2 is CRITICAL: Connection refused by host [15:11:42] RECOVERY - Disk space on es1002 is OK: DISK OK [15:11:52] RECOVERY - MySQL disk space on db18 is OK: DISK OK [15:11:52] RECOVERY - RAID on es1 is OK: OK: State is Optimal, checked 2 logical device(s) [15:11:52] RECOVERY - RAID on mw1092 is OK: OK: no RAID installed [15:11:52] RECOVERY - RAID on es1002 is OK: OK: State is Optimal, checked 2 logical device(s) [15:11:52] RECOVERY - RAID on db16 is OK: OK: 1 logical device(s) checked [15:11:52] RECOVERY - RAID on db11 is OK: OK: 1 logical device(s) checked [15:12:02] RECOVERY - MySQL disk space on db13 is OK: DISK OK [15:12:12] PROBLEM - Disk space on mw1104 is CRITICAL: Connection refused by host [15:12:12] PROBLEM - RAID on mw41 is CRITICAL: Connection refused by host [15:12:12] RECOVERY - RAID on mw46 is OK: OK: no RAID installed [15:12:22] RECOVERY - DPKG on db11 is OK: All packages OK [15:12:22] PROBLEM - Disk space on db45 is CRITICAL: Connection refused by host [15:12:22] RECOVERY - Disk space on mw55 is OK: DISK OK [15:12:32] RECOVERY - RAID on srv236 is OK: OK: no RAID installed [15:12:32] PROBLEM - DPKG on mw1104 is CRITICAL: Connection refused by host [15:12:32] RECOVERY - RAID on es4 is OK: OK: State is Optimal, checked 2 logical device(s) [15:12:42] PROBLEM - DPKG on cp1043 is CRITICAL: Connection refused by host [15:12:42] RECOVERY - Disk space on srv210 is OK: DISK OK [15:12:42] RECOVERY - DPKG on srv208 is OK: All packages OK [15:12:52] RECOVERY - Disk space on mw1075 is OK: DISK OK [15:13:02] RECOVERY - DPKG on srv236 is OK: All packages OK [15:13:02] PROBLEM - Disk space on srv267 is CRITICAL: Connection refused by host [15:13:02] RECOVERY - Disk space on srv272 is OK: DISK OK [15:13:02] RECOVERY - Disk space on es4 is OK: DISK OK [15:13:12] RECOVERY - DPKG on srv272 is OK: All packages OK [15:13:12] RECOVERY - Disk space on es1001 is OK: DISK OK [15:13:22] RECOVERY - RAID on db18 is OK: OK: 1 logical device(s) checked [15:13:42] PROBLEM - RAID on db45 is CRITICAL: Connection refused by host [15:13:43] PROBLEM - DPKG on mw1036 is CRITICAL: Connection refused by host [15:13:43] PROBLEM - DPKG on mw1037 is CRITICAL: Connection refused by host [15:13:43] RECOVERY - DPKG on mw1048 is OK: All packages OK [15:13:52] RECOVERY - RAID on mw1104 is OK: OK: no RAID installed [15:13:52] RECOVERY - RAID on mw1121 is OK: OK: no RAID installed [15:14:02] RECOVERY - RAID on mw1134 is OK: OK: no RAID installed [15:14:02] PROBLEM - Disk space on srv283 is CRITICAL: Connection refused by host [15:14:12] PROBLEM - RAID on mw1142 is CRITICAL: Connection refused by host [15:14:12] PROBLEM - RAID on mw1141 is CRITICAL: Connection refused by host [15:14:12] PROBLEM - DPKG on mw41 is CRITICAL: Connection refused by host [15:14:12] PROBLEM - RAID on srv235 is CRITICAL: Connection refused by host [15:14:22] RECOVERY - RAID on mw30 is OK: OK: no RAID installed [15:14:23] RECOVERY - Disk space on mw69 is OK: DISK OK [15:14:23] RECOVERY - RAID on mw67 is OK: OK: no RAID installed [15:14:23] PROBLEM - RAID on es3 is CRITICAL: Connection refused by host [15:14:32] PROBLEM - DPKG on srv267 is CRITICAL: Connection refused by host [15:14:32] RECOVERY - Disk space on db11 is OK: DISK OK [15:14:32] PROBLEM - DPKG on fenari is CRITICAL: Connection refused by host [15:14:42] PROBLEM - RAID on srv190 is CRITICAL: Connection refused by host [15:14:42] RECOVERY - MySQL disk space on es1 is OK: DISK OK [15:14:52] PROBLEM - DPKG on srv235 is CRITICAL: Connection refused by host [15:14:52] RECOVERY - DPKG on srv195 is OK: All packages OK [15:14:52] RECOVERY - Disk space on mw72 is OK: DISK OK [15:14:52] RECOVERY - DPKG on mw1074 is OK: All packages OK [15:15:02] RECOVERY - DPKG on virt3 is OK: All packages OK [15:15:02] PROBLEM - DPKG on mw1067 is CRITICAL: Connection refused by host [15:15:12] RECOVERY - RAID on srv271 is OK: OK: no RAID installed [15:15:13] PROBLEM - RAID on srv276 is CRITICAL: Connection refused by host [15:15:13] RECOVERY - DPKG on mw46 is OK: All packages OK [15:15:13] RECOVERY - DPKG on bast1001 is OK: All packages OK [15:15:13] RECOVERY - RAID on mw1146 is OK: OK: no RAID installed [15:15:22] PROBLEM - MySQL disk space on db51 is CRITICAL: Connection refused by host [15:15:22] PROBLEM - DPKG on cp1044 is CRITICAL: Connection refused by host [15:15:22] PROBLEM - Disk space on mw1079 is CRITICAL: Connection refused by host [15:15:22] PROBLEM - DPKG on mw1003 is CRITICAL: Connection refused by host [15:15:32] RECOVERY - MySQL disk space on es1002 is OK: DISK OK [15:15:32] RECOVERY - DPKG on mw1075 is OK: All packages OK [15:15:42] RECOVERY - DPKG on snapshot4 is OK: All packages OK [15:15:43] RECOVERY - DPKG on mw1001 is OK: All packages OK [15:15:43] RECOVERY - Disk space on db34 is OK: DISK OK [15:15:43] PROBLEM - DPKG on db45 is CRITICAL: Connection refused by host [15:15:43] RECOVERY - RAID on db25 is OK: OK: 1 logical device(s) checked [15:15:52] PROBLEM - RAID on mw1152 is CRITICAL: Connection refused by host [15:15:52] PROBLEM - DPKG on mw1079 is CRITICAL: Connection refused by host [15:16:02] PROBLEM - RAID on db50 is CRITICAL: Connection refused by host [15:16:02] RECOVERY - DPKG on mw72 is OK: All packages OK [15:16:02] RECOVERY - MySQL disk space on db11 is OK: DISK OK [15:16:02] PROBLEM - DPKG on mw1088 is CRITICAL: Connection refused by host [15:16:02] RECOVERY - RAID on db13 is OK: OK: 1 logical device(s) checked [15:16:12] RECOVERY - DPKG on db13 is OK: All packages OK [15:16:13] RECOVERY - Disk space on srv208 is OK: DISK OK [15:16:13] PROBLEM - Disk space on mw1037 is CRITICAL: Connection refused by host [15:16:22] PROBLEM - Disk space on mw1003 is CRITICAL: Connection refused by host [15:16:23] PROBLEM - Disk space on mw1002 is CRITICAL: Connection refused by host [15:16:23] RECOVERY - Disk space on mw1001 is OK: DISK OK [15:16:32] RECOVERY - RAID on srv195 is OK: OK: no RAID installed [15:16:32] PROBLEM - Disk space on mw1036 is CRITICAL: Connection refused by host [15:16:32] RECOVERY - Disk space on mw1048 is OK: DISK OK [15:16:42] RECOVERY - Disk space on mw46 is OK: DISK OK [15:16:52] PROBLEM - Disk space on mw41 is CRITICAL: Connection refused by host [15:17:02] RECOVERY - Disk space on mw1080 is OK: DISK OK [15:17:02] RECOVERY - Disk space on mw1088 is OK: DISK OK [15:17:12] RECOVERY - Disk space on srv195 is OK: DISK OK [15:17:12] PROBLEM - RAID on srv207 is CRITICAL: Connection refused by host [15:17:12] PROBLEM - RAID on srv254 is CRITICAL: Connection refused by host [15:17:12] PROBLEM - Disk space on mw1050 is CRITICAL: Connection refused by host [15:17:22] RECOVERY - DPKG on srv271 is OK: All packages OK [15:17:22] PROBLEM - DPKG on srv276 is CRITICAL: Connection refused by host [15:17:22] PROBLEM - Disk space on srv285 is CRITICAL: Connection refused by host [15:17:32] PROBLEM - mobile traffic loggers on cp1042 is CRITICAL: Connection refused by host [15:17:32] RECOVERY - Disk space on bast1001 is OK: DISK OK [15:17:32] RECOVERY - DPKG on cp1041 is OK: All packages OK [15:17:32] RECOVERY - DPKG on db25 is OK: All packages OK [15:17:42] RECOVERY - MySQL disk space on es3 is OK: DISK OK [15:17:42] PROBLEM - Disk space on es1003 is CRITICAL: Connection refused by host [15:17:52] PROBLEM - Disk space on fenari is CRITICAL: Connection refused by host [15:17:52] RECOVERY - DPKG on mw30 is OK: All packages OK [15:17:52] PROBLEM - DPKG on es1003 is CRITICAL: Connection refused by host [15:17:52] PROBLEM - RAID on es1003 is CRITICAL: Connection refused by host [15:18:02] PROBLEM - DPKG on mw1152 is CRITICAL: Connection refused by host [15:18:12] PROBLEM - DPKG on mw1111 is CRITICAL: Connection refused by host [15:18:12] RECOVERY - DPKG on mw1142 is OK: All packages OK [15:18:12] RECOVERY - DPKG on mw33 is OK: All packages OK [15:18:12] RECOVERY - Disk space on es3 is OK: DISK OK [15:18:12] RECOVERY - DPKG on es2 is OK: All packages OK [15:18:33] PROBLEM - DPKG on mw1141 is CRITICAL: Connection refused by host [15:18:34] RECOVERY - DPKG on mw1136 is OK: All packages OK [15:18:42] PROBLEM - DPKG on srv207 is CRITICAL: Connection refused by host [15:18:42] PROBLEM - DPKG on mw42 is CRITICAL: Connection refused by host [15:18:42] RECOVERY - RAID on mw72 is OK: OK: no RAID installed [15:18:42] RECOVERY - RAID on mw69 is OK: OK: no RAID installed [15:18:42] RECOVERY - RAID on mw33 is OK: OK: no RAID installed [15:18:52] PROBLEM - DPKG on mw1159 is CRITICAL: Connection refused by host [15:18:52] PROBLEM - Disk space on mw1067 is CRITICAL: Connection refused by host [15:19:02] PROBLEM - DPKG on db50 is CRITICAL: Connection refused by host [15:19:22] RECOVERY - Disk space on db25 is OK: DISK OK [15:19:32] PROBLEM - RAID on srv261 is CRITICAL: Connection refused by host [15:19:32] PROBLEM - DPKG on db47 is CRITICAL: Connection refused by host [15:19:32] PROBLEM - Disk space on srv235 is CRITICAL: Connection refused by host [15:19:32] PROBLEM - Disk space on mw1073 is CRITICAL: Connection refused by host [15:19:42] RECOVERY - MySQL disk space on db34 is OK: DISK OK [15:19:42] RECOVERY - Disk space on mw1074 is OK: DISK OK [15:19:42] RECOVERY - Disk space on srv190 is OK: DISK OK [15:19:42] RECOVERY - RAID on srv229 is OK: OK: no RAID installed [15:19:42] PROBLEM - RAID on mw1036 is CRITICAL: Connection refused by host [15:19:42] RECOVERY - RAID on mw1048 is OK: OK: no RAID installed [15:19:43] RECOVERY - RAID on mw1037 is OK: OK: no RAID installed [15:19:52] PROBLEM - RAID on srv231 is CRITICAL: Connection refused by host [15:19:52] PROBLEM - Disk space on mw1111 is CRITICAL: Connection refused by host [15:19:52] RECOVERY - RAID on mw1074 is OK: OK: no RAID installed [15:19:52] RECOVERY - Disk space on mw1136 is OK: DISK OK [15:20:02] RECOVERY - DPKG on srv190 is OK: All packages OK [15:20:03] PROBLEM - Disk space on mw1159 is CRITICAL: Connection refused by host [15:20:03] RECOVERY - Disk space on mw1141 is OK: DISK OK [15:20:12] PROBLEM - RAID on mw1002 is CRITICAL: Connection refused by host [15:20:12] PROBLEM - Disk space on mw42 is CRITICAL: Connection refused by host [15:20:22] RECOVERY - Disk space on mw33 is OK: DISK OK [15:20:22] PROBLEM - Disk space on db50 is CRITICAL: Connection refused by host [15:20:22] RECOVERY - RAID on mw1080 is OK: OK: no RAID installed [15:20:32] RECOVERY - DPKG on db51 is OK: All packages OK [15:20:32] RECOVERY - RAID on db51 is OK: OK: State is Optimal, checked 2 logical device(s) [15:20:42] PROBLEM - RAID on mw1067 is CRITICAL: Connection refused by host [15:20:43] PROBLEM - RAID on mw1050 is CRITICAL: Connection refused by host [15:20:43] RECOVERY - Disk space on mw1142 is OK: DISK OK [15:20:52] RECOVERY - Disk space on srv207 is OK: DISK OK [15:20:52] PROBLEM - RAID on mw1003 is CRITICAL: Connection refused by host [15:20:52] PROBLEM - DPKG on srv261 is CRITICAL: Connection refused by host [15:20:52] PROBLEM - DPKG on srv254 is CRITICAL: Connection refused by host [15:20:52] PROBLEM - DPKG on srv283 is CRITICAL: Connection refused by host [15:21:02] PROBLEM - RAID on mw1073 is CRITICAL: Connection refused by host [15:21:02] PROBLEM - Disk space on srv263 is CRITICAL: Connection refused by host [15:21:02] RECOVERY - RAID on mw1079 is OK: OK: no RAID installed [15:21:02] PROBLEM - MySQL disk space on storage3 is CRITICAL: Connection refused by host [15:21:02] RECOVERY - Disk space on cp1041 is OK: DISK OK [15:21:02] RECOVERY - mobile traffic loggers on cp1041 is OK: PROCS OK: 2 processes with command name varnishncsa [15:21:03] RECOVERY - RAID on cp1044 is OK: OK: Active: 4, Working: 4, Failed: 0, Spare: 0 [15:21:12] PROBLEM - RAID on cp1042 is CRITICAL: Connection refused by host [15:21:22] PROBLEM - MySQL disk space on db50 is CRITICAL: Connection refused by host [15:21:22] PROBLEM - Disk space on db47 is CRITICAL: Connection refused by host [15:21:22] RECOVERY - Disk space on db51 is OK: DISK OK [15:21:22] RECOVERY - RAID on db34 is OK: OK: 1 logical device(s) checked [15:21:32] RECOVERY - RAID on srv267 is OK: OK: no RAID installed [15:21:32] PROBLEM - RAID on srv283 is CRITICAL: Connection refused by host [15:21:32] RECOVERY - RAID on mw1088 is OK: OK: no RAID installed [15:21:32] RECOVERY - MySQL disk space on es2 is OK: DISK OK [15:21:32] RECOVERY - RAID on es2 is OK: OK: State is Optimal, checked 2 logical device(s) [15:21:42] PROBLEM - RAID on fenari is CRITICAL: Connection refused by host [15:22:03] PROBLEM - Disk space on srv258 is CRITICAL: Connection refused by host [15:22:03] RECOVERY - RAID on mw41 is OK: OK: no RAID installed [15:22:03] RECOVERY - Disk space on mw1104 is OK: DISK OK [15:22:12] PROBLEM - MySQL disk space on es1003 is CRITICAL: Connection refused by host [15:22:13] RECOVERY - Disk space on db45 is OK: DISK OK [15:22:32] PROBLEM - DPKG on srv231 is CRITICAL: Connection refused by host [15:22:42] RECOVERY - DPKG on mw1104 is OK: All packages OK [15:22:52] PROBLEM - Disk space on emery is CRITICAL: Connection refused by host [15:22:52] RECOVERY - Disk space on srv267 is OK: DISK OK [15:23:02] PROBLEM - Disk space on srv261 is CRITICAL: Connection refused by host [15:23:02] PROBLEM - RAID on mw1023 is CRITICAL: Connection refused by host [15:23:02] PROBLEM - Disk space on srv231 is CRITICAL: Connection refused by host [15:23:12] PROBLEM - Disk space on srv238 is CRITICAL: Connection refused by host [15:23:32] PROBLEM - MySQL disk space on db47 is CRITICAL: Connection refused by host [15:23:32] RECOVERY - RAID on db45 is OK: OK: State is Optimal, checked 2 logical device(s) [15:23:42] PROBLEM - DPKG on mw1046 is CRITICAL: Connection refused by host [15:23:42] RECOVERY - DPKG on mw1036 is OK: All packages OK [15:23:42] PROBLEM - DPKG on mw1053 is CRITICAL: Connection refused by host [15:23:42] PROBLEM - DPKG on mw1050 is CRITICAL: Connection refused by host [15:23:42] RECOVERY - DPKG on mw1037 is OK: All packages OK [15:23:52] PROBLEM - DPKG on mw1073 is CRITICAL: Connection refused by host [15:23:52] PROBLEM - RAID on mw1111 is CRITICAL: Connection refused by host [15:24:02] PROBLEM - RAID on mw1143 is CRITICAL: Connection refused by host [15:24:02] PROBLEM - RAID on mw1133 is CRITICAL: Connection refused by host [15:24:02] PROBLEM - DPKG on mw18 is CRITICAL: Connection refused by host [15:24:02] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [15:24:02] RECOVERY - RAID on mw1141 is OK: OK: no RAID installed [15:24:02] RECOVERY - DPKG on mw41 is OK: All packages OK [15:24:03] PROBLEM - DPKG on cp1042 is CRITICAL: Connection refused by host [15:24:12] PROBLEM - RAID on srv285 is CRITICAL: Connection refused by host [15:24:12] PROBLEM - RAID on mw4 is CRITICAL: Connection refused by host [15:24:22] PROBLEM - Disk space on srv254 is CRITICAL: Connection refused by host [15:24:22] RECOVERY - RAID on es3 is OK: OK: State is Optimal, checked 2 logical device(s) [15:24:22] RECOVERY - DPKG on srv267 is OK: All packages OK [15:24:32] PROBLEM - DPKG on mw1002 is CRITICAL: Connection refused by host [15:24:32] RECOVERY - RAID on srv190 is OK: OK: no RAID installed [15:24:42] RECOVERY - DPKG on fenari is OK: All packages OK [15:24:42] PROBLEM - Disk space on mw1152 is CRITICAL: Connection refused by host [15:24:42] PROBLEM - RAID on srv238 is CRITICAL: Connection refused by host [15:24:52] PROBLEM - DPKG on mw1023 is CRITICAL: Connection refused by host [15:24:52] PROBLEM - DPKG on srv285 is CRITICAL: Connection refused by host [15:25:02] RECOVERY - DPKG on mw1067 is OK: All packages OK [15:25:02] RECOVERY - RAID on srv276 is OK: OK: no RAID installed [15:25:02] PROBLEM - Disk space on cp1042 is CRITICAL: Connection refused by host [15:25:02] PROBLEM - DPKG on mw1008 is CRITICAL: Connection refused by host [15:25:12] PROBLEM - DPKG on storage3 is CRITICAL: Connection refused by host [15:25:12] RECOVERY - MySQL disk space on db51 is OK: DISK OK [15:25:13] PROBLEM - RAID on emery is CRITICAL: Connection refused by host [15:25:13] RECOVERY - DPKG on cp1044 is OK: All packages OK [15:25:13] RECOVERY - Disk space on mw1079 is OK: DISK OK [15:25:13] RECOVERY - DPKG on mw1003 is OK: All packages OK [15:25:22] PROBLEM - RAID on mw18 is CRITICAL: Connection refused by host [15:25:32] PROBLEM - RAID on mw1159 is CRITICAL: Connection refused by host [15:25:32] PROBLEM - DPKG on mw1009 is CRITICAL: Connection refused by host [15:25:42] RECOVERY - DPKG on db45 is OK: All packages OK [15:25:43] RECOVERY - RAID on mw1152 is OK: OK: no RAID installed [15:25:52] RECOVERY - DPKG on mw1079 is OK: All packages OK [15:25:52] PROBLEM - DPKG on emery is CRITICAL: Connection refused by host [15:25:52] RECOVERY - DPKG on mw1088 is OK: All packages OK [15:25:52] RECOVERY - RAID on db50 is OK: OK: State is Optimal, checked 2 logical device(s) [15:26:02] RECOVERY - Disk space on mw1037 is OK: DISK OK [15:26:12] PROBLEM - Disk space on mw1008 is CRITICAL: Connection refused by host [15:26:12] RECOVERY - Disk space on mw1003 is OK: DISK OK [15:26:22] PROBLEM - Disk space on mw1046 is CRITICAL: Connection refused by host [15:26:33] PROBLEM - RAID on searchidx2 is CRITICAL: Connection refused by host [15:26:33] PROBLEM - RAID on mw42 is CRITICAL: Connection refused by host [15:26:33] PROBLEM - DPKG on mw4 is CRITICAL: Connection refused by host [15:26:33] PROBLEM - Disk space on mw1009 is CRITICAL: Connection refused by host [15:26:42] RECOVERY - Disk space on mw1036 is OK: DISK OK [15:26:42] RECOVERY - Disk space on mw41 is OK: DISK OK [15:26:52] PROBLEM - Disk space on mw1023 is CRITICAL: Connection refused by host [15:26:52] PROBLEM - DPKG on searchidx2 is CRITICAL: Connection refused by host [15:27:02] PROBLEM - RAID on srv263 is CRITICAL: Connection refused by host [15:27:02] RECOVERY - RAID on srv207 is OK: OK: no RAID installed [15:27:12] RECOVERY - DPKG on srv276 is OK: All packages OK [15:27:12] RECOVERY - Disk space on srv285 is OK: DISK OK [15:27:22] RECOVERY - Disk space on mw1050 is OK: DISK OK [15:27:22] RECOVERY - Disk space on cp1044 is OK: DISK OK [15:27:32] RECOVERY - Disk space on es1003 is OK: DISK OK [15:27:42] RECOVERY - Disk space on fenari is OK: DISK OK [15:27:43] RECOVERY - DPKG on es1003 is OK: All packages OK [15:27:43] RECOVERY - RAID on es1003 is OK: OK: State is Optimal, checked 2 logical device(s) [15:27:52] PROBLEM - RAID on srv258 is CRITICAL: Connection refused by host [15:27:52] RECOVERY - DPKG on mw1152 is OK: All packages OK [15:28:12] PROBLEM - Disk space on mw1053 is CRITICAL: Connection refused by host [15:28:13] PROBLEM - Disk space on mw4 is CRITICAL: Connection refused by host [15:28:22] PROBLEM - DPKG on mw1143 is CRITICAL: Connection refused by host [15:28:32] RECOVERY - DPKG on mw1141 is OK: All packages OK [15:28:33] RECOVERY - DPKG on srv207 is OK: All packages OK [15:28:42] PROBLEM - DPKG on mw1133 is CRITICAL: Connection refused by host [15:28:42] PROBLEM - DPKG on srv258 is CRITICAL: Connection refused by host [15:28:52] RECOVERY - Disk space on mw1067 is OK: DISK OK [15:28:52] PROBLEM - DPKG on srv263 is CRITICAL: Connection refused by host [15:28:52] PROBLEM - Disk space on searchidx2 is CRITICAL: Connection refused by host [15:28:52] PROBLEM - Disk space on mw18 is CRITICAL: Connection refused by host [15:29:02] RECOVERY - DPKG on db50 is OK: All packages OK [15:29:02] RECOVERY - DPKG on mw42 is OK: All packages OK [15:29:02] PROBLEM - RAID on storage3 is CRITICAL: Connection refused by host [15:29:02] PROBLEM - DPKG on srv238 is CRITICAL: Connection refused by host [15:29:22] RECOVERY - Disk space on srv235 is OK: DISK OK [15:29:22] RECOVERY - MySQL disk space on db45 is OK: DISK OK [15:29:22] RECOVERY - Disk space on mw1073 is OK: DISK OK [15:29:32] PROBLEM - RAID on mw1008 is CRITICAL: Connection refused by host [15:29:32] PROBLEM - RAID on mw1046 is CRITICAL: Connection refused by host [15:29:32] RECOVERY - RAID on mw1036 is OK: OK: no RAID installed [15:29:32] PROBLEM - Disk space on storage3 is CRITICAL: Connection refused by host [15:29:42] RECOVERY - Disk space on srv276 is OK: DISK OK [15:29:42] PROBLEM - RAID on mw1009 is CRITICAL: Connection refused by host [15:29:42] PROBLEM - RAID on mw1053 is CRITICAL: Connection refused by host [15:29:42] RECOVERY - Disk space on mw1111 is OK: DISK OK [15:29:52] PROBLEM - Disk space on mw1133 is CRITICAL: Connection refused by host [15:29:52] RECOVERY - RAID on srv231 is OK: OK: no RAID installed [15:30:02] RECOVERY - Disk space on mw42 is OK: DISK OK [15:30:02] RECOVERY - RAID on mw1002 is OK: OK: no RAID installed [15:30:12] RECOVERY - Disk space on db50 is OK: DISK OK [15:30:32] RECOVERY - RAID on mw1067 is OK: OK: no RAID installed [15:30:32] RECOVERY - RAID on mw1050 is OK: OK: no RAID installed [15:30:39] New patchset: Mark Bergsma; "Install files in web docroot" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1863 [15:30:42] RECOVERY - RAID on mw1003 is OK: OK: no RAID installed [15:30:42] RECOVERY - DPKG on srv261 is OK: All packages OK [15:30:42] RECOVERY - DPKG on srv283 is OK: All packages OK [15:30:52] RECOVERY - RAID on srv261 is OK: OK: no RAID installed [15:30:52] RECOVERY - MySQL disk space on storage3 is OK: DISK OK [15:30:52] RECOVERY - RAID on mw1073 is OK: OK: no RAID installed [15:30:55] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1863 [15:31:02] RECOVERY - RAID on cp1042 is OK: OK: Active: 4, Working: 4, Failed: 0, Spare: 0 [15:31:09] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1863 [15:31:09] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1863 [15:31:12] RECOVERY - MySQL disk space on db50 is OK: DISK OK [15:31:12] RECOVERY - Disk space on db47 is OK: DISK OK [15:31:22] RECOVERY - RAID on srv283 is OK: OK: no RAID installed [15:31:23] PROBLEM - Disk space on mw1143 is CRITICAL: Connection refused by host [15:31:32] RECOVERY - RAID on fenari is OK: OK: Active: 2, Working: 2, Failed: 0, Spare: 0 [15:31:43] PROBLEM - DPKG on mw1107 is CRITICAL: Connection refused by host [15:32:03] RECOVERY - MySQL disk space on es1003 is OK: DISK OK [15:32:22] RECOVERY - DPKG on srv231 is OK: All packages OK [15:32:42] RECOVERY - Disk space on emery is OK: DISK OK [15:32:42] PROBLEM - RAID on srv278 is CRITICAL: Connection refused by host [15:32:42] RECOVERY - Disk space on srv258 is OK: DISK OK [15:32:52] RECOVERY - Disk space on srv261 is OK: DISK OK [15:32:52] RECOVERY - Disk space on srv231 is OK: DISK OK [15:32:52] RECOVERY - RAID on mw1023 is OK: OK: no RAID installed [15:33:03] PROBLEM - RAID on mw1025 is CRITICAL: Connection refused by host [15:33:03] RECOVERY - Disk space on srv238 is OK: DISK OK [15:33:22] RECOVERY - MySQL disk space on db47 is OK: DISK OK [15:33:32] PROBLEM - DPKG on mw1035 is CRITICAL: Connection refused by host [15:33:32] RECOVERY - DPKG on mw1046 is OK: All packages OK [15:33:32] RECOVERY - DPKG on mw1050 is OK: All packages OK [15:33:32] RECOVERY - DPKG on mw1053 is OK: All packages OK [15:33:42] RECOVERY - Disk space on srv283 is OK: DISK OK [15:33:42] RECOVERY - DPKG on mw1073 is OK: All packages OK [15:33:42] RECOVERY - RAID on mw1111 is OK: OK: no RAID installed [15:33:52] RECOVERY - DPKG on mw18 is OK: All packages OK [15:33:52] RECOVERY - RAID on mw1143 is OK: OK: no RAID installed [15:33:52] PROBLEM - RAID on mw1107 is CRITICAL: Connection refused by host [15:33:52] RECOVERY - RAID on mw1133 is OK: OK: no RAID installed [15:33:52] RECOVERY - DPKG on cp1042 is OK: All packages OK [15:34:02] RECOVERY - RAID on srv285 is OK: OK: no RAID installed [15:34:02] RECOVERY - RAID on srv235 is OK: OK: no RAID installed [15:34:02] RECOVERY - RAID on mw4 is OK: OK: no RAID installed [15:34:12] RECOVERY - Disk space on srv254 is OK: DISK OK [15:34:22] RECOVERY - DPKG on mw1002 is OK: All packages OK [15:34:32] RECOVERY - Disk space on mw1152 is OK: DISK OK [15:34:32] RECOVERY - DPKG on srv235 is OK: All packages OK [15:34:32] RECOVERY - RAID on srv238 is OK: OK: no RAID installed [15:34:42] PROBLEM - DPKG on mw1096 is CRITICAL: Connection refused by host [15:34:43] RECOVERY - DPKG on srv285 is OK: All packages OK [15:34:43] RECOVERY - DPKG on mw1023 is OK: All packages OK [15:34:52] RECOVERY - Disk space on cp1042 is OK: DISK OK [15:34:52] RECOVERY - DPKG on mw1008 is OK: All packages OK [15:35:02] RECOVERY - DPKG on storage3 is OK: All packages OK [15:35:02] PROBLEM - DPKG on mw1025 is CRITICAL: Connection refused by host [15:35:02] RECOVERY - RAID on emery is OK: OK: Active: 2, Working: 2, Failed: 0, Spare: 0 [15:35:12] RECOVERY - RAID on mw18 is OK: OK: no RAID installed [15:35:22] RECOVERY - DPKG on mw1009 is OK: All packages OK [15:35:42] RECOVERY - DPKG on emery is OK: All packages OK [15:36:02] RECOVERY - Disk space on mw1008 is OK: DISK OK [15:36:02] RECOVERY - Disk space on mw1002 is OK: DISK OK [15:36:13] RECOVERY - Disk space on mw1046 is OK: DISK OK [15:36:13] PROBLEM - Disk space on mw1096 is CRITICAL: Connection refused by host [15:36:13] PROBLEM - RAID on mw1096 is CRITICAL: Connection refused by host [15:36:22] PROBLEM - Disk space on mw1025 is CRITICAL: Connection refused by host [15:36:22] RECOVERY - RAID on mw42 is OK: OK: no RAID installed [15:36:22] RECOVERY - DPKG on mw4 is OK: All packages OK [15:36:22] RECOVERY - Disk space on mw1009 is OK: DISK OK [15:36:22] RECOVERY - RAID on searchidx2 is OK: OK: State is Optimal, checked 4 logical device(s) [15:36:32] PROBLEM - Disk space on mw1035 is CRITICAL: Connection refused by host [15:36:42] RECOVERY - Disk space on mw1023 is OK: DISK OK [15:36:52] RECOVERY - RAID on srv263 is OK: OK: no RAID installed [15:37:02] RECOVERY - DPKG on searchidx2 is OK: All packages OK [15:37:13] RECOVERY - mobile traffic loggers on cp1042 is OK: PROCS OK: 2 processes with command name varnishncsa [15:37:42] RECOVERY - RAID on srv258 is OK: OK: no RAID installed [15:37:52] RECOVERY - DPKG on mw1111 is OK: All packages OK [15:38:21] RECOVERY - Disk space on mw1053 is OK: DISK OK [15:38:21] RECOVERY - Disk space on mw4 is OK: DISK OK [15:38:51] PROBLEM - RAID on ms1004 is CRITICAL: Connection refused by host [15:38:51] PROBLEM - Disk space on mw1029 is CRITICAL: Connection refused by host [15:38:51] PROBLEM - Disk space on mw1032 is CRITICAL: Connection refused by host [15:38:51] PROBLEM - Disk space on mw1047 is CRITICAL: Connection refused by host [15:39:21] PROBLEM - Disk space on sodium is CRITICAL: DISK CRITICAL - /var/spool/exim4/db is not accessible: Permission denied [15:39:41] RECOVERY - Disk space on mw18 is OK: DISK OK [15:40:02] PROBLEM - Disk space on mw1070 is CRITICAL: Connection refused by host [15:40:31] RECOVERY - DPKG on mw1133 is OK: All packages OK [15:40:31] RECOVERY - DPKG on mw1143 is OK: All packages OK [15:41:01] RECOVERY - Disk space on storage3 is OK: DISK OK [15:41:01] RECOVERY - Disk space on searchidx2 is OK: DISK OK [15:41:12] PROBLEM - Disk space on mw1077 is CRITICAL: Connection refused by host [15:41:31] New patchset: Mark Bergsma; "Add rewrite rules for eqiad and esams" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1864 [15:41:41] PROBLEM - Disk space on mw1068 is CRITICAL: Connection refused by host [15:41:46] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1864 [15:41:51] RECOVERY - DPKG on db47 is OK: All packages OK [15:42:01] PROBLEM - RAID on mw1032 is CRITICAL: Connection refused by host [15:42:01] RECOVERY - RAID on mw1008 is OK: OK: no RAID installed [15:42:01] RECOVERY - RAID on mw1009 is OK: OK: no RAID installed [15:42:04] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1864 [15:42:04] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1864 [15:42:11] PROBLEM - RAID on mw1068 is CRITICAL: Connection refused by host [15:42:11] PROBLEM - RAID on mw1070 is CRITICAL: Connection refused by host [15:42:11] RECOVERY - RAID on mw1046 is OK: OK: no RAID installed [15:42:11] RECOVERY - RAID on mw1053 is OK: OK: no RAID installed [15:42:11] RECOVERY - RAID on mw1096 is OK: OK: no RAID installed [15:42:21] RECOVERY - Disk space on mw1133 is OK: DISK OK [15:42:21] RECOVERY - Disk space on mw1143 is OK: DISK OK [15:43:11] RECOVERY - Disk space on srv263 is OK: DISK OK [15:43:11] RECOVERY - DPKG on srv254 is OK: All packages OK [15:43:11] RECOVERY - DPKG on srv258 is OK: All packages OK [15:43:11] RECOVERY - RAID on storage3 is OK: OK: State is Optimal, checked 14 logical device(s) [15:43:21] RECOVERY - DPKG on srv263 is OK: All packages OK [15:43:31] RECOVERY - Disk space on cp1043 is OK: DISK OK [15:43:31] RECOVERY - DPKG on srv238 is OK: All packages OK [15:44:01] RECOVERY - DPKG on cp1043 is OK: All packages OK [15:46:01] PROBLEM - DPKG on mw1047 is CRITICAL: Connection refused by host [15:46:01] PROBLEM - DPKG on mw1051 is CRITICAL: Connection refused by host [15:46:01] PROBLEM - DPKG on mw1057 is CRITICAL: Connection refused by host [15:46:01] RECOVERY - DPKG on mw1035 is OK: All packages OK [15:46:05] New patchset: Jgreen; "adding one-size-fits-all offhost_backups script for aluminium, grosley, storage3" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1865 [15:46:11] RECOVERY - RAID on mw1107 is OK: OK: no RAID installed [15:47:01] PROBLEM - DPKG on mw1070 is CRITICAL: Connection refused by host [15:47:21] RECOVERY - RAID on cp1043 is OK: OK: Active: 4, Working: 4, Failed: 0, Spare: 0 [15:47:31] RECOVERY - mobile traffic loggers on cp1043 is OK: PROCS OK: 2 processes with command name varnishncsa [15:47:41] RECOVERY - DPKG on mw1096 is OK: All packages OK [15:48:01] PROBLEM - Disk space on db44 is CRITICAL: Connection refused by host [15:48:11] PROBLEM - DPKG on mw1077 is CRITICAL: Connection refused by host [15:48:21] RECOVERY - Disk space on mw1025 is OK: DISK OK [15:48:41] PROBLEM - Disk space on mw1051 is CRITICAL: Connection refused by host [15:48:41] RECOVERY - Disk space on mw1096 is OK: DISK OK [15:48:51] PROBLEM - Disk space on mw1044 is CRITICAL: Connection refused by host [15:48:51] RECOVERY - DPKG on mw1025 is OK: All packages OK [15:49:01] PROBLEM - Disk space on mw1042 is CRITICAL: Connection refused by host [15:49:01] RECOVERY - Disk space on mw1035 is OK: DISK OK [15:49:21] PROBLEM - Disk space on mw1093 is CRITICAL: Connection refused by host [15:49:31] PROBLEM - RAID on srv269 is CRITICAL: Connection refused by host [15:49:31] PROBLEM - Disk space on mw1071 is CRITICAL: Connection refused by host [15:49:31] PROBLEM - DPKG on srv288 is CRITICAL: Connection refused by host [15:49:51] PROBLEM - Disk space on mw1057 is CRITICAL: Connection refused by host [15:49:51] PROBLEM - Disk space on mw1090 is CRITICAL: Connection refused by host [15:49:51] RECOVERY - Disk space on mw1070 is OK: DISK OK [15:50:01] PROBLEM - Disk space on mw1064 is CRITICAL: Connection refused by host [15:50:21] PROBLEM - DPKG on mw1134 is CRITICAL: Connection refused by host [15:50:21] PROBLEM - DPKG on mw1148 is CRITICAL: Connection refused by host [15:51:01] PROBLEM - Disk space on mw1069 is CRITICAL: Connection refused by host [15:51:11] RECOVERY - DPKG on mw1107 is OK: All packages OK [15:51:31] RECOVERY - Disk space on mw1068 is OK: DISK OK [15:51:51] PROBLEM - DPKG on ms1004 is CRITICAL: Connection refused by host [15:51:51] PROBLEM - RAID on mw1029 is CRITICAL: Connection refused by host [15:51:51] RECOVERY - RAID on mw1025 is OK: OK: no RAID installed [15:52:01] PROBLEM - MySQL disk space on es1001 is CRITICAL: Connection refused by host [15:52:01] PROBLEM - RAID on mw1071 is CRITICAL: Connection refused by host [15:52:01] PROBLEM - RAID on mw1069 is CRITICAL: Connection refused by host [15:52:01] PROBLEM - RAID on mw1090 is CRITICAL: Connection refused by host [15:52:01] PROBLEM - RAID on mw1077 is CRITICAL: Connection refused by host [15:52:01] PROBLEM - RAID on mw1093 is CRITICAL: Connection refused by host [15:52:01] RECOVERY - RAID on mw1068 is OK: OK: no RAID installed [15:52:02] RECOVERY - RAID on mw1070 is OK: OK: no RAID installed [15:55:04] New patchset: Hashar; "testswarm: update fetcher to r108075" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1866 [15:55:19] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1866 [15:55:41] PROBLEM - DPKG on db44 is CRITICAL: Connection refused by host [15:55:51] PROBLEM - DPKG on mw1029 is CRITICAL: Connection refused by host [15:55:51] PROBLEM - DPKG on mw1042 is CRITICAL: Connection refused by host [15:55:51] PROBLEM - DPKG on mw1044 is CRITICAL: Connection refused by host [15:55:51] RECOVERY - DPKG on mw1047 is OK: All packages OK [15:55:51] RECOVERY - DPKG on mw1051 is OK: All packages OK [15:56:01] RECOVERY - DPKG on mw1057 is OK: All packages OK [15:56:51] RECOVERY - DPKG on mw1070 is OK: All packages OK [15:56:58] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1865 [15:56:59] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1865 [15:57:01] PROBLEM - Disk space on srv275 is CRITICAL: Connection refused by host [15:57:21] PROBLEM - DPKG on mw1071 is CRITICAL: Connection refused by host [15:57:21] PROBLEM - DPKG on mw1093 is CRITICAL: Connection refused by host [15:57:41] PROBLEM - DPKG on mw1090 is CRITICAL: Connection refused by host [15:57:41] PROBLEM - RAID on mw1158 is CRITICAL: Connection refused by host [15:57:51] PROBLEM - DPKG on mw1086 is CRITICAL: Connection refused by host [15:58:21] PROBLEM - Disk space on mw1012 is CRITICAL: Connection refused by host [15:58:21] RECOVERY - Disk space on mw1032 is OK: DISK OK [15:58:21] RECOVERY - Disk space on mw1029 is OK: DISK OK [15:58:21] RECOVERY - RAID on ms1004 is OK: OK: State is Optimal, checked 2 logical device(s) [15:58:31] PROBLEM - Disk space on mw1001 is CRITICAL: Connection refused by host [15:58:31] RECOVERY - DPKG on mw1077 is OK: All packages OK [15:58:31] RECOVERY - Disk space on mw1051 is OK: DISK OK [15:58:41] RECOVERY - Disk space on mw1047 is OK: DISK OK [15:59:02] PROBLEM - RAID on srv201 is CRITICAL: Connection refused by host [15:59:11] RECOVERY - Disk space on mw1093 is OK: DISK OK [15:59:21] PROBLEM - RAID on mw19 is CRITICAL: Connection refused by host [15:59:21] RECOVERY - Disk space on mw1071 is OK: DISK OK [15:59:41] PROBLEM - DPKG on mw74 is CRITICAL: Connection refused by host [15:59:41] PROBLEM - Disk space on mw1075 is CRITICAL: Connection refused by host [15:59:41] PROBLEM - RAID on es1001 is CRITICAL: Connection refused by host [15:59:41] RECOVERY - Disk space on mw1057 is OK: DISK OK [15:59:41] RECOVERY - Disk space on mw1090 is OK: DISK OK [16:00:11] PROBLEM - DPKG on mw1147 is CRITICAL: Connection refused by host [16:00:11] PROBLEM - DPKG on mw1158 is CRITICAL: Connection refused by host [16:00:12] RECOVERY - DPKG on mw1159 is OK: All packages OK [16:00:31] PROBLEM - Disk space on mw1086 is CRITICAL: Connection refused by host [16:00:51] RECOVERY - Disk space on mw1069 is OK: DISK OK [16:01:01] PROBLEM - DPKG on es1001 is CRITICAL: Connection refused by host [16:01:01] PROBLEM - MySQL disk space on db44 is CRITICAL: Connection refused by host [16:01:01] RECOVERY - Disk space on mw1077 is OK: DISK OK [16:01:11] PROBLEM - DPKG on srv269 is CRITICAL: Connection refused by host [16:01:11] PROBLEM - Disk space on srv256 is CRITICAL: Connection refused by host [16:01:11] PROBLEM - DPKG on aluminium is CRITICAL: Connection refused by host [16:01:11] PROBLEM - RAID on aluminium is CRITICAL: Connection refused by host [16:01:31] PROBLEM - Disk space on mw74 is CRITICAL: Connection refused by host [16:01:41] PROBLEM - RAID on srv272 is CRITICAL: Connection refused by host [16:01:42] PROBLEM - RAID on mw1012 is CRITICAL: Connection refused by host [16:01:42] PROBLEM - RAID on mw1001 is CRITICAL: Connection refused by host [16:01:42] RECOVERY - DPKG on ms1004 is OK: All packages OK [16:01:42] RECOVERY - RAID on mw1032 is OK: OK: no RAID installed [16:01:42] RECOVERY - RAID on mw1029 is OK: OK: no RAID installed [16:01:51] PROBLEM - DPKG on es1002 is CRITICAL: Connection refused by host [16:01:51] PROBLEM - RAID on mw1064 is CRITICAL: Connection refused by host [16:01:51] PROBLEM - RAID on mw1086 is CRITICAL: Connection refused by host [16:01:51] PROBLEM - RAID on mw1075 is CRITICAL: Connection refused by host [16:01:51] RECOVERY - RAID on mw1071 is OK: OK: no RAID installed [16:01:51] RECOVERY - RAID on mw1069 is OK: OK: no RAID installed [16:01:51] RECOVERY - RAID on mw1077 is OK: OK: no RAID installed [16:01:52] RECOVERY - RAID on mw1090 is OK: OK: no RAID installed [16:01:52] RECOVERY - RAID on mw1093 is OK: OK: no RAID installed [16:02:01] PROBLEM - Disk space on mw1134 is CRITICAL: Connection refused by host [16:02:01] PROBLEM - Disk space on mw1148 is CRITICAL: Connection refused by host [16:02:01] PROBLEM - Disk space on mw1147 is CRITICAL: Connection refused by host [16:02:11] RECOVERY - Disk space on mw1159 is OK: DISK OK [16:02:21] PROBLEM - Disk space on mw1158 is CRITICAL: Connection refused by host [16:02:42] PROBLEM - RAID on srv202 is CRITICAL: Connection refused by host [16:02:42] PROBLEM - Disk space on srv201 is CRITICAL: Connection refused by host [16:02:51] PROBLEM - Disk space on srv236 is CRITICAL: Connection refused by host [16:02:51] PROBLEM - RAID on srv256 is CRITICAL: Connection refused by host [16:03:01] PROBLEM - RAID on srv270 is CRITICAL: Connection refused by host [16:03:02] PROBLEM - RAID on srv275 is CRITICAL: Connection refused by host [16:03:02] PROBLEM - DPKG on srv272 is CRITICAL: Connection refused by host [16:03:12] PROBLEM - RAID on bast1001 is CRITICAL: Connection refused by host [16:03:12] PROBLEM - Disk space on es1001 is CRITICAL: Connection refused by host [16:03:31] PROBLEM - Disk space on es1002 is CRITICAL: Connection refused by host [16:03:31] PROBLEM - RAID on es1002 is CRITICAL: Connection refused by host [16:04:12] PROBLEM - Disk space on srv288 is CRITICAL: Connection refused by host [16:04:51] PROBLEM - DPKG on srv270 is CRITICAL: Connection refused by host [16:04:51] PROBLEM - Disk space on srv272 is CRITICAL: Connection refused by host [16:05:11] PROBLEM - DPKG on srv275 is CRITICAL: Connection refused by host [16:05:31] RECOVERY - DPKG on db44 is OK: All packages OK [16:05:39] Change abandoned: Hashar; "wrong change :b" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1866 [16:05:41] RECOVERY - DPKG on mw1029 is OK: All packages OK [16:05:41] RECOVERY - DPKG on mw1042 is OK: All packages OK [16:05:41] RECOVERY - DPKG on mw1044 is OK: All packages OK [16:05:51] PROBLEM - DPKG on mw1064 is CRITICAL: Connection refused by host [16:06:51] PROBLEM - Disk space on srv270 is CRITICAL: Connection refused by host [16:06:51] RECOVERY - Disk space on srv275 is OK: DISK OK [16:07:01] PROBLEM - RAID on srv288 is CRITICAL: Connection refused by host [16:07:01] PROBLEM - DPKG on bast1001 is CRITICAL: Connection refused by host [16:07:11] RECOVERY - DPKG on mw1071 is OK: All packages OK [16:07:11] RECOVERY - DPKG on mw1093 is OK: All packages OK [16:07:21] PROBLEM - MySQL disk space on es1002 is CRITICAL: Connection refused by host [16:07:31] RECOVERY - DPKG on mw1090 is OK: All packages OK [16:07:31] RECOVERY - RAID on mw1158 is OK: OK: no RAID installed [16:07:41] RECOVERY - Disk space on db44 is OK: DISK OK [16:07:41] RECOVERY - DPKG on mw1086 is OK: All packages OK [16:07:51] PROBLEM - RAID on mw1147 is CRITICAL: Connection refused by host [16:07:51] PROBLEM - DPKG on mw1075 is CRITICAL: Connection refused by host [16:07:51] PROBLEM - DPKG on mw1012 is CRITICAL: Connection refused by host [16:07:51] PROBLEM - DPKG on mw1001 is CRITICAL: Connection refused by host [16:07:51] RECOVERY - RAID on mw1159 is OK: OK: no RAID installed [16:08:02] PROBLEM - RAID on mw1134 is CRITICAL: Connection refused by host [16:08:21] RECOVERY - Disk space on mw1001 is OK: DISK OK [16:08:32] RECOVERY - Disk space on mw1044 is OK: DISK OK [16:08:41] RECOVERY - Disk space on mw1042 is OK: DISK OK [16:08:51] RECOVERY - RAID on srv201 is OK: OK: no RAID installed [16:09:01] RECOVERY - RAID on mw19 is OK: OK: no RAID installed [16:09:22] PROBLEM - Disk space on bast1001 is CRITICAL: Connection refused by host [16:09:22] RECOVERY - RAID on srv269 is OK: OK: no RAID installed [16:09:22] RECOVERY - DPKG on srv288 is OK: All packages OK [16:09:32] RECOVERY - RAID on es1001 is OK: OK: State is Optimal, checked 2 logical device(s) [16:09:42] RECOVERY - Disk space on mw1064 is OK: DISK OK [16:10:01] RECOVERY - DPKG on mw74 is OK: All packages OK [16:10:01] RECOVERY - DPKG on mw1134 is OK: All packages OK [16:10:01] RECOVERY - DPKG on mw1148 is OK: All packages OK [16:10:01] RECOVERY - DPKG on mw1147 is OK: All packages OK [16:10:01] RECOVERY - DPKG on mw1158 is OK: All packages OK [16:10:21] RECOVERY - Disk space on mw1086 is OK: DISK OK [16:10:51] RECOVERY - MySQL disk space on db44 is OK: DISK OK [16:10:51] RECOVERY - DPKG on es1001 is OK: All packages OK [16:11:01] RECOVERY - DPKG on srv269 is OK: All packages OK [16:11:01] RECOVERY - Disk space on srv256 is OK: DISK OK [16:11:01] RECOVERY - DPKG on aluminium is OK: All packages OK [16:11:01] RECOVERY - RAID on aluminium is OK: OK: Active: 2, Working: 2, Failed: 0, Spare: 0 [16:11:21] RECOVERY - Disk space on mw74 is OK: DISK OK [16:11:31] RECOVERY - RAID on srv272 is OK: OK: no RAID installed [16:11:31] RECOVERY - RAID on mw1012 is OK: OK: no RAID installed [16:11:31] RECOVERY - RAID on mw1001 is OK: OK: no RAID installed [16:11:41] RECOVERY - MySQL disk space on es1001 is OK: DISK OK [16:11:41] RECOVERY - RAID on mw1064 is OK: OK: no RAID installed [16:11:41] RECOVERY - DPKG on es1002 is OK: All packages OK [16:11:41] RECOVERY - RAID on mw1075 is OK: OK: no RAID installed [16:11:41] RECOVERY - RAID on mw1086 is OK: OK: no RAID installed [16:11:51] RECOVERY - Disk space on mw1134 is OK: DISK OK [16:11:51] RECOVERY - Disk space on mw1148 is OK: DISK OK [16:11:51] RECOVERY - Disk space on mw1147 is OK: DISK OK [16:12:11] RECOVERY - Disk space on mw1158 is OK: DISK OK [16:12:31] RECOVERY - Disk space on srv201 is OK: DISK OK [16:12:31] RECOVERY - RAID on srv202 is OK: OK: no RAID installed [16:12:41] RECOVERY - Disk space on srv236 is OK: DISK OK [16:12:41] RECOVERY - RAID on srv256 is OK: OK: no RAID installed [16:12:51] RECOVERY - RAID on srv270 is OK: OK: no RAID installed [16:12:51] RECOVERY - RAID on srv275 is OK: OK: no RAID installed [16:12:51] RECOVERY - DPKG on srv272 is OK: All packages OK [16:13:01] RECOVERY - RAID on bast1001 is OK: OK: no RAID installed [16:13:01] RECOVERY - Disk space on es1001 is OK: DISK OK [16:13:21] RECOVERY - Disk space on es1002 is OK: DISK OK [16:13:21] RECOVERY - RAID on es1002 is OK: OK: State is Optimal, checked 2 logical device(s) [16:14:01] RECOVERY - Disk space on srv288 is OK: DISK OK [16:14:41] RECOVERY - DPKG on srv270 is OK: All packages OK [16:14:41] RECOVERY - Disk space on srv272 is OK: DISK OK [16:15:01] RECOVERY - DPKG on srv275 is OK: All packages OK [16:15:41] RECOVERY - DPKG on mw1064 is OK: All packages OK [16:16:42] RECOVERY - Disk space on srv270 is OK: DISK OK [16:16:51] RECOVERY - RAID on srv288 is OK: OK: no RAID installed [16:16:51] RECOVERY - DPKG on bast1001 is OK: All packages OK [16:17:12] RECOVERY - MySQL disk space on es1002 is OK: DISK OK [16:17:41] RECOVERY - DPKG on mw1012 is OK: All packages OK [16:17:41] RECOVERY - DPKG on mw1001 is OK: All packages OK [16:18:34] RECOVERY - RAID on mw1134 is OK: OK: no RAID installed [16:18:34] RECOVERY - RAID on mw1147 is OK: OK: no RAID installed [16:19:04] RECOVERY - Disk space on mw1012 is OK: DISK OK [16:20:14] RECOVERY - Disk space on mw1075 is OK: DISK OK [16:21:04] RECOVERY - Disk space on bast1001 is OK: DISK OK [16:25:55] New patchset: Hashar; "testswarm: update fetcher to r108075" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1867 [16:26:12] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1867 [16:26:24] RECOVERY - DPKG on mw1075 is OK: All packages OK [16:28:17] New patchset: Jgreen; "adding root@grosley's key to logmover authorized_keys" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1868 [16:28:32] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1868 [16:28:50] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1868 [16:28:50] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1868 [16:31:53] New patchset: Jgreen; "removing stale root@grosley ssh key" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1869 [16:32:08] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1869 [16:32:34] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1869 [16:32:34] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1869 [16:33:13] New review: Dzahn; "Change made by Timo & reviewed by hashar in CodeReview." [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1867 [16:33:14] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1867 [16:38:28] New review: Dzahn; "careful, you need to make sure a key is defined as absent to make sure it's gone. just removing it h..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1869 [16:52:29] New patchset: Pyoungmeister; "commenting out sodium preseed for manual install" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1870 [16:52:43] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1870 [16:53:54] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1870 [16:53:55] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1870 [16:55:53] New patchset: Hashar; "testswarm: minor fix following change r1867" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1871 [16:57:38] New review: Hashar; "SVN changes:" [operations/puppet] (production) C: 0; - https://gerrit.wikimedia.org/r/1871 [16:58:38] New patchset: Jgreen; "offhost backups crons, adjustments" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1872 [16:58:53] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1872 [16:59:23] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 0; - https://gerrit.wikimedia.org/r/1872 [16:59:39] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1872 [16:59:39] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1872 [17:01:15] New review: Demon; "I'm wondering if we should make an integration/testswarm repo (like we did with integration/jenkins)..." [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/1871 [17:07:27] New review: Dzahn; "catching exceptions is a good thing and cosmetic fix, sure" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1871 [17:07:28] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1871 [17:16:22] PROBLEM - mailman on sodium is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [17:30:02] PROBLEM - spamassassin on sodium is CRITICAL: Connection refused by host [17:30:02] PROBLEM - HTTPS on sodium is CRITICAL: Connection refused [17:32:32] PROBLEM - HTTP on sodium is CRITICAL: Connection refused [17:32:43] PROBLEM - DPKG on sodium is CRITICAL: Connection refused by host [17:32:52] PROBLEM - RAID on sodium is CRITICAL: Connection refused by host [17:56:51] RECOVERY - HTTP on sodium is OK: HTTP OK HTTP/1.1 200 OK - 452 bytes in 0.054 seconds [18:02:28] New patchset: Mark Bergsma; "Convert spamassassin's local.cf to a template" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1873 [18:02:43] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1873 [18:04:46] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1873 [18:04:47] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1873 [18:08:36] New patchset: Mark Bergsma; "include network::constants" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1874 [18:08:51] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1874 [18:08:58] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1874 [18:08:58] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1874 [18:14:41] RECOVERY - mailman on sodium is OK: PROCS OK: 10 processes with args mailman [18:16:21] RECOVERY - spamassassin on sodium is OK: PROCS OK: 4 processes with args spamd [18:20:31] RECOVERY - RAID on sodium is OK: OK: Active: 4, Working: 4, Failed: 0, Spare: 0 [18:20:41] RECOVERY - DPKG on sodium is OK: All packages OK [18:21:04] New patchset: Mark Bergsma; "require lighttpd to be installed before mailman" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1875 [18:21:19] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1875 [18:22:03] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1875 [18:22:04] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1875 [18:25:06] New review: Demon; "Shun the nonbeliever!" [test/mediawiki/core] (master); V: 0 C: -1; - https://gerrit.wikimedia.org/r/1841 [19:13:04] New patchset: Asher; "remove duplicate file definition" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1876 [19:13:21] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1876 [19:14:13] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1876 [19:14:13] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1876 [19:15:46] RECOVERY - Puppet freshness on db1018 is OK: puppet ran at Thu Jan 12 19:15:21 UTC 2012 [19:16:47] RECOVERY - Puppet freshness on db1011 is OK: puppet ran at Thu Jan 12 19:16:39 UTC 2012 [19:16:47] RECOVERY - Puppet freshness on db1040 is OK: puppet ran at Thu Jan 12 19:16:43 UTC 2012 [19:18:16] RECOVERY - Puppet freshness on db1019 is OK: puppet ran at Thu Jan 12 19:17:53 UTC 2012 [19:19:16] PROBLEM - Puppet freshness on db22 is CRITICAL: Puppet has not run in the last 10 hours [19:19:46] RECOVERY - Puppet freshness on db1020 is OK: puppet ran at Thu Jan 12 19:19:42 UTC 2012 [19:20:16] RECOVERY - Puppet freshness on db1029 is OK: puppet ran at Thu Jan 12 19:19:56 UTC 2012 [19:20:16] RECOVERY - Puppet freshness on db1034 is OK: puppet ran at Thu Jan 12 19:20:00 UTC 2012 [19:22:46] RECOVERY - Puppet freshness on db1042 is OK: puppet ran at Thu Jan 12 19:22:21 UTC 2012 [19:23:17] RECOVERY - Puppet freshness on db1027 is OK: puppet ran at Thu Jan 12 19:22:52 UTC 2012 [19:23:17] RECOVERY - Puppet freshness on db1010 is OK: puppet ran at Thu Jan 12 19:23:09 UTC 2012 [19:23:46] RECOVERY - Puppet freshness on db1021 is OK: puppet ran at Thu Jan 12 19:23:18 UTC 2012 [19:23:46] RECOVERY - Puppet freshness on db1002 is OK: puppet ran at Thu Jan 12 19:23:34 UTC 2012 [19:24:17] RECOVERY - Puppet freshness on db1015 is OK: puppet ran at Thu Jan 12 19:24:07 UTC 2012 [19:24:46] RECOVERY - Puppet freshness on db1012 is OK: puppet ran at Thu Jan 12 19:24:36 UTC 2012 [19:25:06] New patchset: Asher; "fix password scope" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1877 [19:25:16] RECOVERY - Puppet freshness on db1047 is OK: puppet ran at Thu Jan 12 19:25:09 UTC 2012 [19:25:21] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1877 [19:25:52] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1877 [19:25:52] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1877 [19:26:22] RECOVERY - Puppet freshness on db1046 is OK: puppet ran at Thu Jan 12 19:25:52 UTC 2012 [19:27:22] RECOVERY - Puppet freshness on db1016 is OK: puppet ran at Thu Jan 12 19:27:19 UTC 2012 [19:27:23] RECOVERY - Puppet freshness on db1028 is OK: puppet ran at Thu Jan 12 19:27:20 UTC 2012 [19:27:52] RECOVERY - Puppet freshness on db1001 is OK: puppet ran at Thu Jan 12 19:27:39 UTC 2012 [19:28:22] RECOVERY - Puppet freshness on db1041 is OK: puppet ran at Thu Jan 12 19:28:06 UTC 2012 [19:28:52] RECOVERY - Puppet freshness on db1033 is OK: puppet ran at Thu Jan 12 19:28:23 UTC 2012 [19:28:52] RECOVERY - Puppet freshness on db1009 is OK: puppet ran at Thu Jan 12 19:28:24 UTC 2012 [19:29:52] RECOVERY - Puppet freshness on db1026 is OK: puppet ran at Thu Jan 12 19:29:36 UTC 2012 [19:29:52] RECOVERY - Puppet freshness on db1031 is OK: puppet ran at Thu Jan 12 19:29:52 UTC 2012 [19:30:52] RECOVERY - Puppet freshness on db1048 is OK: puppet ran at Thu Jan 12 19:30:23 UTC 2012 [19:31:43] New patchset: Asher; "and include the pw class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1878 [19:31:52] RECOVERY - Puppet freshness on db1013 is OK: puppet ran at Thu Jan 12 19:31:44 UTC 2012 [19:31:59] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1878 [19:32:52] RECOVERY - Puppet freshness on db1006 is OK: puppet ran at Thu Jan 12 19:32:24 UTC 2012 [19:33:22] RECOVERY - Puppet freshness on db1038 is OK: puppet ran at Thu Jan 12 19:33:11 UTC 2012 [19:33:51] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1878 [19:33:51] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1878 [19:34:52] RECOVERY - Puppet freshness on db1043 is OK: puppet ran at Thu Jan 12 19:34:33 UTC 2012 [19:36:22] RECOVERY - Puppet freshness on db1003 is OK: puppet ran at Thu Jan 12 19:35:56 UTC 2012 [19:36:52] RECOVERY - Puppet freshness on db1025 is OK: puppet ran at Thu Jan 12 19:36:33 UTC 2012 [19:36:53] RECOVERY - Puppet freshness on db1044 is OK: puppet ran at Thu Jan 12 19:36:43 UTC 2012 [19:37:52] RECOVERY - Puppet freshness on db1004 is OK: puppet ran at Thu Jan 12 19:37:28 UTC 2012 [19:37:52] RECOVERY - Puppet freshness on db1008 is OK: puppet ran at Thu Jan 12 19:37:36 UTC 2012 [19:41:22] RECOVERY - Puppet freshness on db1022 is OK: puppet ran at Thu Jan 12 19:41:03 UTC 2012 [19:41:22] RECOVERY - Puppet freshness on db1005 is OK: puppet ran at Thu Jan 12 19:41:10 UTC 2012 [19:41:22] RECOVERY - Puppet freshness on db1045 is OK: puppet ran at Thu Jan 12 19:41:12 UTC 2012 [19:41:22] RECOVERY - Puppet freshness on db1017 is OK: puppet ran at Thu Jan 12 19:41:21 UTC 2012 [19:42:22] RECOVERY - Puppet freshness on db1039 is OK: puppet ran at Thu Jan 12 19:42:00 UTC 2012 [19:42:23] RECOVERY - Puppet freshness on db1014 is OK: puppet ran at Thu Jan 12 19:42:13 UTC 2012 [19:42:23] RECOVERY - Puppet freshness on db1030 is OK: puppet ran at Thu Jan 12 19:42:18 UTC 2012 [19:44:52] RECOVERY - Puppet freshness on db1024 is OK: puppet ran at Thu Jan 12 19:44:33 UTC 2012 [19:45:22] RECOVERY - Puppet freshness on db1035 is OK: puppet ran at Thu Jan 12 19:44:56 UTC 2012 [19:45:23] RECOVERY - Puppet freshness on db1007 is OK: puppet ran at Thu Jan 12 19:45:05 UTC 2012 [19:51:08] New patchset: Pyoungmeister; "giving diedrik access to various boxes a la rt 2256" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1879 [19:51:59] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1879 [19:52:40] New patchset: Asher; "username switch for these is -l, not -u" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1880 [19:53:47] Change abandoned: Pyoungmeister; "wrong branch" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1879 [19:54:56] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1880 [19:54:56] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1880 [19:55:42] New patchset: Pyoungmeister; "adding shell access for diedrik rt 2256" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1881 [19:56:01] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1881 [19:56:18] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1881 [19:56:19] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1881 [19:57:27] New patchset: Ryan Lane; "test" [operations/software] (master) - https://gerrit.wikimedia.org/r/1882 [19:58:51] New review: Bhartshorne; "(no comment)" [operations/software] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1882 [19:58:51] Change merged: Bhartshorne; [operations/software] (master) - https://gerrit.wikimedia.org/r/1882 [20:07:31] !log reedy synchronized php-1.18/includes/specials/SpecialSearch.php 'r108751' [20:07:33] Logged the message, Master [20:12:23] New patchset: Bhartshorne; "another test" [operations/software] (master) - https://gerrit.wikimedia.org/r/1883 [20:12:24] New review: gerrit2; "Lint check passed." [operations/software] (master); V: 1 - https://gerrit.wikimedia.org/r/1883 [20:15:31] New patchset: Bhartshorne; "removing test file" [operations/software] (master) - https://gerrit.wikimedia.org/r/1884 [20:15:32] New review: gerrit2; "Lint check passed." [operations/software] (master); V: 1 - https://gerrit.wikimedia.org/r/1884 [20:15:58] New review: Bhartshorne; "(no comment)" [operations/software] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1883 [20:15:58] Change merged: Bhartshorne; [operations/software] (master) - https://gerrit.wikimedia.org/r/1883 [20:16:14] New review: Bhartshorne; "(no comment)" [operations/software] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1884 [20:16:15] Change merged: Bhartshorne; [operations/software] (master) - https://gerrit.wikimedia.org/r/1884 [20:21:49] PROBLEM - Host srv187 is DOWN: PING CRITICAL - Packet loss = 100% [20:21:59] PROBLEM - Host srv189 is DOWN: PING CRITICAL - Packet loss = 100% [20:21:59] PROBLEM - Host srv188 is DOWN: PING CRITICAL - Packet loss = 100% [20:24:33] New patchset: Bhartshorne; "initial import of geturls" [operations/software] (master) - https://gerrit.wikimedia.org/r/1885 [20:26:13] New review: Bhartshorne; "(no comment)" [operations/software] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1885 [20:26:13] Change merged: Bhartshorne; [operations/software] (master) - https://gerrit.wikimedia.org/r/1885 [20:35:19] RECOVERY - Host srv189 is UP: PING OK - Packet loss = 0%, RTA = 0.24 ms [20:35:29] RECOVERY - Host srv188 is UP: PING OK - Packet loss = 0%, RTA = 0.39 ms [20:37:39] RECOVERY - Host srv187 is UP: PING OK - Packet loss = 0%, RTA = 0.44 ms [20:51:51] New patchset: Asher; "install percona-toolkit on hardy db's" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1886 [20:52:06] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1886 [20:52:15] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1886 [20:52:16] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1886 [20:54:59] RECOVERY - Puppet freshness on ms1002 is OK: puppet ran at Thu Jan 12 20:54:29 UTC 2012 [20:59:55] New patchset: Asher; "install percona nagios scripts on all dbs" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1887 [21:00:12] New patchset: Jgreen; "adding khorn shell access to storage3" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1888 [21:00:26] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1888 [21:00:34] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1887 [21:00:34] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1887 [21:00:49] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1888 [21:00:50] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1888 [21:08:27] !log renaming mobile1 to virt0 [21:08:29] Logged the message, Master [21:11:09] PROBLEM - Host mobile1 is DOWN: PING CRITICAL - Packet loss = 100% [21:11:20] !log rebuilding mobile1 as virt0 [21:11:22] Logged the message, Master [21:15:43] New patchset: Ryan Lane; "Decommissioning mobile1" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1889 [21:15:58] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1889 [21:16:47] New review: Ryan Lane; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1889 [21:16:47] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1889 [21:43:10] !log Adding back mgmt info for mobile1, changing mobile2 to virt0 [21:43:12] Logged the message, Master [21:43:21] !log rebuilding mobile2 as virt0 [21:43:22] Logged the message, Master [21:50:59] PROBLEM - RAID on ms1002 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [21:54:09] !log moved new virt0 from squid vlan to public-services2 [21:54:11] Logged the message, Master [21:54:17] !log relabeled port at virt0 [21:54:18] Logged the message, Master [22:06:44] PROBLEM - Host mobile2 is DOWN: PING CRITICAL - Packet loss = 100% [22:23:47] New patchset: Lcarr; "putting ganglia1001 in puppet" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1890 [22:24:02] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1890 [22:24:21] New review: Lcarr; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1890 [22:24:22] Change merged: Lcarr; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1890 [22:31:31] We're starting office hours with Sue Gardner in a moment, in #wikimedia-office, if anyone is interested. [22:31:52] topic? [22:31:55] rats [22:33:15] apergos: sopa, i think [22:33:28] yes, someone said in the channel :-) [23:01:39] New patchset: Asher; "mysql conf reorg, define clusters and masters for all prod clusters s1-7" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1891 [23:01:54] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1891 [23:05:45] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1891 [23:05:46] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1891 [23:09:39] New patchset: Asher; "Revert "mysql conf reorg, define clusters and masters for all prod clusters s1-7" - var scope issue" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1892 [23:09:53] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1892 [23:11:46] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1892 [23:11:47] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1892 [23:12:33] * binasher just had a "it would be good to use labs" moment  [23:24:58] New patchset: Asher; "mysql conf reorg, define clusters and masters for all prod clusters" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1893 [23:25:18] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1893 [23:36:18] siebrand: I replied to the translation tools workshop on the talk page, https://meta.wikimedia.org/wiki/Talk:Translation_tools_workshop,_2012 [23:52:54] New patchset: Asher; "enable and force ssl for graphite" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1894 [23:53:32] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1894 [23:54:22] Thehelpfulone: I have bad experiences with webex record. I'll try but not sure if that'll work... [23:54:57] heh [23:55:19] do you have any screen capture software perhaps as an alternative?