[00:00:49] RECOVERY - Puppet freshness on cp1022 is OK: puppet ran at Mon Oct 21 00:00:41 UTC 2013
[00:00:49] PROBLEM - Puppet freshness on cp1022 is CRITICAL: No successful Puppet run in the last 10 hours
[00:00:49] RECOVERY - Puppet freshness on cp1031 is OK: puppet ran at Mon Oct 21 00:00:46 UTC 2013
[00:00:59] RECOVERY - Apache HTTP on mw1109 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.062 second response time
[00:01:19] PROBLEM - Puppet freshness on cp1031 is CRITICAL: No successful Puppet run in the last 10 hours
[00:04:09] PROBLEM - MySQL Slave Delay on db53 is CRITICAL: CRIT replication delay 316 seconds
[00:05:49] RECOVERY - Puppet freshness on cp1028 is OK: puppet ran at Mon Oct 21 00:05:47 UTC 2013
[00:06:29] PROBLEM - Puppet freshness on cp1028 is CRITICAL: No successful Puppet run in the last 10 hours
[00:18:49] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 00:18:41 UTC 2013
[00:19:29] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours
[00:19:49] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 00:19:42 UTC 2013
[00:20:39] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours
[00:21:49] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 00:21:43 UTC 2013
[00:21:49] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 00:21:43 UTC 2013
[00:21:59] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours
[00:22:09] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours
[00:28:50] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 00:28:45 UTC 2013
[00:28:59] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours
[00:32:49] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 00:32:41 UTC 2013
[00:33:09] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours
[00:33:49] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 00:33:47 UTC 2013
[00:34:49] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours
[00:35:49] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 00:35:43 UTC 2013
[00:36:29] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours
[00:38:43] <^d> !log bringing gerrit down to troubleshoot replication
[00:39:01] Logged the message, Master
[00:39:26] i was just about to mention it was down :P
[00:41:49] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 00:41:45 UTC 2013
[00:41:59] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours
[00:42:49] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 00:42:41 UTC 2013
[00:43:23] should nagios be reporting that gerrit's down?
[00:43:47] <^d> There's some work-in-progress on icinga alerts.
[00:43:49] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours
[00:43:49] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 00:43:46 UTC 2013
[00:43:50] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours
[00:44:59] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 00:44:56 UTC 2013
[00:45:11] errr, yeah, sorry trademark gods
[00:45:59] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours
[00:46:20] <^d> Gee gerrit, why you have to be so stupid today?
[00:51:47] i wonder if watchmouse allows scheduling downtime
[00:51:56] (it did notice gerrit)
[00:54:15] <^d> Even if it did, this isn't scheduled.
[00:54:22] <^d> Nor do I have access to watchmouse.
[00:54:49] PROBLEM - SSH on lvs1001 is CRITICAL: Server answer:
[00:55:14] well i interpreted the !log as "intentional" even if not scheduled
[00:55:18] but whatever
[00:55:38] hey ^d
[00:55:49] RECOVERY - SSH on lvs1001 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0)
[00:55:50] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 00:55:48 UTC 2013
[00:55:55] so weird seeing aude in this TZ
[00:56:03] got to meet shawn pearce (gerrit developer) today :)
[00:56:14] at teh google?
[00:56:15] hi jeremyb :)
[00:56:17] yep
[00:56:19] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours
[00:56:22] what's doing there on a weekend?
[00:56:28] gsoc summit
[00:56:30] mentor summit
[00:56:42] aha
[00:56:59] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 00:56:53 UTC 2013
[00:57:49] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours
[01:00:59] RECOVERY - Puppet freshness on cp1022 is OK: puppet ran at Mon Oct 21 01:00:55 UTC 2013
[01:00:59] RECOVERY - Puppet freshness on cp1031 is OK: puppet ran at Mon Oct 21 01:00:55 UTC 2013
[01:01:19] PROBLEM - Puppet freshness on cp1031 is CRITICAL: No successful Puppet run in the last 10 hours
[01:01:49] PROBLEM - Puppet freshness on cp1022 is CRITICAL: No successful Puppet run in the last 10 hours
[01:05:50] RECOVERY - Puppet freshness on cp1028 is OK: puppet ran at Mon Oct 21 01:05:47 UTC 2013
[01:06:29] PROBLEM - Puppet freshness on cp1028 is CRITICAL: No successful Puppet run in the last 10 hours
[01:18:49] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 01:18:41 UTC 2013
[01:19:29] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours
[01:19:49] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 01:19:47 UTC 2013
[01:20:39] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours
[01:21:40] <^d> *sigh* This is not how I planned to spend my sunday evening.
[01:21:49] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 01:21:47 UTC 2013
[01:21:49] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 01:21:47 UTC 2013
[01:21:59] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours
[01:22:09] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours
[01:28:49] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 01:28:44 UTC 2013
[01:28:59] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours
[01:32:49] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 01:32:46 UTC 2013
[01:33:09] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours
[01:33:49] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 01:33:46 UTC 2013
[01:34:09] RECOVERY - MySQL Slave Delay on db53 is OK: OK replication delay 0 seconds
[01:34:49] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours
[01:35:49] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 01:35:41 UTC 2013
[01:36:29] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours
[01:41:49] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 01:41:42 UTC 2013
[01:41:59] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours
[01:42:39] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 01:42:38 UTC 2013
[01:42:49] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours
[01:43:49] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 01:43:43 UTC 2013
[01:43:50] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours
[01:44:49] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 01:44:43 UTC 2013
[01:44:59] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours
[01:55:59] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 01:55:58 UTC 2013
[01:56:19] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours
[01:56:49] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 01:56:44 UTC 2013
[01:57:49] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours
[02:00:49] RECOVERY - Puppet freshness on cp1031 is OK: puppet ran at Mon Oct 21 02:00:40 UTC 2013
[02:00:49] RECOVERY - Puppet freshness on cp1022 is OK: puppet ran at Mon Oct 21 02:00:45 UTC 2013
[02:01:19] PROBLEM - Puppet freshness on cp1031 is CRITICAL: No successful Puppet run in the last 10 hours
[02:01:49] PROBLEM - Puppet freshness on cp1022 is CRITICAL: No successful Puppet run in the last 10 hours
[02:05:49] RECOVERY - Puppet freshness on cp1028 is OK: puppet ran at Mon Oct 21 02:05:44 UTC 2013
[02:06:29] PROBLEM - Puppet freshness on cp1028 is CRITICAL: No successful Puppet run in the last 10 hours
[02:10:47] !log LocalisationUpdate completed (1.22wmf21) at Mon Oct 21 02:10:47 UTC 2013
[02:11:10] Logged the message, Master
[02:15:19] PROBLEM - Puppet freshness on copper is CRITICAL: No successful Puppet run in the last 10 hours
[02:18:49] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 02:18:44 UTC 2013
[02:19:25] !log LocalisationUpdate completed (1.22wmf22) at Mon Oct 21 02:19:25 UTC 2013
[02:19:29] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours
[02:19:38] Logged the message, Master
[02:19:59] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 02:19:49 UTC 2013
[02:20:39] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours
[02:22:09] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 02:22:04 UTC 2013
[02:22:19] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 02:22:10 UTC 2013
[02:22:59] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours
[02:23:09] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours
[02:28:49] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 02:28:46 UTC 2013
[02:28:59] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours
[02:32:49] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 02:32:44 UTC 2013
[02:33:09] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours
[02:33:59] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 02:33:49 UTC 2013
[02:34:49] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours
[02:35:49] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 02:35:40 UTC 2013
[02:36:29] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours
[02:39:00] !log LocalisationUpdate ResourceLoader cache refresh completed at Mon Oct 21 02:39:00 UTC 2013
[02:39:13] Logged the message, Master
[02:41:49] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 02:41:46 UTC 2013
[02:41:59] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours
[02:42:50] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 02:42:47 UTC 2013
[02:43:49] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours
[02:43:59] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 02:43:57 UTC 2013
[02:44:49] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 02:44:42 UTC 2013
[02:44:49] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours
[02:44:59] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours
[02:55:49] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 02:55:45 UTC 2013
[02:56:19] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours
[02:56:49] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 02:56:45 UTC 2013
[02:57:49] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours
[03:00:49] RECOVERY - Puppet freshness on cp1031 is OK: puppet ran at Mon Oct 21 03:00:46 UTC 2013
[03:00:59] RECOVERY - Puppet freshness on cp1022 is OK: puppet ran at Mon Oct 21 03:00:56 UTC 2013
[03:01:19] PROBLEM - Puppet freshness on cp1031 is CRITICAL: No successful Puppet run in the last 10 hours
[03:01:49] PROBLEM - Puppet freshness on cp1022 is CRITICAL: No successful Puppet run in the last 10 hours
[03:05:49] RECOVERY - Puppet freshness on cp1028 is OK: puppet ran at Mon Oct 21 03:05:47 UTC 2013
[03:06:29] PROBLEM - Puppet freshness on cp1028 is CRITICAL: No successful Puppet run in the last 10 hours
[03:14:59] PROBLEM - Apache HTTP on mw1109 is CRITICAL: Connection refused
[03:18:49] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 03:18:42 UTC 2013
[03:19:29] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours
[03:19:59] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 03:19:58 UTC 2013
[03:20:39] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours
[03:21:39] !log on mw1109: stopped apache to test cgconfig
[03:21:49] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 03:21:43 UTC 2013
[03:21:55] Logged the message, Master
[03:21:59] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours
[03:21:59] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 03:21:58 UTC 2013
[03:22:09] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours
[03:28:49] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 03:28:41 UTC 2013
[03:28:59] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours
[03:32:50] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 03:32:44 UTC 2013
[03:33:09] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours
[03:34:09] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 03:34:00 UTC 2013
[03:34:49] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours
[03:35:49] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 03:35:40 UTC 2013
[03:36:29] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours
[03:41:49] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 03:41:43 UTC 2013
[03:41:59] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours
[03:42:59] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 03:42:48 UTC 2013
[03:43:49] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours
[03:43:49] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 03:43:44 UTC 2013
[03:44:49] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours
[03:44:49] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 03:44:44 UTC 2013
[03:44:59] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours
[03:55:49] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 03:55:47 UTC 2013
[03:56:19] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours
[03:56:50] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 03:56:48 UTC 2013
[03:57:49] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours
[04:00:49] RECOVERY - Puppet freshness on cp1031 is OK: puppet ran at Mon Oct 21 04:00:42 UTC 2013
[04:00:59] RECOVERY - Apache HTTP on mw1109 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.083 second response time
[04:00:59] RECOVERY - Puppet freshness on cp1022 is OK: puppet ran at Mon Oct 21 04:00:57 UTC 2013
[04:01:19] PROBLEM - Puppet freshness on cp1031 is CRITICAL: No successful Puppet run in the last 10 hours
[04:01:49] PROBLEM - Puppet freshness on cp1022 is CRITICAL: No successful Puppet run in the last 10 hours
[04:05:59] RECOVERY - Puppet freshness on cp1028 is OK: puppet ran at Mon Oct 21 04:05:49 UTC 2013
[04:06:29] PROBLEM - Puppet freshness on cp1028 is CRITICAL: No successful Puppet run in the last 10 hours
[04:18:49] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 04:18:40 UTC 2013
[04:19:29] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours
[04:19:49] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 04:19:46 UTC 2013
[04:20:39] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours
[04:21:59] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 04:21:57 UTC 2013
[04:21:59] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 04:21:57 UTC 2013
[04:22:09] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours
[04:22:59] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours
[04:28:49] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 04:28:46 UTC 2013
[04:28:59] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours
[04:32:49] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 04:32:43 UTC 2013
[04:33:09] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours
[04:33:49] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 04:33:44 UTC 2013
[04:34:49] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours
[04:35:49] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 04:35:39 UTC 2013
[04:36:29] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours
[04:39:09] RECOVERY - check_job_queue on fenari is OK: JOBQUEUE OK - all job queues below 10,000
[04:41:49] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 04:41:45 UTC 2013
[04:41:59] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours
[04:42:19] PROBLEM - check_job_queue on fenari is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[04:42:49] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 04:42:41 UTC 2013
[04:43:49] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours
[04:43:49] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 04:43:47 UTC 2013
[04:44:49] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 04:44:42 UTC 2013
[04:44:49] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours
[04:44:59] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours
[04:55:49] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 04:55:45 UTC 2013
[04:56:19] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours
[04:56:49] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 04:56:45 UTC 2013
[04:57:49] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours
[05:00:49] RECOVERY - Puppet freshness on cp1022 is OK: puppet ran at Mon Oct 21 05:00:41 UTC 2013
[05:00:49] RECOVERY - Puppet freshness on cp1031 is OK: puppet ran at Mon Oct 21 05:00:41 UTC 2013
[05:00:49] PROBLEM - Puppet freshness on cp1022 is CRITICAL: No successful Puppet run in the last 10 hours
[05:01:19] PROBLEM - Puppet freshness on cp1031 is CRITICAL: No successful Puppet run in the last 10 hours
[05:06:09] RECOVERY - Puppet freshness on cp1028 is OK: puppet ran at Mon Oct 21 05:05:59 UTC 2013
[05:06:29] PROBLEM - Puppet freshness on cp1028 is CRITICAL: No successful Puppet run in the last 10 hours
[05:10:09] PROBLEM - search indices - check lucene status page on search18 is CRITICAL: HTTP CRITICAL: HTTP/1.1 200 OK - pattern found - 55856 bytes in 0.110 second response time
[05:14:43] (PS2) Legoktm: Add MassMessage jobs to the high priority queue [operations/puppet] - https://gerrit.wikimedia.org/r/90280
[05:15:09] (PS3) Legoktm: Add MassMessage jobs to the high priority queue [operations/puppet] - https://gerrit.wikimedia.org/r/90280
[05:18:49] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 05:18:43 UTC 2013
[05:19:29] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours
[05:20:09] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 05:19:59 UTC 2013
[05:20:39] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours
[05:21:49] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 05:21:45 UTC 2013
[05:21:59] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours
[05:22:09] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 05:22:00 UTC 2013
[05:22:09] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours
[05:28:49] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 05:28:44 UTC 2013
[05:28:59] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours
[05:32:59] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 05:32:53 UTC 2013
[05:33:09] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours
[05:33:26] (PS1) Springle: icinga pmp-check-mysql-innodb idle_blocker_duration [operations/puppet] - https://gerrit.wikimedia.org/r/90867
[05:33:59] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 05:33:49 UTC 2013
[05:34:49] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours
[05:35:34] (CR) Springle: [C: 2] icinga pmp-check-mysql-innodb idle_blocker_duration [operations/puppet] - https://gerrit.wikimedia.org/r/90867 (owner: Springle)
[05:35:49] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 05:35:40 UTC 2013
[05:36:29] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours
[05:41:49] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 05:41:44 UTC 2013
[05:42:00] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours
[05:42:49] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 05:42:40 UTC 2013
[05:43:49] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours
[05:43:49] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 05:43:46 UTC 2013
[05:44:49] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours
[05:45:09] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 05:45:01 UTC 2013
[05:45:59] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours
[05:55:59] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 05:55:49 UTC 2013
[05:56:19] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours
[05:56:59] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 05:56:54 UTC 2013
[05:57:49] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours
[06:00:49] RECOVERY - Puppet freshness on cp1022 is OK: puppet ran at Mon Oct 21 06:00:46 UTC 2013
[06:00:49] RECOVERY - Puppet freshness on cp1031 is OK: puppet ran at Mon Oct 21 06:00:46 UTC 2013
[06:01:19] PROBLEM - Puppet freshness on cp1031 is CRITICAL: No successful Puppet run in the last 10 hours
[06:01:49] PROBLEM - Puppet freshness on cp1022 is CRITICAL: No successful Puppet run in the last 10 hours
[06:02:58] (CR) Ori.livneh: [C: 2] Add MassMessage jobs to the high priority queue [operations/puppet] - https://gerrit.wikimedia.org/r/90280 (owner: Legoktm)
[06:05:59] RECOVERY - Puppet freshness on cp1028 is OK: puppet ran at Mon Oct 21 06:05:58 UTC 2013
[06:06:29] PROBLEM - Puppet freshness on cp1028 is CRITICAL: No successful Puppet run in the last 10 hours
[06:07:00] RECOVERY - Disk space on copper is OK: DISK OK
[06:14:09] RECOVERY - Puppet freshness on copper is OK: puppet ran at Mon Oct 21 06:14:06 UTC 2013
[06:15:12] !log moved older swift replication logs from copper:/root to iron:/root/swift-repl/ (now gzipped), copper was full
[06:15:26] Logged the message, Master
[06:32:27] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 06:32:20 UTC 2013
[06:32:27] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 06:32:20 UTC 2013
[06:32:27] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 06:32:20 UTC 2013
[06:32:27] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 06:32:20 UTC 2013
[06:32:27] RECOVERY - Puppet freshness on cp4001 is OK: puppet ran at Mon Oct 21 06:32:21 UTC 2013
[06:32:27] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 06:32:21 UTC 2013
[06:32:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours
[06:32:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours
[06:32:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours
[06:32:57] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 06:32:47 UTC 2013
[06:33:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours
[06:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours
[06:33:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours
[06:33:57] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 06:33:47 UTC 2013
[06:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours
[06:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 06:35:42 UTC 2013
[06:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours
[06:41:47] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 06:41:45 UTC 2013
[06:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours
[06:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 06:42:45 UTC 2013
[06:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours
[06:43:57] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 06:43:51 UTC 2013
[06:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours
[06:45:07] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 06:45:06 UTC 2013
[06:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours
[06:55:47] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 06:55:43 UTC 2013
[06:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours
[06:57:07] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 06:56:58 UTC 2013
[06:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours
[07:04:03] (PS1) Ori.livneh: Correct path reference to bits path hit by ULS [operations/apache-config] - https://gerrit.wikimedia.org/r/90869
[07:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 07:18:43 UTC 2013
[07:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours
[07:20:07] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 07:19:59 UTC 2013
[07:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours
[07:21:47] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 07:21:46 UTC 2013
[07:21:47] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 07:21:46 UTC 2013
[07:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours
[07:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours
[07:28:57] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 07:28:47 UTC 2013
[07:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours
[07:32:57] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 07:32:50 UTC 2013
[07:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours
[07:33:47] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 07:33:45 UTC 2013
[07:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours
[07:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 07:35:40 UTC 2013
[07:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours
[07:41:57] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 07:41:47 UTC 2013
[07:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 07:42:37 UTC 2013
[07:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours
[07:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours
[07:43:57] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 07:43:47 UTC 2013
[07:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours
[07:44:47] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 07:44:43 UTC 2013
[07:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours
[07:54:35] (PS2) ArielGlenn: remove srv1-234 main and mgmt entries, except for srv193 [operations/dns] - https://gerrit.wikimedia.org/r/90516
[07:55:57] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 07:55:55 UTC 2013
[07:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours
[07:56:57] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 07:56:56 UTC 2013
[07:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours
[07:57:27] PROBLEM - MySQL Slave Running on db1026 is CRITICAL: CRIT replication Slave_IO_Running: Yes Slave_SQL_Running: No Last_Error: Error Table wikidatawiki._wb_terms_new doesnt exist on query. De
[07:59:17] PROBLEM - MySQL Slave Running on db45 is CRITICAL: CRIT replication Slave_IO_Running: Yes Slave_SQL_Running: No Last_Error: Error Deadlock found when trying to get lock: try restarting transac
[07:59:27] RECOVERY - MySQL Slave Running on db1026 is OK: OK replication Slave_IO_Running: Yes Slave_SQL_Running: Yes Last_Error:
[07:59:33] I am going to upgrade Jenkins / restart it for a scheduled maintenance. Expected downtime: 1 hour starting now.
[07:59:40] !log upgrading Jenkins for scheduled maintenance [07:59:52] Logged the message, Master [08:00:36] !log stopping Zuul / Jenkins [08:00:48] Logged the message, Master [08:01:23] (03CR) 10ArielGlenn: [C: 032] remove srv1-234 main and mgmt entries, except for srv193 [operations/dns] - 10https://gerrit.wikimedia.org/r/90516 (owner: 10ArielGlenn) [08:02:07] PROBLEM - MySQL Replication Heartbeat on db45 is CRITICAL: CRIT replication delay 305 seconds [08:02:57] PROBLEM - zuul_service_running on gallium is CRITICAL: PROCS CRITICAL: 0 processes with regex args ^/usr/bin/python /usr/local/bin/zuul-server [08:13:04] that Zuul issue is me [08:13:15] I don't think I can flag a service has been under maintenance [08:13:20] and IIRC it does not send page [08:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 08:18:43 UTC 2013 [08:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [08:20:07] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 08:19:59 UTC 2013 [08:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [08:21:47] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 08:21:39 UTC 2013 [08:22:07] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 08:21:59 UTC 2013 [08:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [08:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [08:28:57] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 08:28:48 UTC 2013 [08:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [08:32:57] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 08:32:49 UTC 2013 [08:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours 
[08:34:07] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 08:33:59 UTC 2013 [08:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [08:34:55] (03PS1) 10ArielGlenn: get rid of temp-es* hosts, entries from 2009 long since unused [operations/dns] - 10https://gerrit.wikimedia.org/r/90871 [08:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 08:35:40 UTC 2013 [08:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [08:41:05] (03CR) 10Akosiaris: "From what I see it should be hitting all hosts(when of course they include that class)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/87332 (owner: 10Matanya) [08:41:57] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 08:41:47 UTC 2013 [08:41:57] RECOVERY - zuul_service_running on gallium is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/local/bin/zuul-server [08:41:58] !log restarted Zuul [08:42:03] damn icinga is fast [08:42:10] Logged the message, Master [08:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [08:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 08:42:43 UTC 2013 [08:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [08:43:18] !log stopping Zuul again. 
Need to upgrade Jenkins plugins [08:43:32] Logged the message, Master [08:43:57] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 08:43:48 UTC 2013 [08:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [08:44:47] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 08:44:43 UTC 2013 [08:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [08:45:57] PROBLEM - zuul_service_running on gallium is CRITICAL: PROCS CRITICAL: 0 processes with regex args ^/usr/bin/python /usr/local/bin/zuul-server [08:46:59] !log jenkins: upgrading plugins [08:47:10] Logged the message, Master [08:49:03] (03PS1) 10Akosiaris: Fix drac module broken in 22d7837 [operations/puppet] - 10https://gerrit.wikimedia.org/r/90874 [08:49:20] (03CR) 10Akosiaris: [C: 032] Fix drac module broken in 22d7837 [operations/puppet] - 10https://gerrit.wikimedia.org/r/90874 (owner: 10Akosiaris) [08:49:59] (03CR) 10Akosiaris: [V: 032] Fix drac module broken in 22d7837 [operations/puppet] - 10https://gerrit.wikimedia.org/r/90874 (owner: 10Akosiaris) [08:50:29] akosiaris: hi, jenkins is being upgraded so no linting for you :-] [08:51:10] !log forced verified +2 on gerrit 90874 since jenkins is being upgraded [08:51:22] hashar: yeah i remembered. Thanks :-) [08:51:23] Logged the message, Master [08:53:49] akosiaris: can you please explain your fix? I don't fully understand it [08:54:38] aaaaaaa before that. I just noticed this https://gerrit.wikimedia.org/r/#/c/90098/8/modules/ssh/templates/sshd_config.erb [08:54:59] yes, my bad [08:55:10] fixed by andrewbogott [08:55:21] yes... 
what i am trying to understand is why [08:55:42] I cherry picked paravoid's patch, and merged it into mine by accident [08:55:57] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 08:55:47 UTC 2013 [08:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [08:56:47] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 08:56:42 UTC 2013 [08:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [08:58:23] akosiaris: i'm referring to this patch: https://gerrit.wikimedia.org/r/#/c/15874/1/modules/ssh/templates/sshd_config.erb [08:59:48] !log rerestarting Jenkins. [09:00:02] Logged the message, Master [09:02:00] hmmm... ok. What I mostly disliked was that it got merged... [09:02:09] anyway [09:02:33] As far as the other fix goes, puppet needs to be able to reference files [09:02:46] !log Jenkins restarted / upgraded [09:02:47] and modules need to have their files in a files directory [09:03:00] Logged the message, Master [09:03:10] !log restarting Zuul [09:03:22] but confusingly enough the sources need to be of the form "puppet:///modules/<module>/<file>" [09:03:23] Logged the message, Master [09:03:25] or else it won't work [09:03:57] RECOVERY - zuul_service_running on gallium is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/local/bin/zuul-server [09:04:08] Jenkins should be back up now :-] [09:04:23] ahm, it was https://gerrit.wikimedia.org/r/#/c/87332/5/modules/drac/manifests/init.pp akosiaris, you requested the removal ... [09:04:58] Not really. Read my comment more carefully please [09:05:42] oh, you want the files in the files directory, without being called from there. I think i understand now [09:06:25] not me. Puppet does [09:06:34] yeah, well :) [09:06:48] i need to redo some work on my download module then.
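[Editor's note: the Puppet file-serving convention akosiaris describes above — module files live under a `files/` directory, but the source URI omits the literal `files/` component — can be sketched with a minimal, hypothetical module. `example` and `motd.txt` are made-up names, not anything from operations/puppet:]

```puppet
# Hypothetical module layout:
#   modules/example/files/motd.txt
#   modules/example/manifests/init.pp
class example {
  file { '/etc/motd':
    ensure => file,
    # Puppet's fileserver maps puppet:///modules/<module>/<path>
    # to modules/<module>/files/<path> on the master -- note that
    # "files/" itself never appears in the URI.
    source => 'puppet:///modules/example/motd.txt',
  }
}
```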
[09:11:00] (03PS8) 10Matanya: download: convert into a module and clean up [operations/puppet] - 10https://gerrit.wikimedia.org/r/90760 [09:12:01] (03PS2) 10Reedy: Moved all apple-touch-icon.png images to bits. Set $wgAppleTouchIcon where appropriate. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90762 [09:12:09] (03CR) 10Reedy: [C: 032] Moved all apple-touch-icon.png images to bits. Set $wgAppleTouchIcon where appropriate. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90762 (owner: 10Reedy) [09:12:50] c'mon jenkins [09:13:34] (03Abandoned) 10Hashar: misc varnish conf for doc.wikimedia.org [operations/puppet] - 10https://gerrit.wikimedia.org/r/82653 (owner: 10Hashar) [09:13:47] Reedy: he just woke up, give a few seconds :) [09:14:11] He's a slacker! [09:14:30] (03CR) 10jenkins-bot: [V: 04-1] download: convert into a module and clean up [operations/puppet] - 10https://gerrit.wikimedia.org/r/90760 (owner: 10Matanya) [09:14:39] here i go [09:14:45] (03Merged) 10jenkins-bot: Moved all apple-touch-icon.png images to bits. Set $wgAppleTouchIcon where appropriate. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90762 (owner: 10Reedy) [09:15:26] Could someone from ops merge an apache config change for me please? It's a partial reversion (re-addition of 3 lines) of something Daniel merged and pushed for me on friday. 
https://gerrit.wikimedia.org/r/#/c/90764 [09:16:07] Reedy: yes I will [09:16:08] !log reedy synchronized docroot/bits/apple-touch/ [09:16:13] thanks [09:16:20] Logged the message, Master [09:16:46] !log reedy synchronized wmf-config/InitialiseSettings.php [09:16:59] Logged the message, Master [09:17:14] Another step toward removing most of our docroot folders :D [09:17:21] (03CR) 10Akosiaris: [C: 032] www.wikisource.org is not a portal, but a redirect to wikisource.org [operations/apache-config] - 10https://gerrit.wikimedia.org/r/90764 (owner: 10Reedy) [09:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 09:18:41 UTC 2013 [09:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [09:19:57] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 09:19:47 UTC 2013 [09:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [09:21:57] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 09:21:48 UTC 2013 [09:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [09:22:27] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 09:22:18 UTC 2013 [09:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [09:32:47] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 09:32:44 UTC 2013 [09:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [09:33:47] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 09:33:45 UTC 2013 [09:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [09:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 09:35:41 UTC 2013 [09:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 
hours [09:40:35] (03PS1) 10Reedy: Compress apple touch pngs [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90880 [09:41:05] (03CR) 10Reedy: [C: 032] Compress apple touch pngs [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90880 (owner: 10Reedy) [09:41:14] (03Merged) 10jenkins-bot: Compress apple touch pngs [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90880 (owner: 10Reedy) [09:41:57] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 09:41:48 UTC 2013 [09:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [09:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 09:42:44 UTC 2013 [09:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [09:43:57] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 09:43:54 UTC 2013 [09:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [09:44:47] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 09:44:44 UTC 2013 [09:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [09:55:57] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 09:55:47 UTC 2013 [09:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [09:56:57] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 09:56:52 UTC 2013 [09:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [10:08:08] ping Reedy [10:08:18] * Reedy hides [10:08:30] https://gerrit.wikimedia.org/r/#/c/90762/2/wmf-config/InitialiseSettings.php [10:08:45] You know I left default to false, and apple-touch.png in docroot for a reason, right? :-P [10:09:02] No? [10:09:18] There was a bug for it. Somewhere. 
[10:09:24] I left the default as false because of a lack of an image to use as using wikipedia will bitch [10:09:55] https://bugzilla.wikimedia.org/show_bug.cgi?id=55917 [10:10:06] I did query on there if i should make a favicon/robots style rewrite to go with it [10:10:41] https://bugzilla.wikimedia.org/show_bug.cgi?id=19392#c15 [10:11:13] I'm not keeping the docroots around for silly apple images [10:11:34] Noting default is still false [10:11:34] Sure, I noticed you've been cleaning those up recently [10:12:21] The images are still actually in the docroots for now [10:12:32] I didn't sync them, and sync-docroot doesn't propogate deletions [10:12:42] I also note I didn't approve and merge https://gerrit.wikimedia.org/r/#/c/60777/ [10:13:00] So that's a yes, I need to make a favico style php redirect script [10:13:35] Maybe [10:13:58] * twkozlowski curses the mere idea of having separete Apple Touch icons [10:14:06] separate* [10:14:49] * Reedy lets twkozlowski propose we use wikimedia on all projects and he can deal with the backlash [10:15:38] Oh [10:15:44] You mean seperate touch icons from favico? [10:16:38] Yeah; the idea of having to create per-device icons is just crazy. [10:17:20] mumble mumble apple mumble mumble [10:17:38] Reedy: I odn [10:17:45] Oooo [10:18:14] You mean that older Apple devices don't understand link rel= stuff? 
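[Editor's note: for context on the two mechanisms being contrasted here — `$wgAppleTouchIcon` emits a `<link>` element, while devices that ignore it fall back to blindly requesting `/apple-touch-icon.png` from the site root. A hypothetical sketch; the URL is illustrative, not the actual configuration:]

```html
<!-- Emitted in the page head when $wgAppleTouchIcon is set
     (href is a made-up example): -->
<link rel="apple-touch-icon" href="//bits.wikimedia.org/apple-touch/wikipedia.png">

<!-- Clients that ignore the link element request /apple-touch-icon.png
     from the docroot instead, which is why a favicon.php-style
     rewrite/redirect script is being discussed. -->
```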
[10:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 10:18:39 UTC 2013 [10:19:01] https://developer.apple.com/library/ios/documentation/AppleApplications/Reference/SafariWebContent/ConfiguringWebApplications/ConfiguringWebApplications.html doesn't mention this [10:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [10:19:44] Reedy: it should be OK, I actually tested this with legoktm, and serving one size seems to work [10:19:57] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 10:19:55 UTC 2013 [10:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [10:21:57] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 10:21:55 UTC 2013 [10:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [10:22:17] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 10:22:16 UTC 2013 [10:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [10:25:47] (03PS1) 10Reedy: Add "touch.php" for $wgAppleTouchIcon... 
[operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90886 [10:27:45] I'll come back to this later ;) [10:27:55] \o/ [10:28:47] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 10:28:43 UTC 2013 [10:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [10:32:47] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 10:32:42 UTC 2013 [10:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [10:33:57] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 10:33:47 UTC 2013 [10:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [10:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 10:35:43 UTC 2013 [10:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [10:41:47] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 10:41:45 UTC 2013 [10:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [10:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 10:42:41 UTC 2013 [10:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [10:43:36] !log Gerrit replication is broken, side effects: git.wm.o shows outdated trees and Jenkins might be missing some commits [10:43:53] Logged the message, Master [10:43:57] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 10:43:56 UTC 2013 [10:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [10:44:47] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 10:44:46 UTC 2013 [10:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [10:55:47] RECOVERY - Puppet freshness on cp1033 is 
OK: puppet ran at Mon Oct 21 10:55:44 UTC 2013 [10:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [10:56:47] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 10:56:45 UTC 2013 [10:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [11:10:41] (03CR) 10Akosiaris: [C: 032] "LGTM, although I have no idea what each of those fonts is. A quick check showed that all packages exist so we are good I think" [operations/puppet] - 10https://gerrit.wikimedia.org/r/88441 (owner: 10Reedy) [11:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 11:18:41 UTC 2013 [11:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [11:19:47] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 11:19:46 UTC 2013 [11:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [11:22:07] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 11:22:03 UTC 2013 [11:22:17] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 11:22:08 UTC 2013 [11:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [11:23:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [11:28:47] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 11:28:45 UTC 2013 [11:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [11:31:22] !log Gerrit replication have been broken since Oct 19th roughly 20:50 UTC. 
{{bug|55948}} [11:31:27] addshore: ^^^ [11:31:37] Logged the message, Master [11:31:38] :< [11:32:03] That blocks us rather a lot :P [11:32:22] 20:54 ^d: gerrit: installed 2.7-rc2-507-g1e7090b, service back up [11:32:29] on friday [11:32:47] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 11:32:42 UTC 2013 [11:33:08] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [11:33:47] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 11:33:42 UTC 2013 [11:33:51] hmm [11:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [11:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 11:35:43 UTC 2013 [11:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [11:38:28] https://bugzilla.wikimedia.org/show_bug.cgi?id=55948#c3 [11:38:42] so when ytterbium connects on gallium it gets: [11:38:45] Received disconnect from 208.80.154.80: 3: com.jcraft.jsch.JSchException: reject HostKey: gallium.wikimedia.org [preauth] [11:39:57] I guess Gerrit no more recognize gallium host key [11:42:07] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 11:42:01 UTC 2013 [11:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [11:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 11:42:42 UTC 2013 [11:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [11:44:07] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 11:43:57 UTC 2013 [11:44:08] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [11:44:47] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 11:44:43 UTC 2013 [11:45:16] I need to grab a snack [11:45:29] potentially a root could look at whatever user is 
doing the replication on ytterbium [11:45:43] and verify it has gallium.wikimedia.org in its known_hosts file [11:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [11:46:07] maybe the replication has been restarted as root who does not have the host key [11:46:07] brb [11:47:55] !log Shutdown csw2-esams:xe-2/1/1 (1 DF leg) [11:48:08] Logged the message, Master [11:55:47] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 11:55:44 UTC 2013 [11:55:58] (03CR) 10Reza: [C: 031] "it is ok" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90759 (owner: 10Ebrahim) [11:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [11:56:57] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 11:56:56 UTC 2013 [11:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [11:59:36] back [12:07:28] (03PS1) 10Odder: (bug 54828) Configure FlaggedRevs for ptwiki (take 3) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90893 [12:07:45] (03PS2) 10Physikerwelt: Mathoid service [operations/puppet] - 10https://gerrit.wikimedia.org/r/90733 [12:08:16] (03CR) 10Odder: "See https://gerrit.wikimedia.org/r/#/c/90893/ for a follow-up." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89001 (owner: 10Odder) [12:09:24] (03CR) 10Odder: "This is a follow-up to https://gerrit.wikimedia.org/r/#/c/89001/ and https://gerrit.wikimedia.org/r/#/c/89482/" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90893 (owner: 10Odder) [12:09:33] apergos paravoid akosiaris mark: any of you could assist in fixing some ssh known_host issue on ytterbium ? 
A gerrit replication process can't ssh to some boxes because it is rejecting the destination host key :( [12:09:51] traces: https://bugzilla.wikimedia.org/show_bug.cgi?id=55948#c3 [12:10:33] Reedy: how's https://bugzilla.wikimedia.org/show_bug.cgi?id=54680 going on? [12:13:53] (03CR) 10Helder.wiki: [C: 031] (bug 54828) Configure FlaggedRevs for ptwiki (take 3) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90893 (owner: 10Odder) [12:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 12:18:39 UTC 2013 [12:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [12:20:07] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 12:19:59 UTC 2013 [12:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [12:21:47] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 12:21:44 UTC 2013 [12:21:47] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 12:21:44 UTC 2013 [12:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [12:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [12:29:07] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 12:29:03 UTC 2013 [12:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [12:32:47] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 12:32:45 UTC 2013 [12:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [12:33:47] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 12:33:46 UTC 2013 [12:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [12:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 12:35:41 UTC 2013 
[12:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [12:41:47] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 12:41:43 UTC 2013 [12:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 12:42:39 UTC 2013 [12:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [12:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [12:43:47] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 12:43:44 UTC 2013 [12:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [12:45:07] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 12:45:04 UTC 2013 [12:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [12:51:40] hashar: could it be something similar ? http://stackoverflow.com/questions/13079002/knownhosts-for-ant-scp-and-sshexec-tasks [12:51:58] akosiaris: looking [12:52:40] the thing is that the Gerrit change is pretty small and unrelated :/ [12:52:59] I am suspecting gerrit got restarted as root instead of gerrit2 or gerrit user [12:53:37] akosiaris: can you potentially look at the known_hosts files for root / gerrit (or gerrit2) users and see whether it got gallium / lanthanum / antinomy ? [12:53:58] might also want to verify which user is running the java Gerrit process [12:54:28] on which machines ? [12:54:34] gallium + the other 2 ? 
[12:55:03] ytterbium [12:55:07] that is the machine running Gerrit [12:55:32] the Java process does ssh connection to various hosts (lanthanum / gallium/ antinomy) they are the Gerrit replication receiver [12:55:47] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 12:55:44 UTC 2013 [12:55:48] seems the issue is on Gerrit master side, it is rejecting the destination hosts keys [12:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [12:56:47] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 12:56:44 UTC 2013 [12:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [12:57:28] known_hosts on ytterbium was updated today [12:57:32] well [12:58:42] but the gallium key is correct [12:58:51] and for /root/known_hosts ? [12:59:23] not touched since Aug 27 [12:59:24] if Gerrit runs at root, it would use that file [12:59:24] yes but it does not [12:59:24] which probably doesn't contains lanthanum/gallium/antinomy [12:59:24] it runs as gerrit2 [12:59:24] ohhh [12:59:24] crazy :( [12:59:26] you could try symlinking known_hosts2 :D [12:59:40] such thing does not exist (thank god) [13:02:02] ehe [13:06:34] akosiaris: so I have no clue :( [13:07:20] * hashar digs in https://gerrit.wikimedia.org/r/plugins/replication/Documentation/config.html [13:07:43] If replicating over SSH (recommended), ensure the host key of the remote system(s) is already in the Gerrit user’s ~/.ssh/known_hosts file. The easiest way to add the host key is to connect once by hand with the command line: [13:07:44] sudo su -c 'ssh mirror1.us.some.org echo' gerrit2 [13:07:44] yeah hmm [13:09:40] these keys are not wrong .... [13:11:51] so Gerrit is crazy :( [13:13:13] have you tried turning it off and back on *runs* [13:14:30] huh... 
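[Editor's note: since the debugging above turns on whether ytterbium's known_hosts entries are in a form the Java SSH client will accept, here is a minimal sketch of how OpenSSH's hashed (`HashKnownHosts`) entry format is derived. Whether any given JSch version accepts hashed entries is version-dependent, so treat this only as an aid for recognizing the format; the hostname and key material are placeholders:]

```python
import base64
import hashlib
import hmac
import os

def hashed_known_hosts_entry(hostname, key_type, pubkey_b64, salt=None):
    """Build an OpenSSH HashKnownHosts-style line:

        |1|base64(salt)|base64(HMAC-SHA1(salt, hostname))| <key_type> <key>

    A strict known_hosts parser has to handle both this hashed form and
    plain-hostname entries; a client that supports only one of them will
    reject otherwise-correct host keys.
    """
    salt = salt or os.urandom(20)  # OpenSSH uses a 20-byte random salt
    digest = hmac.new(salt, hostname.encode(), hashlib.sha1).digest()
    hashed_host = "|1|%s|%s|" % (
        base64.b64encode(salt).decode(),
        base64.b64encode(digest).decode(),
    )
    return "%s %s %s" % (hashed_host, key_type, pubkey_b64)

# Hypothetical key material, for illustration only.
entry = hashed_known_hosts_entry("gallium.wikimedia.org", "ssh-rsa", "AAAAB3...")
```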
[13:17:34] akosiaris: some threads says the java ssh implementation (Jsch) expect a spefici format for known_hosts [13:17:35] https://groups.google.com/forum/#!topic/repo-discuss/9PTfVG8vdAU [13:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 13:18:39 UTC 2013 [13:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [13:20:07] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 13:19:59 UTC 2013 [13:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [13:22:07] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 13:22:00 UTC 2013 [13:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [13:22:17] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 13:22:10 UTC 2013 [13:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [13:26:27] akosiaris: do you have access to etherpad's logs? [13:26:38] yes [13:27:01] but i am afraid you are in for a disappointment [13:27:12] why [13:27:14] if you are expecting any kind of help from them... [13:27:31] but let's be optimistic here [13:27:35] how can i help ? [13:27:53] akosiaris: I'm wondering what's going on when the server returns 503 error [13:28:30] I just saw https://github.com/ether/etherpad-lite/issues/1941 ; it seems that if a single pad is too weird the whole etherpad can go down, so maybe we have specific pads that make the site restart? [13:28:43] that is a proxy error returned from apache. 
[13:28:57] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 13:28:48 UTC 2013 [13:29:00] it will be returned if the backend process is not responding for some reason [13:29:09] for example it crashed and is restarting [13:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [13:29:24] yep [13:29:48] that's why that bug lit a lamp in my head (is this valid English? :) ) [13:30:01] can you check if it's being restarted? [13:30:15] valid english yes... i don't think it is an expression actually used though [13:30:17] lemme check [13:31:14] any specific timeframe ? [13:32:57] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 13:32:49 UTC 2013 [13:32:58] so [13:32:58] akosiaris: a couple minutes before I asked? :) [13:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [13:33:09] or grep for "RESTART!" [13:33:21] restart at 13:15.46 UTC [13:33:27] or "graceful shutdown" , "SyntaxError: Unexpected end of input" [13:33:46] yes, that could be the 503 I saw [13:33:47] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 13:33:44 UTC 2013 [13:33:52] and at 12:30:55 UTC and at 11:45:20 and at 11:44:25 [13:33:58] what were you doing then ? 
[13:34:07] cause i got 0 restarts the previous days [13:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [13:34:27] saw i am gonna assume that is you :-) [13:34:30] https://etherpad.wikimedia.org/p/l10n-team-2013-10 https://etherpad.wikimedia.org/p/i18n-team-04 [13:34:58] epl has a tendency to corrup pads https://github.com/ether/etherpad-lite/issues/1885#issuecomment-26715140 [13:35:38] [2013-10-21 13:15:36.165] [ERROR] console - Error: exports: mismatched apply: 28808 / 28806 [13:35:38] at Object.error (/usr/share/etherpad-lite/src/static/js/Changeset.js:39:11) [13:35:38] at Object.assert (/usr/share/etherpad-lite/src/static/js/Changeset.js:53:13) [13:35:38] at Object.exports.applyToText (/usr/share/etherpad-lite/src/static/js/Changeset.js:907:11) [13:35:38] at Object.exports.applyToAText (/usr/share/etherpad-lite/src/static/js/Changeset.js:1598:19) [13:35:38] at Pad.getInternalRevisionAText (/usr/share/etherpad-lite/src/node/db/Pad.js:204:27) [13:35:38] at async.series.results (/usr/share/etherpad-lite/src/node_modules/async/lib/async.js:486:21) [13:35:39] at _asyncMap (/usr/share/etherpad-lite/src/node_modules/async/lib/async.js:185:13) [13:35:39] at async.forEachSeries.iterate (/usr/share/etherpad-lite/src/node_modules/async/lib/async.js:108:13) [13:35:40] at async.forEachSeries.iterate (/usr/share/etherpad-lite/src/node_modules/async/lib/async.js:119:25) [13:35:40] at _asyncMap (/usr/share/etherpad-lite/src/node_modules/async/lib/async.js:187:17) [13:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 13:35:40 UTC 2013 [13:35:47] nice... [13:35:53] O_o [13:36:27] [2013-10-21 11:45:09.793] [ERROR] console - [RangeError: Maximum call stack size exceeded] [13:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [13:36:30] man... [13:36:36] at least 2 different bugs [13:36:46] (not that i am surprised...) 
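[Editor's note: the `mismatched apply: 28808 / 28806` error above is Etherpad's Changeset code refusing to apply a changeset to text whose length differs from the length the changeset was built against, rather than risking silent pad corruption. A toy Python analogue of that invariant — the op names are simplified, not Etherpad's actual changeset encoding:]

```python
def apply_changeset(old_len, ops, text):
    """Apply a toy changeset (declared input length + ops) to text.

    Mirrors, very loosely, why Changeset.applyToText() asserts
    "mismatched apply: <actual> / <expected>": the ops consume a fixed
    number of input characters, so applying them to text of any other
    length cannot succeed.
    """
    if len(text) != old_len:
        raise ValueError("mismatched apply: %d / %d" % (len(text), old_len))
    out, pos = [], 0
    for op, arg in ops:
        if op == "keep":      # copy arg chars from the input
            out.append(text[pos:pos + arg])
            pos += arg
        elif op == "skip":    # delete arg chars from the input
            pos += arg
        elif op == "insert":  # add new text
            out.append(arg)
    return "".join(out)
```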
[13:36:56] akosiaris: could you please file upstream? [13:37:18] this could explain all the 503 errors randomly appearing now and then, perhaps [13:37:41] I can. maybe they will get fixed [13:38:57] PROBLEM - Host mw27 is DOWN: PING CRITICAL - Packet loss = 100% [13:39:27] RECOVERY - Host mw27 is UP: PING OK - Packet loss = 0%, RTA = 28.16 ms [13:40:58] https://github.com/ether/etherpad-lite/issues/1953 [13:41:47] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 13:41:43 UTC 2013 [13:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 13:42:39 UTC 2013 [13:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [13:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [13:43:26] akosiaris: and I filed https://github.com/ether/etherpad-lite/issues/1954 (convert.js doesn't migrate saved revisions from etherpad to etherpad lite) [13:43:47] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 13:43:45 UTC 2013 [13:43:50] test driven development is awesome :-] [13:43:55] finally got all my tests to pass \O/ [13:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [13:44:19] Nemo_bis: huh... convert.js is a big PITA [13:44:47] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 13:44:45 UTC 2013 [13:44:59] no wonder [13:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [13:52:27] Nemo_bis: closed in my face... [13:52:30] latest develop... [13:52:56] As if we are going to be running the latest develop... [13:54:19] well... there is one minor version we can upgrade to... 
But it won't help since it is already 9 days old [13:54:52] (03PS1) 10Cmjohnson: Updating cerium and praseodymium [operations/puppet] - 10https://gerrit.wikimedia.org/r/90905 [13:55:13] heh, that was fast [13:56:07] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 13:55:58 UTC 2013 [13:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [13:56:47] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 13:56:44 UTC 2013 [13:57:07] (03CR) 10Cmjohnson: [C: 032] Updating cerium and praseodymium [operations/puppet] - 10https://gerrit.wikimedia.org/r/90905 (owner: 10Cmjohnson) [13:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [14:05:18] (03CR) 10Lydia Pintscher: "Daniel, Katie: Can we please talk this through before merging? I have some reservations if this is doing what I think it is. Would like to" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/65443 (owner: 10Dzahn) [14:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 14:18:42 UTC 2013 [14:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [14:19:30] argh [14:19:57] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 14:19:47 UTC 2013 [14:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [14:21:47] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 14:21:43 UTC 2013 [14:21:57] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 14:21:48 UTC 2013 [14:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [14:22:43] hashar: issue with gerrit fixed [14:22:43] akosiaris: okay to merge your drac.pp changes? 
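The Puppet freshness alerts that flap throughout this log (a RECOVERY immediately followed by a CRITICAL for the same host) boil down to a simple age test: a host is CRITICAL when its last successful Puppet run is older than a threshold, 10 hours in these alerts. A minimal sketch of that rule, with illustrative names rather than Icinga's actual configuration:

```python
from datetime import datetime, timedelta

# 10 hours, matching the "No successful Puppet run in the last 10 hours"
# message in the alerts above.
FRESHNESS_WINDOW = timedelta(hours=10)

def freshness_state(last_run, now):
    """OK if the last successful run is within the freshness window."""
    return "OK" if now - last_run <= FRESHNESS_WINDOW else "CRITICAL"

now = datetime(2013, 10, 21, 14, 0, 0)
print(freshness_state(datetime(2013, 10, 21, 13, 55), now))  # OK
print(freshness_state(datetime(2013, 10, 21, 3, 0), now))    # CRITICAL
```

The rapid OK/CRITICAL flapping seen here is consistent with two data sources disagreeing about `last_run`: a fresh passive result triggers RECOVERY, while a stale stored timestamp immediately re-triggers CRITICAL.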
[14:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [14:23:07] cmjohnson1: i forgot them on sockpuppet ? [14:23:11] shit! [14:23:13] yep [14:23:14] sorry... yes please [14:23:18] cool [14:23:19] thx [14:23:27] no, thank you :-) [14:23:35] akosiaris: how did you manage to fix it ? :-] [14:23:47] you can probably dump / close https://bugzilla.wikimedia.org/show_bug.cgi?id=55948#c5 [14:24:08] well... it turns out it is a case of things going a little bit awry [14:24:14] i have no idea why it used to work [14:24:21] but the problem was that for some reason [14:24:29] gerrit2 user's homedir is /home/gerrit2 [14:24:48] which did not contain the .ssh directory that normally is contained in /var/lib/gerrit [14:24:57] addshore: Gerrit replication is back up thanks to akosiaris, so your extensions should be installed using the latest master again [14:25:06] :D [14:25:12] akosiaris: bahhh :-( [14:25:12] * addshore thanks akosiaris and hashar [14:25:19] I had thought of that earlier being really pissed about it [14:25:24] akosiaris: can you copy paste to bug 55948 and close it please ? [14:25:52] cause gerrit2 user has no job having a home there but it turns out you need to restart gerrit before it will see new files in .ssh [14:27:34] it caches them somehow and i don't want to know how.... crappy java code [14:27:34] hashar: yes i will close the bug [14:27:34] but that thing.... I will fix it: gerrit2's homedir is going to be /var/lib/gerrit... no more /home [14:28:47] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 14:28:46 UTC 2013 [14:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [14:29:22] akosiaris: that would be nice. 
Thank you veryyyy much :-] [14:30:03] (03CR) 10Aude: [C: 04-1] "per Lydia, we need to discuss and re-consider the approach" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/65443 (owner: 10Dzahn) [14:32:17] (03PS1) 10Ottomata: Missing | character in pagecounts hourly import cron job [operations/puppet] - 10https://gerrit.wikimedia.org/r/90907 [14:32:26] (03CR) 10Ottomata: [C: 032 V: 032] Missing | character in pagecounts hourly import cron job [operations/puppet] - 10https://gerrit.wikimedia.org/r/90907 (owner: 10Ottomata) [14:32:57] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 14:32:47 UTC 2013 [14:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [14:34:07] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 14:34:03 UTC 2013 [14:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [14:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 14:35:39 UTC 2013 [14:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [14:39:46] yurik: hey [14:39:54] yurik: any progress with https://bugzilla.wikimedia.org/show_bug.cgi?id=54822 ? [14:41:47] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 14:41:41 UTC 2013 [14:42:30] <^d> akosiaris: Thanks for spotting the homedir issue with gerrit2, I overlooked that. [14:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [14:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 14:42:42 UTC 2013 [14:42:58] np. I am just curious how it used to work... what changed ? [14:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [14:43:58] <^d> akosiaris: The package sucks, so I was working around it and recreated the gerrit2 user. 
I did so incorrectly and didn't specify the right homedir. [14:44:07] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 14:43:57 UTC 2013 [14:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [14:44:47] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 14:44:42 UTC 2013 [14:44:48] ^d: ok thanks for the explanation [14:45:19] (03CR) 10Akosiaris: [C: 032] toollabs: Sort package names in dev_environ [operations/puppet] - 10https://gerrit.wikimedia.org/r/90765 (owner: 10Yuvipanda) [14:45:21] <^d> I'm so glad we're getting rid of this package :) [14:45:29] <^d> Well, eventually. [14:45:40] akosiaris: there's two more where that came from, just in case you hadn't noticed :D [14:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [14:45:48] yeah i noticed [14:45:53] :D [14:46:00] but ... why did jenkins give it a +1 and not a +2 ? [14:46:09] .. no idea? [14:46:09] hashar: ^ ? [14:50:36] <^d> akosiaris: I kicked off replication jobs for all repos to force it to catch up. About halfway done now. [14:50:52] aaah cool. Thanks :-) [14:55:57] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 14:55:47 UTC 2013 [14:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [14:56:47] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 14:56:42 UTC 2013 [14:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [15:02:57] andrewbogott: I would like to pm you when possible [15:03:37] RECOVERY - Disk space on ms-be1006 is OK: DISK OK [15:04:28] matanya: Sure. I'm still catching up, haven't read your patches yet this morning. 
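The gerrit2 diagnosis above — the daemon reads `~/.ssh` of its service account, so when the account was recreated with the wrong homedir (`/home/gerrit2` instead of `/var/lib/gerrit`) the keys silently went missing — suggests a quick sanity check: resolve the account's homedir from the passwd database and verify `.ssh` exists there. A sketch of that check (the function name is illustrative):

```python
import os
import pwd

def ssh_dir_status(username):
    """Return (homedir, whether homedir/.ssh exists) for a local account."""
    home = pwd.getpwnam(username).pw_dir   # what the daemon will resolve
    ssh_dir = os.path.join(home, ".ssh")
    return home, os.path.isdir(ssh_dir)

# Using root as a stand-in for a service account like gerrit2:
home, has_ssh = ssh_dir_status("root")
print(f"homedir={home} has .ssh: {has_ssh}")
```

Note the caveat from the discussion: even with the homedir corrected, Gerrit caches the key material, so a restart is needed before it sees new files in `.ssh`.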
[15:05:12] thanks andrewbogott, let me know when you are available [15:05:48] ottomata: jq 1.3 in Debian unstable btw [15:06:39] twkozlowski: I have stopped the frwiki collation stuff on request of springle-afk as it was upsetting the master [15:07:07] I know he was doing some further testing to see if he could find a route going forward (like doing some reads on the slaves rather than master) [15:07:46] nice! [15:07:49] thanks paravoid [15:07:52] ottomata: when I asked for a rationale/alternatives for the udp2log thing, I wasn't asking for such a detailed wiki page btw :) [15:07:55] not that I mind [15:08:25] hehe, I know, but there were more and more options as we all discussed this [15:08:43] also I didn't actually know all of the answers to your questions, so it was good for me to poke diederik and actually write them down [15:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 15:18:39 UTC 2013 [15:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [15:19:57] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 15:19:54 UTC 2013 [15:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [15:21:47] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 15:21:45 UTC 2013 [15:21:47] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 15:21:45 UTC 2013 [15:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [15:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [15:28:47] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 15:28:44 UTC 2013 [15:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [15:32:47] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 15:32:45 UTC 2013 [15:33:07] PROBLEM - Puppet 
freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [15:33:47] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 15:33:46 UTC 2013 [15:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [15:35:04] MaxSem: heh, thanks a lot :) [15:35:24] ? [15:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 15:35:37 UTC 2013 [15:36:06] xff [15:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [15:36:47] ah:) [15:37:10] paravoid: thanks a log for your inspiration with the ssh module [15:37:14] *lot [15:37:53] that was more than a year ago [15:38:02] but you're welcome :) [15:41:47] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 15:41:41 UTC 2013 [15:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [15:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 15:42:42 UTC 2013 [15:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [15:43:47] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 15:43:42 UTC 2013 [15:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [15:44:57] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 15:44:47 UTC 2013 [15:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [15:53:13] matanya, I'm having flaky internet problems so won't be able to use gerrit for a while. Did you have a specific/immediate question? 
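The idea floated above for the frwiki collation job — do some reads on the slaves rather than the master — usually comes with a lag guard: only route a read to a replica whose replication delay is under a threshold, otherwise fall back to the master. A hedged sketch (server names and lag numbers are made up; the 316s figure mirrors the db53 delay alert seen in this channel):

```python
# Prefer the freshest replica under the lag cap; fall back to the master.
MAX_LAG_SECONDS = 30

def pick_read_server(master, replicas):
    """replicas: list of (name, lag_seconds) tuples."""
    usable = [(lag, name) for name, lag in replicas if lag <= MAX_LAG_SECONDS]
    return min(usable)[1] if usable else master

print(pick_read_server("db-master",
                       [("db53", 316), ("db54", 4)]))  # db54
print(pick_read_server("db-master",
                       [("db53", 316)]))               # db-master
```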
[15:53:40] andrewbogott: just some general stuff [15:55:57] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 15:55:53 UTC 2013 [15:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [15:56:47] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 15:56:43 UTC 2013 [15:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [15:59:11] <^d> manybubbles: Yo, you ready? :) [15:59:35] ^d: sure! you want to push this to test2 first? [15:59:55] <^d> Hmm, this config doesn't swap on per-wiki basis, lemme amend. [16:00:20] can you push to the appropriate machine? I saw something about that [16:01:17] <^d> That's test, not test2. [16:01:28] <^d> And it's an old way of testing things that's frowned upon :) [16:02:30] ah [16:03:46] !log tweaked permissions of the blog's w3 caching plugin to actually make it work again [16:04:00] do i dare try to update it... [16:04:00] Logged the message, RobH [16:07:07] !log updated akismet blog plugin [16:07:17] Logged the message, RobH [16:08:06] (03PS4) 10Chad: Use new LVS setup for search on test(2)wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/86743 [16:08:07] (03PS1) 10Chad: Use new LVS setup for search for all wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90916 [16:08:09] <^d> manybubbles: ^ [16:09:06] (03CR) 10Manybubbles: [C: 031] Use new LVS setup for search on test(2)wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/86743 (owner: 10Chad) [16:09:15] looks good to me [16:10:30] (03CR) 10Chad: [C: 032] Use new LVS setup for search on test(2)wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/86743 (owner: 10Chad) [16:10:39] (03Merged) 10jenkins-bot: Use new LVS setup for search on test(2)wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/86743 (owner: 10Chad) [16:11:34] !log demon synchronized 
wmf-config/CirrusSearch-production.php 'LVS for cirrus on test2wiki' [16:11:47] Logged the message, Master [16:14:06] (03CR) 10Ori.livneh: [C: 032] "Tested on mw60" [operations/apache-config] - 10https://gerrit.wikimedia.org/r/90869 (owner: 10Ori.livneh) [16:15:13] <^d> manybubbles: I'm getting results on test2. [16:15:30] ^d: me too. I'm happy [16:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 16:18:42 UTC 2013 [16:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [16:20:07] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 16:19:57 UTC 2013 [16:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [16:21:13] (03CR) 10Manybubbles: [C: 031] "Working on test2 so should be good to deploy" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90916 (owner: 10Chad) [16:21:47] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 16:21:44 UTC 2013 [16:21:57] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 16:21:49 UTC 2013 [16:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [16:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [16:25:41] <^d> manybubbles: Gonna flip the switch on the rest now [16:25:49] sounds good to me! 
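The staged rollout above — LVS for Cirrus on test(2)wiki first, then a second patch for all wikis — follows the usual wmf-config pattern of a per-wiki setting with a `default` fallback. This toy model only mimics the shape of that lookup; the setting name `wmfUseCirrusLVS` is hypothetical, not the actual variable in CirrusSearch-production.php:

```python
# Per-wiki override map with a 'default' fallback, in the style of
# wmf-config's InitialiseSettings arrays (names are illustrative).
settings = {
    "wmfUseCirrusLVS": {
        "default": False,   # the all-wikis patch flips this to True
        "testwiki": True,
        "test2wiki": True,
    }
}

def get_setting(name, wiki):
    per_wiki = settings[name]
    return per_wiki.get(wiki, per_wiki["default"])

print(get_setting("wmfUseCirrusLVS", "test2wiki"))  # True
print(get_setting("wmfUseCirrusLVS", "enwiki"))     # False
```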
[16:25:58] (03CR) 10Chad: [C: 032] Use new LVS setup for search for all wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90916 (owner: 10Chad) [16:26:08] (03Merged) 10jenkins-bot: Use new LVS setup for search for all wikis [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90916 (owner: 10Chad) [16:26:36] !log demon synchronized wmf-config/CirrusSearch-production.php 'LVS for cirrus on all wikis' [16:26:48] Logged the message, Master [16:27:31] looks good [16:27:45] <^d> Hmm, suggestions aren't working for me on mw.org [16:28:29] <^d> "ext.gadget.externalsearch" [16:28:32] <^d> What the heck is that? [16:28:47] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 16:28:40 UTC 2013 [16:28:55] <^d> Freaking a. [16:29:03] <^d> Some broken gadget enabled by default. [16:29:15] .... [16:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [16:29:19] great! [16:29:40] works for me, but I'm logged in [16:29:54] seems to work while logged out too [16:30:45] !log ran sync-apache for I1113b9594 & Ie73ce6213; verified on mw60; gracefuling. 
[16:30:58] Logged the message, Master [16:31:39] <^d> Suggestions when on the search page are what's broken [16:32:09] <^d> Also, we don't seem to be suggesting redirects :\ [16:32:57] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 16:32:48 UTC 2013 [16:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [16:33:47] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 16:33:44 UTC 2013 [16:34:17] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [16:34:41] (03PS1) 10Jforrester: Enable VisualEditor for NS_FILE, NS_HELP, NS_CATEGORY [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90923 [16:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 16:35:41 UTC 2013 [16:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [16:39:15] ^d: I see. I'm not sure we ever suggested redirects. [16:39:30] <^d> I could've sworn we did, hm. [16:40:05] lucenesearch certainly does [16:40:10] just checked that [16:41:47] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 16:41:42 UTC 2013 [16:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [16:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 16:42:43 UTC 2013 [16:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [16:43:47] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 16:43:43 UTC 2013 [16:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [16:44:47] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 16:44:43 UTC 2013 [16:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [16:51:17] matanya: still around? 
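The redirect gap discussed below ("we don't seem to be suggesting redirects" while lucenesearch does) comes down to what the suggester is built from: a suggester fed only article titles misses queries that match a redirect title. A toy illustration with made-up data:

```python
# Article titles vs. redirect titles; a suggester built from titles alone
# misses 'Mainpage', which lucene-search (indexing redirects) would find.
titles = ["Main Page", "MediaWiki"]
redirects = {"Mainpage": "Main Page"}

def suggest(prefix, include_redirects):
    pool = dict.fromkeys(titles)
    if include_redirects:
        pool.update(redirects)
    return sorted(t for t in pool if t.lower().startswith(prefix.lower()))

print(suggest("main", include_redirects=False))  # ['Main Page']
print(suggest("main", include_redirects=True))   # ['Main Page', 'Mainpage']
```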
[16:51:23] yes andrewbogott [16:51:39] I have ~15 minute now, and am in a cafe with apparently stable internet. What's up? [16:52:05] all cool. i'll grab a drink and be right back [16:52:22] anomie: heya, sorry you can't make it to the devops kickoff tomorrow, I tried to schedule it when you seemed free, what did I do wrong? [16:52:38] anomie: too late? [16:54:45] greg-g: too late. Alternating weeks after 5pm is not good for me. [16:55:02] gotcha [16:55:52] wait, confused anomie. Alternating? MW Core is at same time every week. [16:55:57] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 16:55:47 UTC 2013 [16:56:09] I just want to make sure I understand so I can respect your time better [16:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [16:56:18] greg-g: I make an exception for the MW Core meeting [16:56:31] anomie: gotcha [16:56:47] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 16:56:42 UTC 2013 [16:56:55] so you and Tim can never talk :( [16:57:10] "never" being a little extreme, of course ;) [16:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [17:01:15] greg-g: It's difficult sometimes. For OAuth we scheduled the weekly check-in at 6pm SF time. [17:01:27] Since we had no Europeans, that worked. [17:01:36] * greg-g nods [17:14:48] Who's on RT this week? [17:14:58] Or better yet, how would I find out who's on RT duty? 
[17:15:46] andrewbogott [17:15:47] https://wikitech.wikimedia.org/wiki/RT_Triage_Duty [17:16:12] cool, thanks [17:18:47] RECOVERY - Puppet freshness on cp1025 is OK: puppet ran at Mon Oct 21 17:18:44 UTC 2013 [17:19:27] PROBLEM - Puppet freshness on cp1025 is CRITICAL: No successful Puppet run in the last 10 hours [17:19:47] RECOVERY - Puppet freshness on cp1023 is OK: puppet ran at Mon Oct 21 17:19:44 UTC 2013 [17:20:37] PROBLEM - Puppet freshness on cp1023 is CRITICAL: No successful Puppet run in the last 10 hours [17:21:47] RECOVERY - Puppet freshness on cp1021 is OK: puppet ran at Mon Oct 21 17:21:44 UTC 2013 [17:22:07] RECOVERY - Puppet freshness on cp1035 is OK: puppet ran at Mon Oct 21 17:21:59 UTC 2013 [17:22:07] PROBLEM - Puppet freshness on cp1035 is CRITICAL: No successful Puppet run in the last 10 hours [17:22:08] mark, could you check https://gerrit.wikimedia.org/r/#/c/90665/ -- it fixes incorrect host matching and allows beta cluster to properly function as well [17:22:19] or paravoid ^ [17:22:47] PROBLEM - Puppet freshness on cp1021 is CRITICAL: No successful Puppet run in the last 10 hours [17:23:36] paravoid: is it possible to rotate the logs for swiftrepl.py on copper? it's at 4.8gb now with 2.3g left free (I moved some old logs this morning, see SAL) [17:24:14] !log The @Wikimedia Operations team is seeking proposals for a new datacenter in the midwestern/western US: https://blog.wikimedia.org/2013/10/21/rfp-new-datacenter-continental-us/ [17:24:23] No idea how well that will actually work... [17:24:28] Logged the message, Master [17:24:41] Sweet. It did [17:25:33] :) [17:25:46] Reedy: seattle equinix [17:28:11] (03CR) 10Akosiaris: "recheck" [operations/puppet] - 10https://gerrit.wikimedia.org/r/90766 (owner: 10Yuvipanda) [17:28:47] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 17:28:43 UTC 2013 [17:28:53] Offset the heat generated by the servers to make coffee? 
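The swiftrepl log problem raised above (a single 4.8 GB file with 2.3 GB of disk left) is the classic case for size-triggered rotation. In practice this would be a logrotate stanza with a `size` directive; the sketch below just shows the core decision, with a placeholder threshold:

```python
import os

# Rotate once the log crosses a size threshold; 1 GiB is an illustrative
# value, not what copper actually uses.
MAX_BYTES = 1 * 1024**3

def needs_rotation(path):
    try:
        return os.path.getsize(path) >= MAX_BYTES
    except FileNotFoundError:
        return False

def rotate(path):
    if needs_rotation(path):
        os.replace(path, path + ".1")  # keep one previous generation
        open(path, "w").close()        # start a fresh, empty log
```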
[17:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [17:29:57] Reedy: that sounds like a plan [17:30:57] Reedy: i worked with them there, and they are great, and meet the RFP, though somewhat pricy [17:31:25] Seems to be certainly an industry that you get what you pay for [17:32:57] RECOVERY - Puppet freshness on cp1029 is OK: puppet ran at Mon Oct 21 17:32:49 UTC 2013 [17:33:07] PROBLEM - Puppet freshness on cp1029 is CRITICAL: No successful Puppet run in the last 10 hours [17:33:47] RECOVERY - Puppet freshness on cp1032 is OK: puppet ran at Mon Oct 21 17:33:44 UTC 2013 [17:34:18] PROBLEM - Puppet freshness on cp1032 is CRITICAL: No successful Puppet run in the last 10 hours [17:35:38] Reedy: well, you may note what i said to Ken, if you wish [17:35:47] RECOVERY - Puppet freshness on cp1027 is OK: puppet ran at Mon Oct 21 17:35:39 UTC 2013 [17:36:27] PROBLEM - Puppet freshness on cp1027 is CRITICAL: No successful Puppet run in the last 10 hours [17:41:47] RECOVERY - Puppet freshness on cp1036 is OK: puppet ran at Mon Oct 21 17:41:45 UTC 2013 [17:42:47] PROBLEM - Puppet freshness on cp1036 is CRITICAL: No successful Puppet run in the last 10 hours [17:42:47] RECOVERY - Puppet freshness on cp1024 is OK: puppet ran at Mon Oct 21 17:42:41 UTC 2013 [17:43:17] PROBLEM - Puppet freshness on cp1024 is CRITICAL: No successful Puppet run in the last 10 hours [17:43:47] RECOVERY - Puppet freshness on cp1030 is OK: puppet ran at Mon Oct 21 17:43:46 UTC 2013 [17:44:07] PROBLEM - Puppet freshness on cp1030 is CRITICAL: No successful Puppet run in the last 10 hours [17:44:24] !log reclaiming cp1021-36, cp1041-42 per RT5981 [17:44:39] Logged the message, Master [17:44:57] RECOVERY - Puppet freshness on cp1026 is OK: puppet ran at Mon Oct 21 17:44:56 UTC 2013 [17:45:47] PROBLEM - Puppet freshness on cp1026 is CRITICAL: No successful Puppet run in the last 10 hours [17:55:48] gwicke do you want to turn hyper threading on for 
your test host? [17:55:57] RECOVERY - Puppet freshness on cp1033 is OK: puppet ran at Mon Oct 21 17:55:47 UTC 2013 [17:56:17] PROBLEM - Puppet freshness on cp1033 is CRITICAL: No successful Puppet run in the last 10 hours [17:56:47] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Mon Oct 21 17:56:42 UTC 2013 [17:57:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [18:10:52] (03PS1) 10Amire80: Add lang and dir attributes to the Wikimedia address for Echo [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90934 [18:12:30] (03PS2) 10Mwalker: Add BannerRandom filter to erbium [operations/puppet] - 10https://gerrit.wikimedia.org/r/90667 [18:14:17] PROBLEM - Host cp1021 is DOWN: PING CRITICAL - Packet loss = 100% [18:16:56] Reedy: it used to be better, with !Wikimedia group tag, but identi.ca killed that (and we killed identi.ca support I think?) [18:17:13] and yes, it would be nice to microblog more via morebots :P [18:17:47] PROBLEM - Host cp1023 is DOWN: PING CRITICAL - Packet loss = 100% [18:17:57] PROBLEM - Host cp1024 is DOWN: PING CRITICAL - Packet loss = 100% [18:18:07] PROBLEM - Host cp1025 is DOWN: PING CRITICAL - Packet loss = 100% [18:18:07] PROBLEM - Host cp1027 is DOWN: PING CRITICAL - Packet loss = 100% [18:18:08] PROBLEM - Host cp1026 is DOWN: PING CRITICAL - Packet loss = 100% [18:18:17] PROBLEM - Host cp1033 is DOWN: PING CRITICAL - Packet loss = 100% [18:18:17] PROBLEM - Host cp1032 is DOWN: PING CRITICAL - Packet loss = 100% [18:18:27] PROBLEM - Host cp1035 is DOWN: PING CRITICAL - Packet loss = 100% [18:18:34] * cmjohnson1 waives good-bye to cp1021-1036 [18:26:17] (03PS1) 10Reedy: Revert site specific config change to hewikivoyage [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90935 [18:26:21] mark: do you know if the https terminator boxes do de/compression as well as encryption? or are they straight pass through of whatever the squids give them? 
[18:27:55] (03CR) 10Reedy: [C: 032] Revert site specific config change to hewikivoyage [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90935 (owner: 10Reedy) [18:28:07] (03Merged) 10jenkins-bot: Revert site specific config change to hewikivoyage [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90935 (owner: 10Reedy) [18:29:00] !log reedy synchronized wmf-config/InitialiseSettings.php [18:29:07] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Mon Oct 21 18:29:00 UTC 2013 [18:29:12] Logged the message, Master [18:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [18:32:17] PROBLEM - RAID on cp1029 is CRITICAL: CRITICAL: Active: 1, Working: 1, Failed: 1, Spare: 0 [18:32:37] PROBLEM - Disk space on cp1029 is CRITICAL: DISK CRITICAL - /srv/sda3 is not accessible: Input/output error [18:32:47] PROBLEM - Disk space on cp1030 is CRITICAL: DISK CRITICAL - /srv/sda3 is not accessible: Input/output error [18:33:07] I should do this deploy [18:33:07] PROBLEM - RAID on cp1030 is CRITICAL: CRITICAL: Active: 1, Working: 1, Failed: 1, Spare: 0 [18:34:17] PROBLEM - RAID on cp1036 is CRITICAL: CRITICAL: Active: 1, Working: 1, Failed: 1, Spare: 0 [18:34:27] PROBLEM - Disk space on cp1036 is CRITICAL: DISK CRITICAL - /srv/sda3 is not accessible: Input/output error [18:34:55] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Non wikipedias to 1.22wmf22 [18:35:08] Logged the message, Master [18:35:36] (03CR) 10Ottomata: [C: 031] "I think this should be fine." 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/90667 (owner: 10Mwalker) [18:35:37] PROBLEM - LVS HTTP IPv6 on wikipedia-lb.esams.wikimedia.org_ipv6 is CRITICAL: HTTP CRITICAL: HTTP/1.1 502 Bad Gateway - 325 bytes in 0.221 second response time [18:35:57] (03CR) 10Jgreen: [C: 031] Add BannerRandom filter to erbium [operations/puppet] - 10https://gerrit.wikimedia.org/r/90667 (owner: 10Mwalker) [18:36:18] ottomata: Jeff_Green had voiced concerns about udp2log load? [18:36:37] PROBLEM - Host cp1036 is DOWN: PING CRITICAL - Packet loss = 100% [18:36:37] RECOVERY - LVS HTTP IPv6 on wikipedia-lb.esams.wikimedia.org_ipv6 is OK: HTTP OK: HTTP/1.1 200 OK - 67954 bytes in 0.460 second response time [18:36:47] PROBLEM - Host cp1030 is DOWN: PING CRITICAL - Packet loss = 100% [18:37:57] PROBLEM - Host cp1029 is DOWN: PING CRITICAL - Packet loss = 100% [18:39:14] yeah, i think it should be fine [18:39:41] erbium is running pretty spare at the moment. there should be plenty of headroom [18:40:06] and your filter doesn't look heavy, especially with the 100 sampling [18:40:08] mwalker: ^ [18:40:54] !log reedy synchronized wmf-config/interwiki.cdb 'Updating interwiki cache' [18:41:06] Logged the message, Master [18:42:10] (03PS1) 10Reedy: All non wikipedias to 1.22wmf22 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90938 [18:42:11] (03PS1) 10Reedy: Update interwiki cache [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90939 [18:42:16] ottomata: thanks [18:42:31] Jeff_Green: ^ [18:42:33] (03CR) 10Reedy: [C: 032] All non wikipedias to 1.22wmf22 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90938 (owner: 10Reedy) [18:42:40] (03CR) 10Reedy: [C: 032] Update interwiki cache [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90939 (owner: 10Reedy) [18:43:06] mwalker: ya [18:43:13] (03Merged) 10jenkins-bot: All non wikipedias to 1.22wmf22 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90938 (owner: 
10Reedy) [18:43:20] (03Merged) 10jenkins-bot: Update interwiki cache [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90939 (owner: 10Reedy) [18:43:55] hey ottomata, got a sec? [18:44:12] Jeff_Green, ottomata: so you both +1'd; and I don't have +2 permissions... one of you want to hit the button? :D [18:44:28] yeah ja [18:44:32] mwuhahahahha. hahahahha.mwuwh mwuh. [18:44:37] ottomata: you got it? [18:45:45] hah [18:45:47] yeah i can do! [18:45:59] (03PS3) 10Ottomata: Add BannerRandom filter to erbium [operations/puppet] - 10https://gerrit.wikimedia.org/r/90667 (owner: 10Mwalker) [18:46:05] (03CR) 10Ottomata: [C: 032 V: 032] Add BannerRandom filter to erbium [operations/puppet] - 10https://gerrit.wikimedia.org/r/90667 (owner: 10Mwalker) [18:46:10] whooo! [18:46:50] Jeff_Green: will those data files just start showing up on Al automagically; or will I have to ask you to retrieve them for me from erbium in a couple of hours? [18:47:25] mwalker: i can't remember offhand whether the rotation script needs tweaking. checking [18:49:17] PROBLEM - Host cp1042 is DOWN: PING CRITICAL - Packet loss = 100% [18:49:54] i need to tell the script to watch for them. [18:52:04] (03PS1) 10Jgreen: add log file to rotate_fundraising_logs to accompany change to erbium filter [operations/puppet] - 10https://gerrit.wikimedia.org/r/90941 [18:53:26] (03CR) 10Jgreen: [C: 032 V: 031] add log file to rotate_fundraising_logs to accompany change to erbium filter [operations/puppet] - 10https://gerrit.wikimedia.org/r/90941 (owner: 10Jgreen) [18:53:35] (03CR) 10Hashar: "(1 comment)" [operations/debs/kafka] (debian) - 10https://gerrit.wikimedia.org/r/90716 (owner: 10Hashar) [18:55:47] PROBLEM - Host cp1041 is DOWN: PING CRITICAL - Packet loss = 100% [18:56:09] ottomata: librdkafka supports dns roundrobin, so you could have a single dns record for all brokers if that makes your life simpler. 
[18:56:52] hmm, naw its easy to config them in puppet [18:56:58] the config has to exist elsewhere in puppet anyway [18:57:13] e.g. [18:57:33] class { 'varnishkafka': brokers => $role::analytics::kafka::brokers [18:57:35] or something like that [18:57:38] yokidoki [19:06:05] RECOVERY - Host praseodymium is UP: PING OK - Packet loss = 0%, RTA = 0.29 ms [19:22:46] (03PS1) 10Cmjohnson: Remvoing cp1021-cp1036 /cp1041-42 from puppet files [operations/puppet] - 10https://gerrit.wikimedia.org/r/90946 [19:24:17] (03CR) 10Cmjohnson: [C: 032] Remvoing cp1021-cp1036 /cp1041-42 from puppet files [operations/puppet] - 10https://gerrit.wikimedia.org/r/90946 (owner: 10Cmjohnson) [19:35:26] (03PS1) 10Cmjohnson: Removing dns entries for cp1021-36 and 1041/42 [operations/dns] - 10https://gerrit.wikimedia.org/r/90960 [19:36:57] (03CR) 10Cmjohnson: [C: 032] Removing dns entries for cp1021-36 and 1041/42 [operations/dns] - 10https://gerrit.wikimedia.org/r/90960 (owner: 10Cmjohnson) [19:38:17] !log dns update [19:38:33] Logged the message, Master [19:48:34] yay [20:01:35] paravoid, any thoughts on the kafka -> udp2log issue? [20:02:55] (03CR) 10Hashar: "After some investigations, I think 'trunk' comes from git-import-orig which has --upstream-branch defaulting to trunk. 
I am pretty sure t" [operations/debs/kafka] (debian) - 10https://gerrit.wikimedia.org/r/90716 (owner: 10Hashar) [20:03:19] (03PS1) 10Cmjohnson: Removing dns entrries for reclaimed servers arsenic and niobium [operations/dns] - 10https://gerrit.wikimedia.org/r/91036 [20:04:14] (03CR) 10Cmjohnson: [C: 032 V: 032] Removing dns entrries for reclaimed servers arsenic and niobium [operations/dns] - 10https://gerrit.wikimedia.org/r/91036 (owner: 10Cmjohnson) [20:05:04] !log dns update [20:09:00] (03PS1) 10Chad: bnwiki gets Cirrus as alternative [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91038 [20:10:38] (03CR) 10Ottomata: "(1 comment)" [operations/debs/kafka] (debian) - 10https://gerrit.wikimedia.org/r/90716 (owner: 10Hashar) [20:12:43] (03CR) 10Manybubbles: [C: 031] "I support this proposal." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91038 (owner: 10Chad) [20:15:30] hashar: does that make sense? [20:15:41] ottomata: yeah more or less :-D [20:15:42] I don't think either alex or I have built kafka using that gbp.conf [20:15:55] RobH: ping ;) [20:16:18] gwicke: so two of three of your servers are done, and i think cmjohnson1 is finishing up last [20:16:21] but had question for you [20:16:23] and 'trunk' is actually a real branch name [20:16:26] do you want hyperthreading on or off? [20:16:26] ottomata: I am looking at creating the packages for us whenever someone sends a patchset in Gerrit [20:16:31] right now its all off [20:16:41] we can turn one system on if you wanted to see if it changes things? [20:16:45] yeah hashar that would be cool [20:16:46] whatever you need =] [20:16:48] actually robh/gwicke they're all done I was waiting for a response to the HT q [20:16:53] cool [20:16:55] would it work for different branches?
[20:16:59] we haven't been committing the build branches [20:17:03] there'd be too many [20:17:10] RobH: not sure re hyperthreading- I guess on would not hurt [20:17:21] <^demon|lunch> manybubbles: I'm going to do it at 4pm sf time. [20:17:49] gwicke: well, we can turn it on one, and off on other two, and you can compare and see if it matters? [20:17:49] ^demon|lunch: sounds good to me. I'll be available in case of disaster but don't expect anything [20:17:51] afaik Cassandra does use multiple cores quite well [20:17:51] sound good? [20:17:59] or can turn on two and off one [20:17:59] or that, yes [20:18:04] your call =] [20:18:13] i say one on, rest off, but thats cuz they are all off [20:18:18] heh [20:18:18] I'd just turn it on in general [20:18:20] ok [20:18:23] cmjohnson1: ^ [20:18:33] ah, it does not matter that much I think [20:18:44] that parameter is not something I planned to study so far [20:19:22] k [20:21:17] PROBLEM - Host cerium is DOWN: PING CRITICAL - Packet loss = 100% [20:25:47] RECOVERY - Host cerium is UP: PING OK - Packet loss = 0%, RTA = 0.26 ms [20:25:57] PROBLEM - Host xenon is DOWN: PING CRITICAL - Packet loss = 100% [20:28:37] PROBLEM - Host praseodymium is DOWN: PING CRITICAL - Packet loss = 100% [20:29:27] RECOVERY - Host xenon is UP: PING OK - Packet loss = 0%, RTA = 0.26 ms [20:30:24] (03PS1) 10Cmjohnson: Removing cp1021 from role/cache.pp [operations/puppet] - 10https://gerrit.wikimedia.org/r/91041 [20:30:55] gwicke: they're all yours [20:31:21] cmjohnson1: awesome, thanks! [20:31:47] can't log into cerium yet [20:32:27] give it a go now (gwicke) [20:32:51] still no luck as gwicke [20:33:12] now that's odd...robh..can you try plz [20:33:15] (03CR) 10Hashar: "Thanks for clarifying the workflow in use.
I guess I will get some more debs in Jenkins to gain more experience with git buildpackage work" [operations/debs/kafka] (debian) - 10https://gerrit.wikimedia.org/r/90716 (owner: 10Hashar) [20:33:17] I'm trying from bast1001 [20:33:37] RECOVERY - Host praseodymium is UP: PING OK - Packet loss = 0%, RTA = 0.33 ms [20:33:53] # cerium,praseodymium and xenon are cassandra test host [20:33:53] node /^(cerium|praseodymium|xenon)\.eqiad\.wmnet$/ { [20:33:53] include standard [20:33:53] i can get in from bast1001 [20:33:54] } [20:33:59] there are no notes to let gwicke in manifest [20:34:18] gwicke: You'll want to either have someone include you, or you can submit your own patchset to give yourself the rights [20:34:26] yeah...didn't add that [20:34:26] cmjohnson1: wanna handle that from ops side? [20:34:39] yea, i expected gwicke to do it, but never conveyed it to him ;] [20:34:49] gwicke: So, usually, what will happen is we will push hosts online with nothing special. [20:35:02] then dev will usually submit a site.pp patchset adding themselves to sudo for that host [20:35:08] and some ops person will merge [20:35:12] RobH: I have not done puppet access right management before, so would need some hand holding [20:35:14] or they'll ask an ops person to add and merge [20:35:24] ahh, well, we can do it, or we can help you do it (either way is cool) [20:35:30] but i rather teach you to fish =] [20:35:41] ok [20:35:55] * gwicke updates the puppet checkout [20:35:57] the easy way to find out 'how should i do this as a dev' is look for an example like ori-l's patches [20:36:17] he tends to submit patchsets for his works in progress and flag appropriate ops folks [20:36:30] so, lets see... 
[20:36:58] if you open site.pp, and look at the entry for formey [20:37:09] you can see an example where chad has local sudo rights on the box for that [20:37:15] line 990 [20:37:25] (03CR) 10Ottomata: [C: 032 V: 032] gbp: do not set export-dir [operations/debs/kafka] (debian) - 10https://gerrit.wikimedia.org/r/90716 (owner: 10Hashar) [20:37:45] So sudo rights are usually not included in the role classes, but under individual server entries in site.pp [20:38:23] gwicke: Keep in mind that just cuz i think devs should know this doesnt mean you have to know this, im not in charge of anyone =] [20:39:05] RobH: let me have a look at it and see if I can figure it out [20:39:16] need to clean my checkout first [20:39:18] cool, if you get frustrated, or when you have a patchset, lemme know [20:39:24] *nod* [20:39:25] you can add me as reviewer, im happy to merge. [20:39:39] (for the sudo and like stuff, since that stuff i get) [20:39:49] when you get crazy into cassandra tweaking in puppet, im not the dude ;] [20:40:07] i'm working on that right now [20:40:54] ah, just similar, nevermind, another access request [20:47:56] Someone around with access to the ldap logs? [20:48:07] for what purpose? [20:48:09] A user has problems to log in to gerrit [20:48:17] We've checked the gerrit db. [20:48:22] And the ldap account. [20:48:28] Both look fine. [20:49:04] At some point there was some weird error logged, [20:49:19] that hinted towards an ldap query not finding groups. [20:49:55] it shouldn't be a problem to not find groups [20:50:11] Yes. [20:50:38] Still the error message we saw seemed to be caused by an ladp query going wrong. [20:51:02] So I was curious if the query itself (that gerrit sends) was sound [20:55:45] Ryan_Lane: Since I do not understand what goes wrong in code path that causes the problem [20:56:00] Seeing the actual queries would be super-helpful in debugging the problem. 
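The pattern RobH walks through above — per-host sudo grants attached to node entries in site.pp rather than to role classes — would look roughly like this for the cassandra test hosts. The node regex is the one pasted in channel; the `sudo_user` define and its privilege string are assumptions, not the real manifest:

```puppet
# cerium, praseodymium and xenon are cassandra test hosts
node /^(cerium|praseodymium|xenon)\.eqiad\.wmnet$/ {
    include standard

    # Hypothetical grant, following the formey example discussed
    # above: the sudo rights live on the node entry itself, not
    # inside a role class.
    sudo_user { 'gwicke':
        privileges => ['ALL = (ALL) NOPASSWD: ALL'],
    }
}
```

A patchset like the "Sudo for gwicke on cassandra test cluster" change that follows then only has to touch this one node block, which is why ops encourage devs to submit it themselves.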
[20:56:18] (03PS1) 10Dzahn: sudo -u parsoid access for parsoid admins [operations/puppet] - 10https://gerrit.wikimedia.org/r/91043 [20:58:31] well, that's easier said than done [20:59:30] qchris: the user can log into wikitech? [20:59:36] Yes. [21:00:26] Hello. [21:00:38] (03PS1) 10Cmjohnson: Removing ms2 from dsh groups as it's decom'd [operations/puppet] - 10https://gerrit.wikimedia.org/r/91044 [21:00:41] hi DGarry [21:00:50] Hey Ori! [21:01:02] So I hear we're discussing my inability to access gerrit? [21:01:13] DGarry: Yes. [21:01:15] we are? Oh, I think qchris and Ryan_Lane are [21:01:29] :-) [21:01:43] DGarry: can you please try logging into it now? [21:01:44] ori-l: Yep. :) [21:02:06] (03PS1) 10Cmjohnson: Removing DNS entries for ms2 [operations/dns] - 10https://gerrit.wikimedia.org/r/91045 [21:02:11] Ryan_Lane: Just did. Same error as usual. Invalid username or password. [21:02:23] try again. I'm tailing the logs [21:02:44] what username are you trying to log in with? [21:02:48] Deskana [21:02:57] I'm not seeing it in the logs [21:03:02] * Ryan_Lane looks at virt1000 [21:03:05] That's weird. [21:03:24] try now [21:03:39] (03CR) 10Cmjohnson: [C: 032] Removing DNS entries for ms2 [operations/dns] - 10https://gerrit.wikimedia.org/r/91045 (owner: 10Cmjohnson) [21:03:50] Ryan_Lane: Same error. [21:04:03] ok. I see logs now [21:04:03] !log dns update [21:04:17] Yippie! [21:04:38] bblack: do you have a sec for https://gerrit.wikimedia.org/r/#/c/90665/ [21:05:25] (03PS1) 10GWicke: Sudo for gwicke on cassandra test cluster [operations/puppet] - 10https://gerrit.wikimedia.org/r/91046 [21:06:46] !log morebots are you working? [21:07:00] Logged the message, Master [21:07:32] hm. weird [21:07:35] maybe replication is broken? [21:07:49] We have a second affected user as well [21:08:33] yeah, I'm seeing deskana's entry on virt0 and not virt1000 [21:08:37] I'll pm you the other user name. 
Maybe it fits the pattern as well [21:08:56] Does that explain why I can log in to wikitech.wikimedia.org but not gerrit.wikimedia.org? [21:09:47] (We also tried logging in to a different gerrit instance with the same ldap config, there logging in worked) [21:10:02] I've already figured out the pattern ;) [21:10:24] So replication is broken? [21:11:20] (03PS1) 10Dzahn: add account marktraceur and add to stat1 [operations/puppet] - 10https://gerrit.wikimedia.org/r/91047 [21:11:26] Yayyyy [21:11:30] * marktraceur looks at watch [21:11:50] Thanks mutante! [21:11:51] (03CR) 10Cmcmahon: [C: 031] "oddities are bad" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90670 (owner: 10CSteipp) [21:12:14] (03PS4) 10BBlack: Fixed incorrect domain matching for ZERO [operations/puppet] - 10https://gerrit.wikimedia.org/r/90665 (owner: 10Yurik) [21:12:49] (03CR) 10jenkins-bot: [V: 04-1] add account marktraceur and add to stat1 [operations/puppet] - 10https://gerrit.wikimedia.org/r/91047 (owner: 10Dzahn) [21:12:59] :( [21:13:04] Jenkins is just being spiteful [21:13:19] yeah, replication is broken [21:13:24] and gerrit points at virt1000 [21:13:32] (03CR) 10BBlack: [C: 032] Fixed incorrect domain matching for ZERO [operations/puppet] - 10https://gerrit.wikimedia.org/r/90665 (owner: 10Yurik) [21:13:52] marktraceur: can you do me a favor and please paste your SSH key here https://office.wikimedia.org/wiki/User:MHolmquist as your wiki user? [21:14:03] Sure sure [21:14:28] Ryan_Lane: Ok. That explains things. [21:15:59] yurik_: done [21:16:25] yurik: any progress with https://bugzilla.wikimedia.org/show_bug.cgi?id=54822 ? [21:18:03] (03PS2) 10Dzahn: add account marktraceur and add to stat1 [operations/puppet] - 10https://gerrit.wikimedia.org/r/91047 [21:20:21] Ryan_Lane: I need to go soon. Is there anything else you need from me to fix this? [21:20:27] nope [21:20:46] (03CR) 10BBlack: "Really, we should fix this elsewhere. 
The basic idea is for all varnishes (not just mobile), do processing of trusted proxies (Opera, Nok" [operations/puppet] - 10https://gerrit.wikimedia.org/r/88261 (owner: 10Dr0ptp4kt) [21:20:49] Ryan_Lane: Great. Thanks a bunch. :) [21:21:24] no yurik_ anymore [21:21:33] patch got merged, he doesn't need us anymore :P [21:21:42] greg-g: ping [21:22:17] hi there [21:22:22] hello [21:22:34] AaronSchulz & me want to reenable multiwrite for swift @ eqiad [21:22:40] paravoid: I should have held it hostage! [21:23:04] paravoid: aha [21:23:18] when would you like to do that? [21:23:19] I pinged you about it last week too but we never synced up and I didn't send an email :-) [21:23:42] it can go in asap, maybe even now [21:23:50] or I can do it at european hours whenever [21:24:05] now is probably fine, honestly. [21:26:05] bleh. replication must have gone out of sync for some period of time [21:26:13] and now I need to reinitialize virt1000 [21:26:15] replication of what? [21:26:23] ldap [21:26:25] oh [21:26:28] no clue why [21:26:29] ouch [21:26:31] yeah [21:26:39] :-( [21:26:47] reinitialization only takes a few secs [21:26:50] but it's not a good sign [21:26:57] I wonder what caused that to break [21:27:56] paravoid: oh, now, hehe, ok :) [21:28:06] if you're busy, I can handle it [21:28:11] handle it myself I mean [21:28:20] no worries [21:28:26] * AaronSchulz will commit [21:29:13] (03PS1) 10Aaron Schulz: Added eqiad swift to multiwrite backends [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91050 [21:29:20] oh [21:29:24] need to add replication monitoring [21:29:25] damn, I had vi open [21:29:31] :) [21:29:34] (03Abandoned) 10Edenhill: Make scratch buffer size configurable (issue #2) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90028 (owner: 10Edenhill) [21:29:39] Ryan_Lane: I'm sure there's a ticket for that somewhere :P [21:29:41] (03Abandoned) 10Edenhill: Log failed Kafka message deliveries (issue #1) 
[operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90029 (owner: 10Edenhill) [21:29:43] definitely should never get to this point [21:29:50] it's dangerous for this to happen [21:29:55] (03Abandoned) 10Edenhill: Provide some more detail when Kafka ..produce() fails. [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90030 (owner: 10Edenhill) [21:30:20] I wonder if this happened during our DNS change [21:30:24] I bet it did [21:30:33] (03Abandoned) 10Edenhill: Added rate-limiting to (most) error logs generated by varnishkafka (issue #1) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90184 (owner: 10Edenhill) [21:30:53] (03CR) 10Faidon Liambotis: [C: 031] "LGTM" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91050 (owner: 10Aaron Schulz) [21:30:55] Ryan_Lane: We had the first reports of gerrit problems around early october. [21:31:13] why didn't anyone say anything? :) [21:31:31] (03CR) 10Aaron Schulz: [C: 032] Added eqiad swift to multiwrite backends [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91050 (owner: 10Aaron Schulz) [21:31:33] Hehe. [21:31:51] (03Merged) 10jenkins-bot: Added eqiad swift to multiwrite backends [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91050 (owner: 10Aaron Schulz) [21:31:52] We did try to make sure that we do everything possible to rule out errors on our side. 
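The change merged above ("Added eqiad swift to multiwrite backends", then live-synced as wmf-config/filebackend.php) makes MediaWiki write files to both Swift clusters while still reading from one master. A rough sketch of the shape such a FileBackendMultiWrite configuration takes — the backend names, lock manager, and the 'template' reference style are all assumptions, not the production file:

```php
<?php
// Hypothetical multiwrite sketch: FileBackendMultiWrite fans every
// write out to all listed backends and serves reads from the master.
// Names below are placeholders, not the production configuration.
$wgFileBackends[] = [
    'name'        => 'local-multiwrite',
    'class'       => 'FileBackendMultiWrite',
    'lockManager' => 'fsLockManager',
    'backends'    => [
        // One cluster stays authoritative for reads...
        [ 'template' => 'local-swift', 'isMultiMaster' => true ],
        // ...while the eqiad cluster receives a copy of each write.
        [ 'template' => 'local-swift-eqiad' ],
    ],
];
```

If the copies drift apart, a maintenance pass like the syncFileBackend run logged later in the channel reconciles them.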
[21:31:58] reinitialized [21:32:01] (03PS1) 10Edenhill: Added statistics (both from varnishkafka and librdkafka) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/91052 [21:32:02] (03PS1) 10Edenhill: Grow scratch pad by temporary buffers if necessary (issue #2) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/91053 [21:32:03] (03PS1) 10Edenhill: Limit maximum tag size content [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/91054 [21:32:04] (03PS1) 10Edenhill: Increase string renderer output buffer to 8K [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/91055 [21:32:05] Getting access to ldap logs is not easy ;-) [21:32:05] (03PS1) 10Edenhill: Avoid unnecessary clearing of scratch pad on logline alloc. [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/91056 [21:32:24] thankfully we only write to one place [21:32:32] otherwise that could have been nasty [21:32:49] :-) [21:32:58] Thanks for looking into it. [21:33:24] yw [21:33:26] As DGarry is gone now, I'll check with him by email. [21:33:29] * Ryan_Lane nods [21:33:30] AaronSchulz: paravoid does the swift multiwrite need a deploy, or just waiting for puppet? [21:33:37] syncfile [21:33:43] (03Restored) 10Edenhill: Added rate-limiting to (most) error logs generated by varnishkafka (issue #1) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90184 (owner: 10Edenhill) [21:33:50] (03Restored) 10Edenhill: Make scratch buffer size configurable (issue #2) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90028 (owner: 10Edenhill) [21:33:55] (03Restored) 10Edenhill: Log failed Kafka message deliveries (issue #1) [operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90029 (owner: 10Edenhill) [21:33:59] (03Restored) 10Edenhill: Provide some more detail when Kafka ..produce() fails. 
[operations/software/varnish/varnishkafka] - 10https://gerrit.wikimedia.org/r/90030 (owner: 10Edenhill) [21:34:03] !log aaron synchronized wmf-config/filebackend.php 'Added eqiad swift to multiwrite backends' [21:34:15] Logged the message, Master [21:34:16] that :) [21:34:52] I see reqs on the logs [21:35:03] backend error log looks fine [21:35:04] PROBLEM - Disk space on professor is CRITICAL: DISK CRITICAL - free space: /a 21574 MB (3% inode=99%): [21:35:20] * AaronSchulz looks at ori-l [21:36:34] (03CR) 10Ori.livneh: [C: 031] "I'm inclined to merge this, because" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/90265 (owner: 10Aaron Schulz) [21:36:41] hrm? [21:36:44] (03PS1) 10Awjrichards: Ensure that m.mediawiki.org will work as an origin for CORS [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91058 [21:36:45] oh [21:36:46] professor [21:36:48] fml [21:36:50] * ori-l looks [21:37:14] (03CR) 10Chad: [C: 031] Switch to single Json object for gerrit's reviewer count query [operations/puppet] - 10https://gerrit.wikimedia.org/r/84743 (owner: 10QChris) [21:41:38] AaronSchulz: what's the rationale behind r90265? [21:41:51] (purge/thumbnail rate limits) [21:41:54] (03CR) 10Dzahn: [C: 031] "key confirmed: https://office.wikimedia.org/w/index.php?title=User:MHolmquist/Key&action=history" [operations/puppet] - 10https://gerrit.wikimedia.org/r/91047 (owner: 10Dzahn) [21:42:03] is it the even where we had swift getting DoSed by too many DELETEs? [21:42:09] s/even/event/ [21:43:16] https://www.mediawiki.org/wiki/Special:Code/MediaWiki/r90265 [21:43:18] moo? 
[21:43:33] paravoid: you mean the wmf-config thing [21:43:54] I meant https://gerrit.wikimedia.org/r/90265 [21:44:54] just disk space, wasted i/o and cpu, and cache eviction even if we didn't use swift [21:47:22] (03CR) 10Dzahn: [C: 031] Sudo for gwicke on cassandra test cluster [operations/puppet] - 10https://gerrit.wikimedia.org/r/91046 (owner: 10GWicke) [21:48:34] Krinkle: zuul and jenkins have generated 41G and 28G of graphite data, respectively. are there configuration options you could tweak to reduce the data points that get logged? this would be temporary, while graphite is still in tampa. [21:48:47] how about swift, ori-l? [21:49:00] I still haven't tweaked the sampling rate [21:49:03] swift is at 11G [21:49:07] and it was a bit excessive last time I checked [21:50:10] 11G in two weeks [21:50:12] doesn't sound great [21:50:16] I'll have a look [21:50:18] well, full disclosure, before i start harassing people: client-side stats (that's navigation timing and some ve) is at 106G [21:51:01] but for a much larger period, isn't it [21:51:12] also, considering that we do nothing with the swift stats than collecting them now... :) [21:51:14] yeah, several months [21:51:23] ori-l: I don't know any of the logging things hashar set up for jenkins/zuul. [21:51:50] Krinkle: OK, I'll poke him; it's not an emergency or anything. I can clear up disk space elsewhere in the interim. [21:51:57] ori-l: if you have spare cycles, I'd love your advice on what swift views to have [21:52:20] response time avg/95p/99p by method would be one I guess [21:52:41] ori-l: Any specifics on what kind of data? [21:53:09] the data as it appears in the graphs, or seemingly redundant data that isn't used but acompanies the data? [21:53:15] paravoid: I'm pretty new to this, but I found this very useful / persuasive: http://matt.aimonetti.net/posts/2013/06/26/practical-guide-to-graphite-monitoring/ [21:53:20] eg. 
https://ganglia.wikimedia.org/latest/graph_all_periods.php?title=Jenkins+Queues&vl=&x=&n=&hreg%5B%5D=gallium&mreg%5B%5D=jenkins_overallload>ype=line&glegend=show&aggregate=1 [21:53:35] ori-l: I'll have a look, thanks [21:53:39] that's not graphite [21:53:40] ori-l: but this isn't about graphite per se [21:53:42] paravoid: specifically: 'Neither median nor mean can summarize the whole story of your system’s behavior. Instead I prefer to use a 5-95 span (thanks Steve Akers for showing me this metric and most of what I know about Graphite). A 5-95 span means that we cut off the extreme outliers above 95% and below 5%.' [21:53:57] I'd like us to be aligned in what data we collect [21:54:02] er, graphs we present I mean [21:54:22] (03CR) 10MarkTraceur: "(1 comment)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/91047 (owner: 10Dzahn) [21:54:54] ori-l: btw, my main motivation is not performance, but rather fixing a rather crazy piece of our setup [21:55:21] if you go to http://ganglia.wikimedia.org/latest/?r=hour&cs=&ce=&tab=v&vn=swift+frontend+proxies [21:55:51] you'll see a bunch of response time graphs [21:56:06] these are generated by an apache log parsers to ganglia script [21:56:25] so we log each and every request unsampled up to 4 times [21:56:46] (then we syslog that to fenari, which writes to the netapp, which gets replicated to the netapp across DCs -- but that's another story) [21:56:50] paravoid: anyway, that patch only does render/linkpurge not action=purge [21:57:04] RECOVERY - Disk space on professor is OK: DISK OK [21:57:53] what is "linkpurge"? [21:57:55] paravoid: well, which views have been useful? [21:58:19] (03CR) 10DarTar: "I did specify the preferred username but only in a comment to the original ticket, sorry about that." 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/91047 (owner: 10Dzahn) [21:58:42] they're frequently wrong and hard to decipher with all the colors anyway [21:59:01] but I do think that having some performance metrics might help in the future [21:59:07] paravoid: when someone edits or does a recursive=true API purge to do page link table updates [21:59:30] might help me in debugging e.g. disk/raid controller issues or other kind of outages, might also help you in enhancing performance across the stack I'd hope [21:59:54] paravoid: blargh, have to go to a meeting, bbiaw. (joys of being local.) [22:00:07] it's not urgent anyway. [22:00:26] well, it touches on stuff that i've been wondering about too [22:00:34] so it'd be a useful discussion to have [22:04:16] !log Running syncFilebackend on all wikis (should reduce sync errors; a few a popping up in the logs) [22:04:18] (03PS1) 10Dzahn: give milimetric sudo privileges on analytics nodes [operations/puppet] - 10https://gerrit.wikimedia.org/r/91067 [22:04:29] Logged the message, Master [22:05:57] (03PS2) 10Dzahn: give milimetric sudo privileges on analytics nodes [operations/puppet] - 10https://gerrit.wikimedia.org/r/91067 [22:09:35] (03PS8) 10Andrew Bogott: Move mysql_wmf into a module. [operations/puppet] - 10https://gerrit.wikimedia.org/r/88666 [22:13:53] !log maxsem synchronized php-1.22wmf22/extensions/MobileFrontend/ 'https://gerrit.wikimedia.org/r/91065' [22:13:56] (03CR) 10Andrew Bogott: "Sean -- this new patch incorporates your recent patch 'icinga pmp-check-mysql-innodb idle_blocker_duration.' 
Please verify that I didn't " [operations/puppet] - 10https://gerrit.wikimedia.org/r/88666 (owner: 10Andrew Bogott) [22:14:03] Logged the message, Master [22:14:42] (03CR) 10CSteipp: "Fine as long as the WMF owns the domain (I think we do)" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91058 (owner: 10Awjrichards) [22:16:07] (03CR) 10CSteipp: "And by that, I mean all the sub domains under the top-level domain." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91058 (owner: 10Awjrichards) [22:17:19] (03PS1) 10Bsitu: Enable Echo on all wikis except dewiki and itwiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91072 [22:21:04] (03CR) 10Bsitu: [C: 04-2] "Do not merge till deployment window" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91072 (owner: 10Bsitu) [22:24:01] (03PS3) 10Physikerwelt: Mathoid service [operations/puppet] - 10https://gerrit.wikimedia.org/r/90733 [22:27:47] (03PS1) 10Dzahn: add account for Gerrit Padgham and add to stat1 [operations/puppet] - 10https://gerrit.wikimedia.org/r/91075 [22:30:03] PROBLEM - Host ms-be1004 is DOWN: PING CRITICAL - Packet loss = 100% [22:32:13] (03CR) 10Dzahn: [C: 04-1] "not yet, pending approval and key check" [operations/puppet] - 10https://gerrit.wikimedia.org/r/91075 (owner: 10Dzahn) [22:32:29] paravoid: wee [22:33:24] (03CR) 10GWicke: [C: 031] Sudo for gwicke on cassandra test cluster [operations/puppet] - 10https://gerrit.wikimedia.org/r/91046 (owner: 10GWicke) [22:33:58] RobH: ^^ [22:44:10] ... [22:45:59] getting scared? [22:49:00] !log powercycling ms-be1004, locked up(?) 
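Back on the metrics thread: the "5-95 span" ori-l quotes from the Graphite guide is easy to state precisely — trim the extreme outliers below the 5th and above the 95th percentile and report the width of what remains. A minimal stdlib sketch; the interpolation method is an assumption, and Graphite's own percentile functions may round differently:

```python
import statistics

def span_5_95(samples):
    """Return (p5, p95, span) for a list of response-time samples.

    The 20-quantile cut points land exactly on the 5th and 95th
    percentiles, so the distance between the first and last cut
    point is the 5-95 span described in the guide.
    """
    q = statistics.quantiles(samples, n=20, method='inclusive')
    p5, p95 = q[0], q[-1]
    return p5, p95, p95 - p5

# With only a handful of samples the 95th percentile still
# interpolates toward a pathological outlier, so the span is most
# meaningful on larger sample sets.
p5, p95, span = span_5_95([12, 14, 15, 15, 16, 17, 18, 20, 25, 900])
```

Unlike a plain mean, the span summarizes the bulk of the distribution without letting a few extreme requests (or suspiciously fast cache hits) dominate the number.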
[22:49:13] RECOVERY - Host ms-be1004 is UP: PING OK - Packet loss = 0%, RTA = 0.49 ms [22:49:18] Logged the message, Master [22:49:34] (03CR) 10Andrew Bogott: "I think the removal of the exec { "mkdir /var/spool/exim4/scan"} section is still correct here -- Matanya, care to resubmit with just that" [operations/puppet] - 10https://gerrit.wikimedia.org/r/86889 (owner: 10Matanya) [22:51:16] (03PS1) 10JGonera: Add mobile views to ganglia [operations/puppet] - 10https://gerrit.wikimedia.org/r/91079 [22:53:46] http://ganglia.wikimedia.org/latest/graph.php?r=hour&z=xlarge&h=ms-be1004.eqiad.wmnet&m=cpu_report&s=descending&mc=2&g=cpu_report&c=Swift+eqiad [22:53:49] fun [22:57:06] (03PS1) 10Dzahn: add account fflorin and add to stat1 [operations/puppet] - 10https://gerrit.wikimedia.org/r/91084 [23:01:17] (03CR) 10Chad: [C: 032] bnwiki gets Cirrus as alternative [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91038 (owner: 10Chad) [23:02:13] (03PS2) 10JGonera: Add mobile views to ganglia [operations/puppet] - 10https://gerrit.wikimedia.org/r/91079 [23:03:27] (03CR) 10Dzahn: [C: 04-1] "not yet, pending manager approval" [operations/puppet] - 10https://gerrit.wikimedia.org/r/91084 (owner: 10Dzahn) [23:05:01] !log mwalker synchronized php-1.22wmf21/extensions/CentralNotice/ 'Updating CentralNotice to master' [23:05:12] Logged the message, Master [23:05:32] !log mwalker synchronized php-1.22wmf22/extensions/CentralNotice/ 'Updating CentralNotice to master' [23:05:45] Logged the message, Master [23:06:20] (03PS1) 10Faidon Liambotis: Reenable LVS paging check for ms-fe.eqiad.wmnet [operations/puppet] - 10https://gerrit.wikimedia.org/r/91090 [23:06:22] RobH: ping [23:07:09] !log demon synchronized wmf-config/InitialiseSettings.php 'bnwiki gets cirrus as secondary' [23:07:21] Logged the message, Master [23:07:40] (03CR) 10Faidon Liambotis: [C: 04-1] "Would it be reasonable for these to go to the Navigation Timing view instead? 
This looks like two graphs, might be a bit too excessive for" [operations/puppet] - 10https://gerrit.wikimedia.org/r/91079 (owner: 10JGonera) [23:08:05] (03CR) 10Faidon Liambotis: [C: 032] Reenable LVS paging check for ms-fe.eqiad.wmnet [operations/puppet] - 10https://gerrit.wikimedia.org/r/91090 (owner: 10Faidon Liambotis) [23:09:14] (03CR) 10Faidon Liambotis: [V: 032] Reenable LVS paging check for ms-fe.eqiad.wmnet [operations/puppet] - 10https://gerrit.wikimedia.org/r/91090 (owner: 10Faidon Liambotis) [23:09:14] <^demon|lunch> !log elastic: index created for bnwiki, running force indexing in 4 processes on terbium [23:09:26] Logged the message, Master [23:12:07] (03CR) 10Ryan Lane: [C: 032] localssl: listen on both ipv6 and ipv4 sockets [operations/puppet] - 10https://gerrit.wikimedia.org/r/90738 (owner: 10Ryan Lane) [23:15:09] (03PS1) 10Ryan Lane: Pass localssl traffic to ipaddress rather than 127.0.0.1 [operations/puppet] - 10https://gerrit.wikimedia.org/r/91091 [23:15:13] paravoid: ^^ [23:16:59] !log CentralNotice deploy had an issue where we are now pushing ALL traffic to the mobile site for CN -- ganglia reports a spike and I'm now reverting the change [23:17:13] Logged the message, Master [23:18:18] nope [23:18:31] you either need to do scope.lookupvar [23:18:38] or copy $::ipaddress to the local scope [23:18:52] it's facter... [23:18:55] so it should be global [23:19:00] yes, it's global [23:19:11] but I don't think you can't just reference the global variable from within the template like that [23:19:17] since when? [23:19:18] (03PS3) 10Dzahn: add account marktraceur and add to stat1 [operations/puppet] - 10https://gerrit.wikimedia.org/r/91047 [23:19:23] is this a 3.0ism? [23:19:28] well, it might work with 2.7 and log warnings [23:20:16] Oct 21 23:01:16 stafford puppet-master[20189]: Dynamic lookup of $ipaddress at /etc/puppet/templates/nginx/sites/proxy.erb:100 is deprecated. Support will be removed in Puppet 2.8. 
Use a fully-qualified variable name (e.g., $classname::variable) or parameterized classes. [23:20:21] there you go :) [23:20:23] -_- [23:20:30] (03PS1) 10Andrew Bogott: Remove reference to ssh::bastion [operations/puppet] - 10https://gerrit.wikimedia.org/r/91094 [23:20:37] but yeah, I guess you won't be alone in that, so we can fix it when we fix all the others [23:20:45] some things should always be globally scopped [23:20:50] *scoped [23:21:00] but in their own namespaces [23:21:07] I wish puppet had a fact[''] namespace [23:21:09] (03CR) 10Dzahn: "Mark, if it was also a different key i would have just disabled the old one (ensuring the old key absent) and given you the new one, but 2" [operations/puppet] - 10https://gerrit.wikimedia.org/r/91047 (owner: 10Dzahn) [23:21:16] !log mwalker synchronized php-1.22wmf22/extensions/CentralNotice/ 'Reverting earlier change' [23:21:25] and a global variable one too [23:21:30] Logged the message, Master [23:21:30] you can just do scope.lookupvar("::ipaddress") [23:21:32] then local variables would be in their own scope [23:21:36] bleh [23:22:08] puppet is one giant hack [23:22:18] I don't mind that [23:22:28] I was more annoyed at the implicit scoping tbh [23:22:39] facts pollute the scope [23:22:47] !log mwalker synchronized php-1.22wmf21/extensions/CentralNotice/ 'Reverting earlier change' [23:22:53] that's a separate issue, though [23:23:00] nod [23:23:03] Logged the message, Master [23:23:21] (03CR) 10Andrew Bogott: "Matanya, I'm about to merge this but you should have a look. No big deal, but could've been caught with a bit of grepping." 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/91094 (owner: 10Andrew Bogott) [23:23:23] I guess scope.lookupvar('::globalvar') is an acceptable namespacing [23:23:47] (03PS2) 10Ryan Lane: Pass localssl traffic to ipaddress rather than 127.0.0.1 [operations/puppet] - 10https://gerrit.wikimedia.org/r/91091 [23:23:49] paravoid: ^^ [23:24:59] so, I wonder if we should use ::ipaddress or the LVS service IP :) [23:25:36] (as for the scope lookup, I see @fqdn above, so it's not like you'd be alone in that) [23:26:48] so if we use the service IP, it will always go locally if the IP is bound on lo, which is what happens by lvs realserver [23:27:08] but if it's unbound, it'd still terminate SSL traffic and push it back to the rest of the servers via LVS [23:27:33] yeah, that's likely a good idea [23:27:33] the latter might be good, if there's a varnish issue, and it makes it more consistent with the non-localssl setup, but might also be counterintuitive [23:27:40] let's replace all the font package on imagescalers , yay https://gerrit.wikimedia.org/r/#/c/88441/ [23:27:52] I think going with lvs ip is good [23:28:20] paravoid: though, which IP will be used to communicate with it? [23:28:23] it's bound on lo [23:28:27] so it may be 127.0.0.1 [23:28:56] or would it be the lvs address? [23:29:36] I guess I could try and see :) [23:31:11] hm, is jenkins down? [23:31:56] looks like it comes in over the lvs IP [23:32:22] yep [23:32:34] ok, that looks like a winner [23:33:10] cp4001.ulsfo.wmnet 15 2013-10-21T23:31:22 0.000060320 198.35.26.106 hit/404 2802 GET http://bits.wikimedia.org/foo - - - - - [23:33:32] ah, there it goes [23:33:54] (03PS3) 10Ryan Lane: Pass localssl traffic to ipaddress_lo_lvs [operations/puppet] - 10https://gerrit.wikimedia.org/r/91091 [23:33:57] Ryan_Lane: that would make the config identical to non-local ssl too, wouldn't it? [23:33:59] paravoid: heh. you tested on the same host, eh? 
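The warning Ryan pastes earlier ("Dynamic lookup of $ipaddress at .../proxy.erb:100 is deprecated") is about bare variable references inside ERB templates resolving through dynamic scope. A minimal before/after sketch — the nginx directive here is an invented stand-in for whatever proxy.erb line 100 actually contains:

```erb
<%# Hypothetical stand-in for proxy.erb; only the lookup style matters. %>

<%# Deprecated: bare reference relies on dynamic scoping to find the fact. %>
proxy_pass http://<%= ipaddress %>;

<%# Safe: fully-qualified lookup of the global fact, as suggested in channel. %>
proxy_pass http://<%= scope.lookupvar('::ipaddress') %>;
```

An `@ipaddress` instance-variable reference also avoids the warning for facts, which matches the `@fqdn` style already used elsewhere in the same template.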
[23:34:02] (03CR) 10Andrew Bogott: [C: 032] Remove reference to ssh::bastion [operations/puppet] - 10https://gerrit.wikimedia.org/r/91094 (owner: 10Andrew Bogott) [23:34:06] paravoid: no [23:34:23] non-local ssl passes to a different internal -lb [23:34:29] because otherwise it would talk to itself [23:34:40] oh, right [23:35:26] you could do policy routing, but I'm so happy that we don't :) [23:35:39] heh [23:35:39] yeah [23:35:47] easier to just have an internal -lb [23:35:49] route 443 differently than 80 [23:36:06] * Ryan_Lane nods [23:36:08] well, route 80 to the outbound interface instead of lo [23:36:35] ok, I'm going to merge this and apply it [23:36:42] that only affects ulsfo, right? [23:36:47] traffic isn't going to ulsfo right now, so no worries on anything else [23:36:48] yeah [23:36:51] right [23:37:05] (03CR) 10Ryan Lane: [C: 032] Pass localssl traffic to ipaddress_lo_lvs [operations/puppet] - 10https://gerrit.wikimedia.org/r/91091 (owner: 10Ryan Lane) [23:37:29] stupid jenkins [23:37:30] :) [23:38:48] /names [23:39:22] hate it when that happens. sorry. [23:39:27] (03PS2) 10Andrew Bogott: bastion: convert into a module [operations/puppet] - 10https://gerrit.wikimedia.org/r/87473 (owner: 10Matanya) [23:41:06] (03CR) 10Andrew Bogott: [C: 032] bastion: convert into a module [operations/puppet] - 10https://gerrit.wikimedia.org/r/87473 (owner: 10Matanya) [23:43:46] <^demon|lunch> mutante: Went ahead and resolved 5867. I meant to this morning but forgot. [23:47:33] (03PS1) 10Andrew Bogott: Rename 'bastion' module to 'bastionhost' [operations/puppet] - 10https://gerrit.wikimedia.org/r/91097 [23:48:07] ^demon|lunch: thank you, that was the intention to confirm it's done [23:48:51] (03CR) 10Andrew Bogott: [C: 032] Rename 'bastion' module to 'bastionhost' [operations/puppet] - 10https://gerrit.wikimedia.org/r/91097 (owner: 10Andrew Bogott) [23:58:43] <^demon|lunch> manybubbles: bnwiki finished indexing.