[00:10:48] PROBLEM - RAID on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:10:58] PROBLEM - RAID on mw1125 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [00:10:58] PROBLEM - twemproxy process on mw1118 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [00:11:08] PROBLEM - DPKG on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:08] PROBLEM - RAID on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:08] PROBLEM - Apache HTTP on mw1118 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:11:08] PROBLEM - DPKG on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:18] PROBLEM - RAID on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:18] PROBLEM - RAID on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:38] PROBLEM - DPKG on mw1116 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:48] PROBLEM - RAID on mw1139 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:48] PROBLEM - Apache HTTP on mw1139 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:11:48] PROBLEM - RAID on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:48] PROBLEM - DPKG on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:48] PROBLEM - RAID on mw1116 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:11:48] PROBLEM - Apache HTTP on mw1116 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:11:58] RECOVERY - RAID on mw1125 is OK: OK: no RAID installed [00:11:58] PROBLEM - Apache HTTP on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:11:59] RECOVERY - Apache HTTP on mw1118 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.074 second response time [00:11:59] RECOVERY - twemproxy process on mw1118 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:12:08] RECOVERY - DPKG on mw1142 is OK: All packages OK [00:12:08] RECOVERY - RAID on mw1133 is OK: OK: no RAID installed [00:12:08] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:12:18] goddamnit again [00:12:28] RECOVERY - DPKG on mw1116 is OK: All packages OK [00:12:38] RECOVERY - RAID on mw1139 is OK: OK: no RAID installed [00:12:39] RECOVERY - Apache HTTP on mw1139 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.062 second response time [00:12:39] RECOVERY - RAID on mw1116 is OK: OK: no RAID installed [00:12:39] RECOVERY - Apache HTTP on mw1116 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.043 second response time [00:12:48] PROBLEM - SSH on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:12:48] RECOVERY - RAID on mw1128 is OK: OK: no RAID installed [00:12:48] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:13:38] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [00:13:38] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.074 second response time [00:13:48] RECOVERY - SSH on mw1143 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:13:58] PROBLEM - MySQL InnoDB on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 766 seconds [00:13:58] PROBLEM - MySQL Idle Transactions on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 766 seconds [00:13:58] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.103 second response time [00:14:58] RECOVERY - MySQL InnoDB on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:14:58] RECOVERY - MySQL Idle Transactions on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:14:58] RECOVERY - DPKG on mw1143 is OK: All packages OK [00:16:18] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:17:08] PROBLEM - twemproxy process on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:17:08] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:17:58] PROBLEM - MySQL InnoDB on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 827 seconds [00:17:58] PROBLEM - MySQL Idle Transactions on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 828 seconds [00:18:08] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:18:58] RECOVERY - MySQL InnoDB on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:18:58] RECOVERY - MySQL Idle Transactions on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:18:58] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.074 second response time [00:19:48] PROBLEM - twemproxy process on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:20:18] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:20:38] RECOVERY - twemproxy process on mw1143 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:20:58] RECOVERY - RAID on mw1143 is OK: OK: no RAID installed [00:21:18] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:21:48] PROBLEM - Disk space on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:21:58] PROBLEM - MySQL InnoDB on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 1068 seconds [00:21:58] PROBLEM - MySQL Idle Transactions on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 1068 seconds [00:22:08] PROBLEM - RAID on mw1120 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:22:48] PROBLEM - RAID on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:22:48] PROBLEM - DPKG on mw1118 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:22:49] PROBLEM - DPKG on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:23:08] PROBLEM - RAID on mw1125 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:23:08] PROBLEM - DPKG on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:23:28] PROBLEM - DPKG on mw1146 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [00:23:38] PROBLEM - twemproxy process on mw1117 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [00:23:38] PROBLEM - RAID on mw1117 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [00:23:48] RECOVERY - DPKG on mw1118 is OK: All packages OK [00:23:48] RECOVERY - Disk space on mw1124 is OK: DISK OK [00:23:48] PROBLEM - RAID on mw1116 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:23:48] PROBLEM - twemproxy process on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:23:48] PROBLEM - RAID on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:23:49] PROBLEM - RAID on mw1146 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:23:49] PROBLEM - LVS HTTP IPv4 on api.svc.eqiad.wmnet is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:23:51] PROBLEM - Apache HTTP on mw1116 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:23:58] PROBLEM - Apache HTTP on mw1193 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:23:58] PROBLEM - Apache HTTP on mw1192 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:23:58] PROBLEM - Apache HTTP on mw1190 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:23:58] RECOVERY - RAID on mw1125 is OK: OK: no RAID installed [00:23:58] RECOVERY - MySQL InnoDB on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:23:59] RECOVERY - MySQL Idle Transactions on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:23:59] RECOVERY - twemproxy process on mw1124 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:24:08] PROBLEM - DPKG on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:24:08] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:24:08] PROBLEM - RAID on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:24:08] PROBLEM - Apache HTTP on mw1198 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:24:08] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:24:09] PROBLEM - Apache HTTP on mw1118 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:24:09] PROBLEM - twemproxy process on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:24:38] PROBLEM - Disk space on mw1146 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [00:24:39] RECOVERY - RAID on mw1116 is OK: OK: no RAID installed [00:24:39] RECOVERY - LVS HTTP IPv4 on api.svc.eqiad.wmnet is OK: HTTP OK: HTTP/1.1 200 OK - 2919 bytes in 0.130 second response time [00:24:48] RECOVERY - Apache HTTP on mw1116 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.878 second response time [00:24:48] RECOVERY - Apache HTTP on mw1190 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.044 second response time [00:24:48] RECOVERY - Apache HTTP on mw1193 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.046 second response time [00:24:48] RECOVERY - Apache HTTP on mw1192 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.053 second response time [00:24:48] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:24:48] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:24:49] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:24:49] RECOVERY - DPKG on mw1128 is OK: All packages OK [00:24:58] PROBLEM - twemproxy process on mw1122 is CRITICAL: NRPE: Unable to read output [00:24:58] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:24:58] RECOVERY - Apache HTTP on mw1198 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.050 second response time [00:24:59] RECOVERY - Apache HTTP on mw1118 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.102 second response time [00:24:59] RECOVERY - RAID on mw1120 is OK: OK: no RAID installed [00:25:08] RECOVERY - DPKG on mw1142 is OK: All packages OK [00:25:08] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 7.898 second response time [00:25:08] RECOVERY - twemproxy process on mw1142 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:25:18] PROBLEM - RAID on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:25:28] RECOVERY - DPKG on mw1146 is OK: All packages OK [00:25:38] RECOVERY - Disk space on mw1146 is OK: DISK OK [00:25:39] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.055 second response time [00:25:39] RECOVERY - RAID on mw1146 is OK: OK: no RAID installed [00:25:48] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.435 second response time [00:25:48] RECOVERY - twemproxy process on mw1143 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:25:48] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.206 second response time [00:25:48] RECOVERY - RAID on mw1128 is OK: OK: no RAID installed [00:25:48] PROBLEM - DPKG on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:25:58] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.078 second response time [00:26:28] PROBLEM - SSH on mw1133 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:26:48] RECOVERY - twemproxy process on mw1122 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:26:48] PROBLEM - Disk space on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:26:58] PROBLEM - MySQL InnoDB on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 1449 seconds [00:26:58] PROBLEM - MySQL Idle Transactions on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 1449 seconds [00:27:08] PROBLEM - twemproxy process on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:27:18] RECOVERY - SSH on mw1133 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:27:38] RECOVERY - twemproxy process on mw1117 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:27:48] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:27:48] PROBLEM - SSH on mw1139 is CRITICAL: Server answer: [00:27:58] PROBLEM - twemproxy process on mw1139 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:27:58] RECOVERY - MySQL InnoDB on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:27:58] RECOVERY - MySQL Idle Transactions on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:27:58] PROBLEM - DPKG on mw1139 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:27:59] PROBLEM - DPKG on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:27:59] RECOVERY - DPKG on mw1143 is OK: All packages OK [00:28:08] PROBLEM - DPKG on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:28:08] RECOVERY - RAID on mw1143 is OK: OK: no RAID installed [00:28:08] PROBLEM - twemproxy process on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:28:08] PROBLEM - DPKG on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:28:08] PROBLEM - twemproxy process on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:28:08] PROBLEM - Apache HTTP on mw1195 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:18] PROBLEM - SSH on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:18] PROBLEM - Apache HTTP on mw1203 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:39] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.018 second response time [00:28:48] PROBLEM - Apache HTTP on mw1139 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:48] PROBLEM - Apache HTTP on mw1204 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:48] RECOVERY - twemproxy process on mw1139 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:28:48] RECOVERY - SSH on mw1139 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:28:48] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:49] PROBLEM - Apache HTTP on mw1191 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:49] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:28:50] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:50] PROBLEM - SSH on mw1146 is CRITICAL: Server answer: [00:28:50] PROBLEM - RAID on mw1146 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:28:51] PROBLEM - RAID on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:28:52] PROBLEM - LVS HTTP IPv4 on api.svc.eqiad.wmnet is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:28:52] RECOVERY - DPKG on mw1139 is OK: All packages OK [00:28:53] PROBLEM - SSH on mw1117 is CRITICAL: Server answer: [00:28:58] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:28:58] RECOVERY - DPKG on mw1123 is OK: All packages OK [00:28:59] RECOVERY - Apache HTTP on mw1195 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.047 second response time [00:29:08] RECOVERY - Apache HTTP on mw1203 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.056 second response time [00:29:08] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:29:08] PROBLEM - Disk space on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:29:18] PROBLEM - Apache HTTP on mw1117 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:29:38] RECOVERY - Apache HTTP on mw1139 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.060 second response time [00:29:38] RECOVERY - Apache HTTP on mw1204 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.054 second response time [00:29:39] RECOVERY - RAID on mw1117 is OK: OK: no RAID installed [00:29:39] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.077 second response time [00:29:39] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.079 second response time [00:29:39] RECOVERY - Apache HTTP on mw1191 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.096 second response time [00:29:39] RECOVERY - RAID on mw1146 is OK: OK: no RAID installed [00:29:40] RECOVERY - LVS HTTP IPv4 on api.svc.eqiad.wmnet is OK: HTTP OK: HTTP/1.1 200 OK - 2919 bytes in 0.078 second response time [00:29:48] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [00:29:48] RECOVERY - SSH on mw1117 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:29:48] RECOVERY - SSH on mw1146 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:29:48] RECOVERY - RAID on mw1141 is OK: OK: no RAID installed [00:29:58] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.071 second response time [00:29:58] RECOVERY - twemproxy process on mw1124 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:30:08] RECOVERY - Apache HTTP on mw1117 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.066 second response time [00:30:08] RECOVERY - DPKG on mw1141 is OK: All packages OK [00:30:58] RECOVERY - Disk space on mw1142 is OK: DISK OK [00:31:08] RECOVERY - SSH on mw1142 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:31:48] RECOVERY - Disk space on mw1124 is OK: DISK OK [00:31:58] PROBLEM - Apache HTTP on mw1133 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:31:59] RECOVERY - twemproxy process on mw1133 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:32:08] RECOVERY - Disk space on mw1133 is OK: DISK OK [00:32:58] PROBLEM - MySQL Idle Transactions on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 1632 seconds [00:32:59] PROBLEM - MySQL InnoDB on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 1632 seconds [00:33:08] PROBLEM - twemproxy process on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:33:18] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:33:28] PROBLEM - SSH on mw1133 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:33:58] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:34:18] PROBLEM - SSH on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:34:18] RECOVERY - SSH on mw1133 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:34:48] PROBLEM - Disk space on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:34:58] RECOVERY - Disk space on mw1142 is OK: DISK OK [00:35:08] RECOVERY - twemproxy process on mw1142 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:36:08] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:36:08] PROBLEM - Disk space on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:36:58] RECOVERY - MySQL InnoDB on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:36:58] RECOVERY - MySQL Idle Transactions on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:37:08] RECOVERY - Disk space on mw1133 is OK: DISK OK [00:37:48] RECOVERY - Apache HTTP on mw1133 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.059 second response time [00:38:08] RECOVERY - twemproxy process on mw1124 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:38:08] PROBLEM - twemproxy process on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:38:28] PROBLEM - SSH on mw1133 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:39:08] RECOVERY - twemproxy process on mw1133 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:39:28] RECOVERY - SSH on mw1133 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:39:34] what's going on with the apaches? [00:39:39] RECOVERY - DPKG on mw1133 is OK: All packages OK [00:39:58] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:40:08] RECOVERY - SSH on mw1142 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:40:58] RECOVERY - Disk space on mw1142 is OK: DISK OK [00:41:08] PROBLEM - twemproxy process on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:41:18] RECOVERY - RAID on mw1133 is OK: OK: no RAID installed [00:41:48] RECOVERY - Disk space on mw1124 is OK: DISK OK [00:41:58] PROBLEM - MySQL Idle Transactions on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 2220 seconds [00:41:58] PROBLEM - MySQL InnoDB on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 2220 seconds [00:43:58] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:44:08] PROBLEM - twemproxy process on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:44:18] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:46:48] PROBLEM - Disk space on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:47:18] PROBLEM - SSH on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:47:58] RECOVERY - MySQL Idle Transactions on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:47:58] RECOVERY - MySQL InnoDB on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:48:18] RECOVERY - SSH on mw1142 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:49:48] RECOVERY - Disk space on mw1124 is OK: DISK OK [00:49:48] RECOVERY - Disk space on mw1142 is OK: DISK OK [00:51:58] PROBLEM - MySQL InnoDB on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 3046 seconds [00:51:58] PROBLEM - MySQL Idle Transactions on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 3046 seconds [00:52:08] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:52:18] PROBLEM - SSH on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:52:48] PROBLEM - Disk space on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:52:58] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:54:08] RECOVERY - twemproxy process on mw1124 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [00:54:08] RECOVERY - SSH on mw1142 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:54:48] RECOVERY - Disk space on mw1142 is OK: DISK OK [00:55:18] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:55:58] RECOVERY - MySQL Idle Transactions on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:55:58] RECOVERY - MySQL InnoDB on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [00:56:38] RECOVERY - DPKG on mw1124 is OK: All packages OK [00:56:48] RECOVERY - Disk space on mw1124 is OK: DISK OK [00:57:08] PROBLEM - twemproxy process on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [00:58:08] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [00:58:58] PROBLEM - MySQL Idle Transactions on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 3466 seconds [00:58:58] PROBLEM - MySQL InnoDB on db1056 is CRITICAL: CRIT longest blocking idle transaction sleeps for 3466 seconds [00:59:48] PROBLEM - DPKG on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:00:58] RECOVERY - MySQL Idle Transactions on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [01:00:58] RECOVERY - MySQL InnoDB on db1056 is OK: OK longest blocking idle transaction sleeps for 0 seconds [01:01:08] RECOVERY - DPKG on mw1142 is OK: All packages OK [01:01:08] RECOVERY - twemproxy process on mw1142 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:01:58] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:02:18] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:03:08] RECOVERY - twemproxy process on mw1124 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:03:48] RECOVERY - Disk space on mw1142 is OK: DISK OK [01:04:08] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:04:08] PROBLEM - DPKG on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:06:11] RECOVERY - RAID on mw1124 is OK: OK: no RAID installed [01:06:11] PROBLEM - twemproxy process on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:06:11] PROBLEM - SSH on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:08:01] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:08:51] RECOVERY - Disk space on mw1142 is OK: DISK OK [01:08:52] PROBLEM - Disk space on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:09:11] PROBLEM - RAID on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:09:42] RECOVERY - Disk space on mw1124 is OK: DISK OK [01:10:11] PROBLEM - twemproxy process on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:10:11] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:10:42] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [01:12:51] RECOVERY - DPKG on mw1124 is OK: All packages OK [01:12:51] RECOVERY - Apache HTTP on mw1124 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.229 second response time [01:13:11] RECOVERY - twemproxy process on mw1124 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:13:11] PROBLEM - RAID on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:13:51] PROBLEM - RAID on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:14:01] RECOVERY - DPKG on mw1142 is OK: All packages OK [01:14:02] RECOVERY - twemproxy process on mw1142 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:14:02] RECOVERY - SSH on mw1142 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:14:11] PROBLEM - RAID on mw1132 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:14:21] PROBLEM - DPKG on mw1116 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:14:41] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [01:15:02] RECOVERY - RAID on mw1143 is OK: OK: no RAID installed [01:15:11] PROBLEM - Apache HTTP on mw1125 is CRITICAL: Connection timed out [01:15:21] PROBLEM - Disk space on mw1125 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [01:15:21] PROBLEM - RAID on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:15:31] PROBLEM - RAID on mw1117 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [01:15:41] PROBLEM - Apache HTTP on mw1199 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:15:42] PROBLEM - Apache HTTP on mw1132 is CRITICAL: Connection timed out [01:15:42] PROBLEM - Apache HTTP on mw1204 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:15:42] PROBLEM - Disk space on mw1130 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:15:51] PROBLEM - Apache HTTP on mw1197 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:15:51] PROBLEM - Apache HTTP on mw1207 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:15:52] PROBLEM - RAID on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:15:52] PROBLEM - RAID on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:15:52] PROBLEM - LVS HTTP IPv4 on api.svc.eqiad.wmnet is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:16:01] PROBLEM - DPKG on mw1130 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:16:02] RECOVERY - RAID on mw1132 is OK: OK: no RAID installed [01:16:02] RECOVERY - Apache HTTP on mw1125 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.045 second response time [01:16:11] RECOVERY - RAID on mw1133 is OK: OK: no RAID installed [01:16:11] PROBLEM - DPKG on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:16:11] PROBLEM - Apache HTTP on mw1130 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:16:11] PROBLEM - RAID on mw1130 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:16:11] PROBLEM - SSH on mw1130 is CRITICAL: Server answer: [01:16:12] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:16:12] PROBLEM - Apache HTTP on mw1117 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:16:21] RECOVERY - Disk space on mw1125 is OK: DISK OK [01:16:31] RECOVERY - Apache HTTP on mw1199 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.361 second response time [01:16:32] RECOVERY - Apache HTTP on mw1204 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.122 second response time [01:16:41] RECOVERY - RAID on mw1117 is OK: OK: no RAID installed [01:16:41] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.094 second response time [01:16:42] RECOVERY - Apache HTTP on mw1197 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.091 second response time [01:16:42] RECOVERY - Apache HTTP on mw1207 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.112 second response time [01:16:42] RECOVERY - LVS HTTP IPv4 on api.svc.eqiad.wmnet is OK: HTTP OK: HTTP/1.1 200 OK - 2919 bytes in 1.481 second response time [01:16:44] RECOVERY - RAID on mw1128 is OK: OK: no RAID installed [01:16:51] PROBLEM - DPKG on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:17:01] RECOVERY - DPKG on mw1130 is OK: All packages OK [01:17:01] PROBLEM - Apache HTTP on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:17:01] RECOVERY - SSH on mw1130 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:17:01] RECOVERY - Apache HTTP on mw1130 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.972 second response time [01:17:01] RECOVERY - RAID on mw1130 is OK: OK: no RAID installed [01:17:11] RECOVERY - Apache HTTP on mw1117 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.038 second response time [01:17:11] RECOVERY - DPKG on mw1116 is OK: All packages OK [01:17:11] PROBLEM - twemproxy process on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:17:11] PROBLEM - DPKG on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:17:11] PROBLEM - twemproxy process on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:17:11] PROBLEM - SSH on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:17:31] RECOVERY - Disk space on mw1130 is OK: DISK OK [01:17:51] PROBLEM - RAID on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:18:01] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:18:11] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:18:11] RECOVERY - twemproxy process on mw1142 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:18:11] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:19:11] PROBLEM - RAID on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:19:11] PROBLEM - DPKG on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:19:21] PROBLEM - SSH on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:19:51] RECOVERY - Disk space on mw1142 is OK: DISK OK [01:20:51] PROBLEM - twemproxy process on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:21:11] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:21:41] RECOVERY - twemproxy process on mw1143 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:21:51] RECOVERY - DPKG on mw1124 is OK: All packages OK [01:22:11] PROBLEM - twemproxy process on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:22:41] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [01:23:01] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:23:02] RECOVERY - DPKG on mw1143 is OK: All packages OK [01:23:02] RECOVERY - RAID on mw1143 is OK: OK: no RAID installed [01:23:02] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.074 second response time [01:24:51] PROBLEM - DPKG on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:24:51] PROBLEM - twemproxy process on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:24:51] PROBLEM - Disk space on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:24:51] RECOVERY - Disk space on mw1142 is OK: DISK OK [01:25:51] PROBLEM - RAID on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:26:01] RECOVERY - twemproxy process on mw1142 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:26:02] RECOVERY - DPKG on mw1142 is OK: All packages OK [01:26:11] RECOVERY - SSH on mw1142 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:28:51] RECOVERY - Disk space on mw1124 is OK: DISK OK [01:29:11] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:29:21] RECOVERY - SSH on mw1141 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:29:41] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [01:29:42] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.065 second response time [01:30:51] PROBLEM - RAID on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:31:01] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:31:11] PROBLEM - RAID on mw1120 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:31:11] PROBLEM - DPKG on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:31:11] PROBLEM - RAID on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:31:41] PROBLEM - Apache HTTP on mw1204 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:31:51] RECOVERY - RAID on mw1128 is OK: OK: no RAID installed [01:31:51] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:31:51] PROBLEM - Apache HTTP on mw1207 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:31:51] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:31:52] PROBLEM - Apache HTTP on mw1189 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:31:52] PROBLEM - DPKG on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:32:02] RECOVERY - RAID on mw1120 is OK: OK: no RAID installed [01:32:11] PROBLEM - DPKG on mw1117 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:32:12] RECOVERY - DPKG on mw1143 is OK: All packages OK [01:32:12] RECOVERY - DPKG on mw1141 is OK: All packages OK [01:32:12] PROBLEM - SSH on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:32:12] PROBLEM - Apache HTTP on mw1117 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:32:31] RECOVERY - Apache HTTP on mw1204 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.207 second response time [01:32:41] RECOVERY - DPKG on mw1133 is OK: All packages OK [01:32:42] RECOVERY - Apache HTTP on mw1207 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 7.085 second response time [01:32:51] PROBLEM - Disk space on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:32:51] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:32:51] PROBLEM - RAID on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:32:51] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.002 second response time [01:33:01] RECOVERY - DPKG on mw1117 is OK: All packages OK [01:33:01] RECOVERY - Apache HTTP on mw1117 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.069 second response time [01:33:11] RECOVERY - SSH on mw1124 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:33:11] RECOVERY - twemproxy process on mw1124 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:33:41] PROBLEM - SSH on mw1121 is CRITICAL: Server answer: [01:33:41] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [01:33:51] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:33:51] PROBLEM - SSH on mw1116 is CRITICAL: Server answer: [01:34:01] PROBLEM - twemproxy process on mw1116 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [01:34:11] PROBLEM - Apache HTTP on mw1146 is CRITICAL: Connection timed out [01:34:11] PROBLEM - Apache HTTP on mw1201 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:34:11] PROBLEM - Apache HTTP on mw1130 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:34:11] PROBLEM - Apache HTTP on mw1194 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:34:11] PROBLEM - RAID on mw1130 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:34:11] PROBLEM - RAID on mw1121 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [01:34:21] PROBLEM - DPKG on mw1116 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:34:21] PROBLEM - RAID on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:34:41] PROBLEM - twemproxy process on mw1121 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [01:34:41] PROBLEM - Disk space on mw1121 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [01:34:41] RECOVERY - Apache HTTP on mw1189 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.113 second response time [01:34:41] PROBLEM - RAID on mw1146 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [01:34:41] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.066 second response time [01:34:42] PROBLEM - DPKG on mw1146 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:34:42] PROBLEM - RAID on mw1139 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:34:51] PROBLEM - RAID on mw1116 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:34:51] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:34:51] RECOVERY - twemproxy process on mw1141 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:34:51] PROBLEM - DPKG on mw1139 is CRITICAL: CHECK_NRPE: Error - Could not complete SSL handshake. [01:34:52] PROBLEM - DPKG on mw1118 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:34:52] PROBLEM - Apache HTTP on mw1116 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:34:52] RECOVERY - SSH on mw1116 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:35:01] PROBLEM - RAID on mw1118 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:35:02] RECOVERY - Apache HTTP on mw1201 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.147 second response time [01:35:02] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.049 second response time [01:35:02] RECOVERY - Apache HTTP on mw1194 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.227 second response time [01:35:11] RECOVERY - twemproxy process on mw1116 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:35:11] PROBLEM - Apache HTTP on mw1118 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:35:11] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:35:11] PROBLEM - DPKG on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:35:11] PROBLEM - DPKG on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:35:12] RECOVERY - RAID on mw1121 is OK: OK: no RAID installed [01:35:31] RECOVERY - DPKG on mw1146 is OK: All packages OK [01:35:31] RECOVERY - RAID on mw1139 is OK: OK: no RAID installed [01:35:32] RECOVERY - SSH on mw1121 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:35:41] RECOVERY - twemproxy process on mw1121 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:35:41] RECOVERY - Disk space on mw1121 is OK: DISK OK [01:35:42] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.067 second response time [01:35:42] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.071 second response time [01:35:42] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.057 second response time [01:35:42] RECOVERY - DPKG on mw1118 is OK: All packages OK [01:35:42] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.855 second response time [01:35:43] RECOVERY - RAID on mw1146 is OK: OK: no RAID installed [01:35:43] RECOVERY - Apache HTTP on mw1116 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.162 second response time [01:35:51] RECOVERY - RAID on mw1116 is OK: OK: no RAID installed [01:35:51] PROBLEM - twemproxy process on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:35:51] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:35:51] PROBLEM - twemproxy process on mw1143 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:35:51] PROBLEM - RAID on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:35:51] RECOVERY - DPKG on mw1124 is OK: All packages OK [01:35:51] RECOVERY - RAID on mw1118 is OK: OK: no RAID installed [01:35:52] RECOVERY - DPKG on mw1139 is OK: All packages OK [01:35:52] RECOVERY - Apache HTTP on mw1124 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.280 second response time [01:35:53] PROBLEM - DPKG on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:36:01] PROBLEM - DPKG on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:36:02] RECOVERY - Apache HTTP on mw1118 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.063 second response time [01:36:02] RECOVERY - Apache HTTP on mw1130 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.074 second response time [01:36:02] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.095 second response time [01:36:02] RECOVERY - RAID on mw1130 is OK: OK: no RAID installed [01:36:11] RECOVERY - RAID on mw1124 is OK: OK: no RAID installed [01:36:11] RECOVERY - DPKG on mw1116 is OK: All packages OK [01:36:11] RECOVERY - RAID on mw1133 is OK: OK: no RAID installed [01:36:11] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:36:41] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:36:41] RECOVERY - Disk space on mw1141 is OK: DISK OK [01:36:42] RECOVERY - RAID on mw1141 is OK: OK: no RAID installed [01:36:42] PROBLEM - SSH on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:36:51] RECOVERY - DPKG on mw1128 is OK: All packages OK [01:36:51] PROBLEM - RAID on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:37:01] RECOVERY - DPKG on mw1141 is OK: All packages OK [01:37:41] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [01:37:51] RECOVERY - twemproxy process on mw1123 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:38:41] RECOVERY - twemproxy process on mw1143 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:38:42] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [01:38:51] PROBLEM - DPKG on mw1133 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:39:01] PROBLEM - Apache HTTP on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:39:02] RECOVERY - DPKG on mw1143 is OK: All packages OK [01:39:11] PROBLEM - twemproxy process on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:39:11] PROBLEM - RAID on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:39:31] RECOVERY - SSH on mw1143 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:39:51] PROBLEM - twemproxy process on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:39:51] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:39:51] PROBLEM - RAID on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:39:52] RECOVERY - Apache HTTP on mw1124 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.072 second response time [01:39:52] PROBLEM - DPKG on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:40:01] RECOVERY - twemproxy process on mw1124 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:40:02] RECOVERY - RAID on mw1143 is OK: OK: no RAID installed [01:40:02] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.093 second response time [01:40:02] RECOVERY - RAID on mw1124 is OK: OK: no RAID installed [01:40:11] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:40:11] PROBLEM - DPKG on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:40:41] RECOVERY - DPKG on mw1133 is OK: All packages OK [01:40:42] RECOVERY - twemproxy process on mw1128 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:40:42] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.893 second response time [01:40:51] RECOVERY - RAID on mw1128 is OK: OK: no RAID installed [01:40:51] PROBLEM - RAID on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:40:51] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:41:11] PROBLEM - DPKG on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:41:11] PROBLEM - Disk space on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:41:11] PROBLEM - SSH on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:41:51] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:41:51] PROBLEM - twemproxy process on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:41:51] PROBLEM - twemproxy process on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:42:01] PROBLEM - Disk space on mw1142 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:42:41] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.071 second response time [01:42:41] RECOVERY - RAID on mw1142 is OK: OK: no RAID installed [01:42:51] RECOVERY - twemproxy process on mw1141 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:42:51] RECOVERY - Disk space on mw1142 is OK: DISK OK [01:43:01] RECOVERY - DPKG on mw1142 is OK: All packages OK [01:43:02] RECOVERY - SSH on mw1142 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:43:11] PROBLEM - Disk space on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:43:21] PROBLEM - SSH on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:43:51] PROBLEM - RAID on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:43:51] PROBLEM - twemproxy process on mw1128 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:44:11] RECOVERY - Disk space on mw1128 is OK: DISK OK [01:44:11] RECOVERY - DPKG on mw1141 is OK: All packages OK [01:44:21] PROBLEM - SSH on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:45:51] RECOVERY - RAID on mw1141 is OK: OK: no RAID installed [01:46:01] PROBLEM - SSH on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:46:02] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.065 second response time [01:46:11] RECOVERY - SSH on mw1141 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:46:41] RECOVERY - twemproxy process on mw1128 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:46:51] RECOVERY - RAID on mw1128 is OK: OK: no RAID installed [01:46:51] RECOVERY - DPKG on mw1128 is OK: All packages OK [01:46:51] PROBLEM - DPKG on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:46:51] RECOVERY - SSH on mw1128 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:47:01] PROBLEM - Apache HTTP on mw1124 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:47:11] RECOVERY - Disk space on mw1123 is OK: DISK OK [01:47:11] PROBLEM - RAID on mw1124 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:47:21] RECOVERY - SSH on mw1123 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:47:31] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.492 second response time [01:47:41] RECOVERY - DPKG on mw1124 is OK: All packages OK [01:47:51] RECOVERY - Apache HTTP on mw1124 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.065 second response time [01:48:01] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.060 second response time [01:48:02] RECOVERY - RAID on mw1124 is OK: OK: no RAID installed [01:50:21] PROBLEM - SSH on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:51:11] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:51:21] PROBLEM - SSH on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:51:51] PROBLEM - RAID on mw1141 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [01:52:11] RECOVERY - SSH on mw1141 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:52:41] RECOVERY - RAID on mw1141 is OK: OK: no RAID installed [01:55:21] RECOVERY - SSH on mw1123 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [01:56:02] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.058 second response time [01:56:41] RECOVERY - twemproxy process on mw1123 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [01:56:42] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [01:56:51] RECOVERY - DPKG on mw1123 is OK: All packages OK [01:59:51] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:00:01] PROBLEM - DPKG on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:00:30] !log LocalisationUpdate failed: git pull of extensions failed [02:00:51] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [02:00:51] Logged the message, Master [02:02:21] PROBLEM - SSH on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:03:21] RECOVERY - SSH on mw1123 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [02:03:51] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:05:19] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:06:59] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.062 second response time [02:08:40] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [02:11:49] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:11:49] PROBLEM - twemproxy process on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:12:09] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:16:49] RECOVERY - twemproxy process on mw1123 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [02:17:40] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [02:17:49] RECOVERY - DPKG on mw1123 is OK: All packages OK [02:17:59] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.061 second response time [02:21:09] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:21:49] PROBLEM - twemproxy process on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:21:49] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:21:59] PROBLEM - DPKG on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:22:49] RECOVERY - twemproxy process on mw1123 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [02:22:59] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.533 second response time [02:23:39] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [02:23:49] RECOVERY - DPKG on mw1123 is OK: All packages OK [02:26:49] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:26:59] PROBLEM - DPKG on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:27:09] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:31:49] PROBLEM - twemproxy process on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:31:59] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.060 second response time [02:31:59] RECOVERY - DPKG on mw1123 is OK: All packages OK [02:32:39] RECOVERY - twemproxy process on mw1123 is OK: PROCS OK: 1 process with UID = 65534 (nobody), command name nutcracker [02:33:40] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [02:44:49] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [02:49:49] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [03:05:00] PROBLEM - DPKG on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [03:05:09] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:06:57] RECOVERY - DPKG on mw1123 is OK: All packages OK [03:07:57] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.512 second response time [03:26:47] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [03:27:07] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:29:47] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [03:30:57] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.046 second response time [03:40:47] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [03:42:07] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:42:17] PROBLEM - SSH on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:43:07] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.087 second response time [03:43:07] RECOVERY - SSH on mw1123 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [03:43:47] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [03:52:47] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [03:53:07] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:53:47] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [03:54:57] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.434 second response time [04:00:47] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [04:01:37] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [04:04:57] PROBLEM - DPKG on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [04:05:07] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:06:01] RECOVERY - DPKG on mw1123 is OK: All packages OK [04:06:01] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.060 second response time [04:09:17] (03CR) 10Ori.livneh: [C: 032] Run sync-wikiversions AFTER all code is deployed [operations/puppet] - 10https://gerrit.wikimedia.org/r/93083 (owner: 10Reedy) [04:16:01] PROBLEM - DPKG on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [04:16:11] PROBLEM - Apache HTTP on mw1123 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:16:51] PROBLEM - RAID on mw1123 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [04:17:01] RECOVERY - Apache HTTP on mw1123 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.066 second response time [04:17:51] RECOVERY - DPKG on mw1123 is OK: All packages OK [04:18:41] RECOVERY - RAID on mw1123 is OK: OK: no RAID installed [04:24:51] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:25:41] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.710 second response time [04:31:51] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:32:22] Reedy: l10nupdate failed [04:32:41] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.112 second response time [04:37:51] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:41:41] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.878 second response time [04:45:02] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:45:51] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.250 second response time [04:47:51] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:51:51] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:52:41] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.445 second response time [04:56:42] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 5.618 second response time [04:59:51] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:02:51] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:03:01] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:03:51] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 8.434 second response time [05:04:20] jeremyb: for real? notice any issues? [05:05:58] greg-g: well it's just not new messages i guess [05:06:01] 02 02:00:30 <+logmsgbot> !log LocalisationUpdate failed: git pull of extensions failed [05:06:21] * greg-g nods [05:06:28] also, wth is going on with the apaches? [05:06:35] same as earlier maybe? [05:06:40] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.071 second response time [05:06:43] I guess, what was the cause? [05:06:49] i didn't actually look to see if they are APIs [05:06:54] i'll check the localization update [05:07:06] greg-g: things look more or less OK [05:07:22] ori-l: re l10n or apaches? [05:07:50] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.226 second response time [05:09:02] * jeremyb replied on RT 6155 [05:10:47] apaches [05:10:51] ori-l: /me nods [05:11:01] /me hungry [05:13:09] * Aaron|home chuckles at http://answers.yahoo.com/question/index?qid=20080818210506AAk9LVU [05:13:10] /me sleep [05:14:00] Aaron|home: heh [05:14:03] :) [05:14:10] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:14:37] Nope! Ohio is so full of fun and history, it's not EVEN funny! [05:14:50] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:15:01] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.071 second response time [05:15:06] In fact, scientists warn that Ohio may be over-capacity [05:15:23] Fun levels have been rising sharply since the early 1980s [05:15:52] History is accruing at a steady rate of 1 year per year [05:16:35] heh [05:18:30] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:18:40] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.928 second response time [05:19:00] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:19:20] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.966 second response time [05:19:29] * Aaron|home thinks of Recycled Air looking at https://upload.wikimedia.org/wikipedia/commons/2/21/Arkansas_from_air_20060411.jpg [05:20:50] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.066 second response time [05:22:30] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:23:50] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:24:01] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:24:10] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:24:40] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.052 second response time [05:25:10] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.066 second response time [05:28:40] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:29:30] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.623 second response time [05:29:50] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:30:10] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:30:10] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:31:50] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.504 second response time [05:32:10] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.944 second response time [05:32:20] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.956 second response time [05:32:24] (03PS1) 10Springle: update cnames to avoid slaves picked for imminent decomm [operations/dns] - 10https://gerrit.wikimedia.org/r/93150 [05:32:29] so much flapping [05:32:40] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:32:40] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.538 second response time [05:34:08] (03CR) 10Springle: [C: 032] update cnames to avoid slaves picked for imminent decomm [operations/dns] - 10https://gerrit.wikimedia.org/r/93150 (owner: 10Springle) [05:34:10] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:35:01] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:35:30] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.294 second response time [05:35:50] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:36:00] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [05:36:30] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:37:01] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [05:37:40] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.201 second response time [05:40:50] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:41:10] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:41:40] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:41:51] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.586 second response time [05:42:10] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:42:30] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 5.585 second response time [05:43:10] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:43:30] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.796 second response time [05:43:50] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 9.025 second response time [05:45:10] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.977 second response time [05:46:00] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:46:10] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 9.110 second response time [05:46:30] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:46:40] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:47:50] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.071 second response time [05:48:10] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:48:10] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:49:10] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:51:00] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:51:01] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.289 second response time [05:51:01] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.070 second response time [05:51:10] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.552 second response time [05:51:30] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [05:51:50] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:52:40] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.074 second response time [05:52:50] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:53:40] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.074 second response time [05:54:10] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.576 second response time [05:54:10] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:54:10] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:55:40] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:55:50] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.566 second response time [05:56:30] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.945 second response time [05:57:10] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:59:00] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:59:10] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:59:40] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:00:00] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 5.901 second response time [06:00:10] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 8.686 second response time [06:01:38] (03PS1) 10Springle: depool db69 and db71 for reassignment during pmtpa decomm [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93151 [06:01:40] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 7.067 second response time [06:01:40] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:01:50] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:02:12] (03CR) 10Springle: [C: 032] depool db69 and db71 for reassignment during pmtpa decomm [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93151 (owner: 10Springle) [06:02:40] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.202 second response time [06:03:00] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:03:10] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.366 second response time [06:03:16] !log springle synchronized wmf-config/db-pmtpa.php 'depool db69 and db71 for reassignment during pmtpa decomm' [06:03:30] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.112 second response time [06:03:33] Logged the message, Master [06:03:40] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 5.895 second response time [06:05:09] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [06:05:39] PROBLEM - SSH on lvs1001 is CRITICAL: Server answer: [06:05:49] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:06:19] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:06:29] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:07:09] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [06:07:19] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.065 second response time [06:07:19] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:07:39] RECOVERY - SSH on lvs1001 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [06:08:09] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:08:19] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:08:19] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 9.101 second response time [06:10:09] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 7.349 second response time [06:10:29] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:10:39] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.541 second response time [06:11:09] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.846 second response time [06:12:19] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 9.459 second response time [06:14:39] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:14:49] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.066 second response time [06:15:06] (03PS1) 10Springle: reassign db69 to s2 and db71 to s3 during pmtpa decomm (both 12th floor) [operations/puppet] - 10https://gerrit.wikimedia.org/r/93152 [06:15:19] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:15:19] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:15:19] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:15:39] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:16:07] (03CR) 10Springle: [C: 032] reassign db69 to s2 and db71 to s3 during pmtpa decomm (both 12th floor) [operations/puppet] - 10https://gerrit.wikimedia.org/r/93152 (owner: 10Springle) [06:16:09] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.069 second response time [06:16:19] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:16:39] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 9.716 second response time [06:17:09] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.777 second response time [06:17:09] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.348 second response time [06:17:39] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 9.092 second response time [06:17:59] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:18:49] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:19:09] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.024 second response time [06:19:19] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:19:39] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:20:09] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.946 second response time [06:20:19] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:20:29] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 9.540 second response time [06:20:40] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:20:40] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.398 second response time [06:21:29] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [06:22:39] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.994 second response time [06:23:19] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:23:19] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:23:29] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:23:49] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:24:09] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.996 second response time [06:24:09] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 5.194 second response time [06:25:19] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:26:09] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.080 second response time [06:27:19] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:28:09] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.001 second response time [06:29:19] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:29:21] wt [06:29:39] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:30:19] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:30:19] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:30:29] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.538 second response time [06:30:39] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 7.228 second response time [06:30:49] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.867 second response time [06:31:09] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.640 second response time [06:31:19] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:31:49] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.216 second response time [06:33:09] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [06:33:09] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [06:33:29] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:33:49] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:34:39] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:35:39] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:36:19] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:36:19] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:36:19] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:36:29] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.636 second response time [06:36:59] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:39:09] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.340 second response time [06:39:39] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:40:19] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 8.550 second response time [06:40:29] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.073 second response time [06:41:09] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.644 second response time [06:41:19] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 7.966 second response time [06:41:39] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.202 second response time [06:42:19] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:42:29] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.893 second response time [06:43:19] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:43:39] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:44:19] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:44:49] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:45:09] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.067 second response time [06:45:19] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 8.228 second response time [06:46:29] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.050 second response time [06:47:19] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:47:19] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.067 second response time [06:47:49] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 7.017 second response time [06:47:49] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.301 second response time [06:48:09] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.881 second response time [06:48:19] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:49:09] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.738 second response time [06:50:49] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:50:59] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:51:19] PROBLEM - Apache HTTP on mw1117 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:52:09] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.069 second response time [06:52:09] RECOVERY - Apache HTTP on mw1117 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.210 second response time [06:52:19] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 8.959 second response time [06:52:19] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:52:29] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:52:39] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:53:29] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.802 second response time [06:54:19] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:55:19] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:55:19] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:55:19] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:55:39] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.067 second response time [06:55:39] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:56:09] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.136 second response time [06:56:29] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:57:09] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [06:57:29] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.444 second response time [06:58:09] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.847 second response time [06:58:29] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 8.950 second response time [06:58:29] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.169 second response time [06:58:42] !log increase weight of mw1189-1208 api servers from 20 to 25, they handle the load better (hopefully) [06:58:49] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:58:57] Logged the message, Master [06:59:19] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:59:39] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.502 second response time [07:00:09] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.122 second response time [07:00:09] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.143 second response time [07:03:49] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.339 second response time [07:04:19] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:04:19] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:04:29] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:05:09] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 3.947 second response time [07:05:26] PROBLEM - Apache HTTP on mw1120 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:05:46] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:05:46] RECOVERY - Apache HTTP on mw1120 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.539 second response time [07:06:16] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 8.334 second response time [07:06:36] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.558 second response time [07:07:26] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.065 second response time [07:07:36] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:08:36] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.325 second response time [07:09:06] !log and increased again to 30, after looking at memory on both groups of boxes. [07:09:19] Logged the message, Master [07:11:36] PROBLEM - Apache HTTP on mw1132 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:11:36] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:11:37] PROBLEM - Apache HTTP on mw1128 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:11:56] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:12:26] RECOVERY - Apache HTTP on mw1128 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.067 second response time [07:12:36] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.601 second response time [07:13:46] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.067 second response time [07:13:46] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:14:16] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:14:16] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:15:16] PROBLEM - Apache HTTP on mw1117 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:16:06] RECOVERY - Apache HTTP on mw1117 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.067 second response time [07:16:56] PROBLEM - Apache HTTP on mw1122 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:17:06] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.971 second response time [07:18:16] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:20:06] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 4.351 second response time [07:20:16] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 6.292 second response time [07:20:16] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 9.906 second response time [07:20:46] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 7.757 second response time [07:23:16] PROBLEM - Apache HTTP on mw1146 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:23:16] PROBLEM - Apache HTTP on mw1141 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:23:46] PROBLEM - Apache HTTP on mw1134 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:24:06] RECOVERY - Apache HTTP on mw1146 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.742 second response time [07:24:16] PROBLEM - Apache HTTP on mw1143 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:28:36] RECOVERY - Apache HTTP on mw1134 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.114 second response time [07:28:36] PROBLEM - Apache HTTP on mw1142 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:29:16] PROBLEM - Apache HTTP on mw1121 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:29:16] PROBLEM - Apache HTTP on mw1117 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:29:26] RECOVERY - Apache HTTP on mw1142 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.158 second response time [07:30:06] RECOVERY - Apache HTTP on mw1121 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.387 second response time [07:30:06] RECOVERY - Apache HTTP on mw1117 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.120 second response time [07:30:46] RECOVERY - Apache HTTP on mw1122 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.068 second response time [07:33:06] RECOVERY - Apache HTTP on mw1143 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.139 second response time [07:33:26] RECOVERY - Apache HTTP on mw1132 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.125 second response time [07:36:06] RECOVERY - Apache HTTP on mw1141 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 1.761 second response time [07:37:33] !log restarted a bunch of the apaches on the 12gb memory apis that were maxed out at their 100 client limit [07:37:43] we'll see how long that lasts [07:37:49] Logged the message, Master [07:37:58] maybe with the new weights they won't get stuck in the first place [07:38:07] not taking any bets though [07:46:05] jeremyb: spri ngle and I are already in the loop about amaranth but it needs or needed someone on site [07:46:20] (dunno if that happened while I was asleep or not) [07:48:42] looks like not (as I look at various logs anyways) [07:51:02] jesus effin christ [07:51:19] eh? [07:51:42] ULS was 41% of api reqs yesterday [07:51:48] ooowwww [07:51:50] it's basically reading a file and serving it as an api module [07:51:54] thanks for looking at that too [07:52:04] is this a new thing? [07:52:28] i checked going back to oct. 20th and no [07:52:54] been consistent since (at least) then, didn't go further back yet [07:53:30] ok well while it would be nice to fix, for sure, it would be nice to know what set things off last night and this morning [07:54:15] if that same rought # of requests and same rough percentages has been coming in to the api boxes the whole time, it's not something suddenly falling out of cache or some new update [07:55:27] from them at least [08:09:46] i'm sure legoktm will help fix ULS >.> <.< [08:10:21] he is probably going to murder me for suggesting that [08:10:22] p858snake|l: https://gerrit.wikimedia.org/r/#/c/92562/ [09:29:57] ## we're disabling this experimentally 11-09-2006 [09:29:57] #Crawl-delay: 1 [09:56:06] !log restarted twemproxy on ~10 of the api servers in eqiad in the lower memory/core group, was seeing a lot of entries in memcached-serious log and some processes churning on that [09:56:24] Logged the message, Master [09:56:32] shots in the dark... [10:07:47] RECOVERY - check_job_queue on hume is OK: JOBQUEUE OK - all job queues below 10,000 [10:10:57] PROBLEM - check_job_queue on hume is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [10:22:07] RECOVERY - check_job_queue on fenari is OK: JOBQUEUE OK - all job queues below 10,000 [10:24:56] going to wander off now since things look to be holding steady atm [10:26:17] PROBLEM - check_job_queue on fenari is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [11:31:46] PROBLEM - search indices - check lucene status page on search1003 is CRITICAL: Connection timed out [11:34:36] RECOVERY - search indices - check lucene status page on search1003 is OK: HTTP OK: HTTP/1.1 200 OK - 269 bytes in 0.001 second response time [11:55:46] PROBLEM - search indices - check lucene status page on search1003 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:56:36] RECOVERY - search indices - check lucene status page on search1003 is OK: HTTP OK: HTTP/1.1 200 OK - 269 bytes in 0.001 second response time [12:22:43] PROBLEM - Host wikiquote-lb.pmtpa.wikimedia.org_ipv6 is DOWN: /bin/ping6 -n -U -w 15 -c 5 2620:0:860:ed1a::3 [12:23:03] RECOVERY - Host wikiquote-lb.pmtpa.wikimedia.org_ipv6 is UP: PING OK - Packet loss = 0%, RTA = 35.47 ms [12:40:13] RECOVERY - check_job_queue on fenari is OK: JOBQUEUE OK - all job queues below 10,000 [12:43:13] PROBLEM - check_job_queue on fenari is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [12:43:42] !log start xtrabackup db52->db69 and db39->db71 [12:44:03] Logged the message, Master [12:56:18] hi guys [12:56:20] http://stats.grok.se/en/latest30/Font/Autonym.ttf [12:56:23] is it bad? [12:56:57] the stats are reporting 2.5M hits to that article *just today*, filed as https://bugzilla.wikimedia.org/show_bug.cgi?id=56514 [13:37:14] (03CR) 10Bartosz Dziewoński: "Aww shit… this breaks things." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89603 (owner: 10Bartosz Dziewoński) [13:37:29] i broke teh sites :( (not really, just a wee little bit) [13:37:40] is anybody around to sync a config fix very quickly? [13:41:54] (03PS1) 10Bartosz Dziewoński: Unbreak 'watchcreations' option default value [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93160 [13:43:18] (03CR) 10Bartosz Dziewoński: "Fix: https://gerrit.wikimedia.org/r/93160" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/89603 (owner: 10Bartosz Dziewoński) [13:45:28] (03CR) 10Yurik: "(3 comments)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/93006 (owner: 10Yurik) [13:46:18] is there any chance of getting a config fix synced on saturday? D: https://gerrit.wikimedia.org/r/93160 [13:46:24] (03PS2) 10Yurik: Add more zero values to analytics header [operations/puppet] - 10https://gerrit.wikimedia.org/r/93006 [13:46:40] i fucked up and need someone to fix it for me :( [14:14:04] MatmaRex: Are users complaining? [14:17:35] Elsie: yes, on VPT [14:18:23] (i am not amazing enough to go through my old patches and randomly see if they broke anything) [14:18:36] How many people are complaining? [14:19:10] a few? [14:19:34] We don't generally do deployments on Saturday. [14:21:58] i know. [14:22:49] greg-g: ^ [16:10:22] * Reedy kicks MatmaRex [16:10:51] wow, harsh [16:11:46] (03CR) 10Reedy: [C: 032] Unbreak 'watchcreations' option default value [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93160 (owner: 10Bartosz Dziewoński) [16:12:02] (03Merged) 10jenkins-bot: Unbreak 'watchcreations' option default value [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93160 (owner: 10Bartosz Dziewoński) [16:13:56] !log reedy synchronized wmf-config/CommonSettings.php [16:14:11] Logged the message, Master [16:17:53] apergos: He was complaining I hadn't merged it and that it "wouldn't change anything" [16:18:03] oh hrm [16:18:06] So I merge it and it breaks stuff [16:18:06] :D [16:18:17] yeah deploy on a weekend, not so much [16:18:21] even for teeny tiny things [16:18:36] I deployed it yesterday in the deployment window [16:18:37] But yeah [16:18:50] well that's still driday deploy day [16:18:53] er friday [16:19:40] eh next time he won't nag as loud :-D [16:20:13] oh any whiners from the sync? [16:20:32] since I'm here and we had another bout of badly behaved api servers this morning... [16:21:06] Nope, no complaints there [16:21:18] excellent [16:32:05] apergos: Ori says the API is getting hit pretty hard by ULS. [16:32:11] Not sure if that's related. [16:32:19] yeah I was here for that discussion earlier [16:32:30] Ah, I didn't see discussion, only the bug. [16:32:40] I'm sure it doesn't help, but the issue we had started yesterday evening, and the uls thing seems to be at least a week old [16:32:47] https://bugzilla.wikimedia.org/show_bug.cgi?id=56509 [16:32:51] yeah, saw it [16:32:53] * Elsie nods. [16:33:08] anything that can be made more efficient is good obviously [16:33:38] could be (for example) that the uls stuff pushed us close enough to the edge that the next small thing did us in [16:33:39] who knows [16:33:51] Yeah. [16:39:21] sorry Reedy, i love you [16:39:43] awww [16:50:21] * apergos is trying out processing 2.1  [16:50:39] having never used any previosu version or even heard of it til recently [16:52:29] https://github.com/processing ? [16:52:46] yep [17:22:18] PROBLEM - RAID on searchidx1001 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [17:23:08] RECOVERY - RAID on searchidx1001 is OK: OK: State is Optimal, checked 1 logical drive(s), 4 physical drive(s) [17:47:28] RECOVERY - check_job_queue on fenari is OK: JOBQUEUE OK - all job queues below 10,000 [17:50:28] PROBLEM - check_job_queue on fenari is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:00:49] heh: http://programmingisterrible.com/post/65781074112/devils-dictionary-of-programming [18:42:50] hi [18:42:58] I need to get an SSD [18:43:02] and connect it to my laptop [18:43:14] came here to ask for a few advices on how to do that [18:43:22] the SSDs I see on a store near me are on SATA III [18:43:34] should I get a connector SATA III => USB 2.0 ? [18:43:57] the speeds for the SSDs I've seen are around 400MB/s writing, 500MB/s reading [18:44:18] would SATA III => USB 2.0 lower some advantages of those speeds ? [18:44:21] I need it for development [18:44:27] RECOVERY - check_job_queue on hume is OK: JOBQUEUE OK - all job queues below 10,000 [18:44:37] I'm not sure if there's enough space in my laptop to put this SSD [18:44:56] apergos: aha. well i had gone through scrollback first, maybe i missed soemthing [18:46:08] apart from this, I'll need to switch some stuff on my filesystem to have /var/lib/mysql on the SSD [18:46:41] paravoid: do you have an SSD ? [18:47:37] PROBLEM - check_job_queue on hume is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:48:41] that is pretty off topic for in here [18:49:23] ok [18:55:22] ori-l, just read about ULS - pure lulz [18:56:06] MaxSem: Be nice. :P [18:56:24] nice? [18:57:30] A city in France. [19:16:18] MaxSem: erik already dragged ori into that, i bet he's having the time of his life :> [19:48:57] (03PS1) 10Cmjohnson: Addin mgmt dns back for amaranth [operations/dns] - 10https://gerrit.wikimedia.org/r/93177 [19:49:43] (03CR) 10Cmjohnson: [C: 032] Addin mgmt dns back for amaranth [operations/dns] - 10https://gerrit.wikimedia.org/r/93177 (owner: 10Cmjohnson) [19:50:18] !log dns update [19:50:38] Logged the message, Master [20:19:40] MaxSem: ULS where? [20:21:05] !bug :universallanguageselector | YuviPanda [20:21:05] YuviPanda: https://bugzilla.wikimedia.org/:universallanguageselector [20:21:20] whaaa. why is !bug crappy here. [20:21:29] MatmaRex: 404 [20:21:32] yeah [20:21:40] YuviPanda: https://bugzilla.wikimedia.org/buglist.cgi?quicksearch=%3Auniversallanguageselector [20:22:02] the latest few are rather interesting [20:29:52] RECOVERY - check_job_queue on hume is OK: JOBQUEUE OK - all job queues below 10,000 [20:30:53] MatmaRex: yeah, reading up on https://bugzilla.wikimedia.org/show_bug.cgi?id=46306 now [20:32:52] PROBLEM - check_job_queue on hume is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [20:34:33] MatmaRex: i was looking at EventLogging data with legoktm about how much ULS is actually used [20:34:56] MatmaRex: not sure if he did anything with it [20:49:58] legoktm: did you actually use the ULS EL data somewhere? [20:50:31] /dev/null [20:50:32] It's safe there [20:50:55] YuviPanda: uga [20:51:09] Nikerabbit: ploppa! [20:51:34] :-D [20:52:51] YuviPanda: https://en.wikipedia.org/wiki/Finlandization [20:53:31] Nikerabbit: are you being finlandized? :D [20:54:13] YuviPanda: are you? [20:54:22] hmm? [20:54:29] what I am, is confused :P [20:57:09] * Nikerabbit sighs [20:57:45] I've closed the juice bottle so tight I'm unable to open it [20:58:04] rubber bands around the cap [20:59:13] I... am still confused [20:59:14] wat [20:59:31] maybe the wine bottle is easier to open [21:06:25] it will make the juice bottle even harder to open [21:06:34] but you'll care less! [21:11:44] Heh. [21:12:01] We used to have one of those "as seen on TV" devices for opening juice bottles. [21:12:08] White handle with a rubber thing. [21:12:10] It was cute. [21:19:58] Elsie: can you explian this : https://he.wikipedia.org/wiki/%D7%9E%D7%99%D7%95%D7%97%D7%93:%D7%97%D7%A1%D7%99%D7%9E%D7%94/Drall [21:20:04] the block is epoch 0 [21:29:48] PROBLEM - Disk space on wtp1023 is CRITICAL: DISK CRITICAL - free space: / 355 MB (3% inode=76%): [21:30:05] About to head out, I can take a look later, if someone else doesn't beat me to it. [21:30:08] matanya: ^ [21:30:30] not so important, thanks Elsie [21:41:48] PROBLEM - Disk space on wtp1023 is CRITICAL: DISK CRITICAL - free space: / 356 MB (3% inode=76%): [22:33:57] RECOVERY - check_job_queue on hume is OK: JOBQUEUE OK - all job queues below 10,000 [22:36:57] PROBLEM - check_job_queue on hume is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [22:47:07] RECOVERY - check_job_queue on fenari is OK: JOBQUEUE OK - all job queues below 10,000 [22:50:07] PROBLEM - check_job_queue on fenari is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [22:52:07] RECOVERY - check_job_queue on fenari is OK: JOBQUEUE OK - all job queues below 10,000 [22:55:07] PROBLEM - check_job_queue on fenari is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [23:18:29] YuviPanda: No, I got busy with other things. [23:22:48] RECOVERY - check_job_queue on hume is OK: JOBQUEUE OK - all job queues below 10,000 [23:25:58] PROBLEM - check_job_queue on hume is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [23:28:58] RECOVERY - check_job_queue on hume is OK: JOBQUEUE OK - all job queues below 10,000 [23:31:58] PROBLEM - check_job_queue on hume is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [23:33:27] Why is that running on fenari and hume? [23:33:37] (ignoring that fact it's still in tampa) [23:43:40] Elsie: I have one of those openers [23:47:17] if you have a name for it, I'll gladly upload a photo on Commons :) [23:48:11] uh, it's swiss of course, zyliss