[19:52:29] blah [19:53:21] @logoff [20:53:25] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1542 [20:53:25] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1542 [21:4:11] New patchset: Pyoungmeister; "removing "temp" nagios config" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1543 [21:4:42] New review: Pyoungmeister; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1543 [21:4:43] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1543 [21:22:29] PROBLEM - Puppet freshness on mw1055 is CRITICAL: Puppet has not run in the last 10 hours [21:22:29] PROBLEM - Puppet freshness on mw1062 is CRITICAL: Puppet has not run in the last 10 hours [21:22:29] PROBLEM - Puppet freshness on mw1063 is CRITICAL: Puppet has not run in the last 10 hours [21:22:29] PROBLEM - Puppet freshness on mw1083 is CRITICAL: Puppet has not run in the last 10 hours [21:22:29] PROBLEM - Puppet freshness on mw1122 is CRITICAL: Puppet has not run in the last 10 hours [21:22:30] PROBLEM - Puppet freshness on mw1121 is CRITICAL: Puppet has not run in the last 10 hours [21:22:30] PROBLEM - Puppet freshness on mw1156 is CRITICAL: Puppet has not run in the last 10 hours [21:22:31] PROBLEM - Puppet freshness on mw1140 is CRITICAL: Puppet has not run in the last 10 hours [21:22:31] PROBLEM - Puppet freshness on mw1148 is CRITICAL: Puppet has not run in the last 10 hours [21:22:32] PROBLEM - Puppet freshness on mw1159 is CRITICAL: Puppet has not run in the last 10 hours [21:22:32] PROBLEM - Puppet freshness on snapshot4 is CRITICAL: Puppet has not run in the last 10 hours [21:22:33] PROBLEM - Puppet freshness on virt1 is CRITICAL: Puppet has not run in the last 10 hours [21:22:33] PROBLEM - Puppet freshness on virt2 is CRITICAL: Puppet has not run in the last 10 hours [21:25:15] RECOVERY - SSH on mw29 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:25:15] RECOVERY - Disk space on mw26 is OK: DISK OK [21:25:15] RECOVERY - Disk space on mw30 is OK: DISK OK [21:25:15] RECOVERY - Apache HTTP on mw27 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [21:25:15] RECOVERY - RAID on mw27 is OK: OK: no RAID installed [21:25:16] RECOVERY - DPKG on mw29 is OK: All packages OK [21:25:25] RECOVERY - Disk space on mw59 is OK: DISK OK [21:25:25] RECOVERY - Apache HTTP on mw6 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [21:25:25] RECOVERY - RAID on mw6 is OK: OK: no RAID installed [21:25:35] RECOVERY - ps1-a3-sdtpa-infeed-load-tower-A-phase-Z on ps1-a3-sdtpa is OK: ps1-a3-sdtpa-infeed-load-tower-A-phase-Z OK - 1713 [21:25:35] RECOVERY - ps1-a2-sdtpa-infeed-load-tower-B-phase-Y on ps1-a2-sdtpa is OK: ps1-a2-sdtpa-infeed-load-tower-B-phase-Y OK - 1063 [21:25:35] RECOVERY - ps1-a3-eqiad-infeed-load-tower-B-phase-X on ps1-a3-eqiad is OK: ps1-a3-eqiad-infeed-load-tower-B-phase-X OK - 775 [21:25:35] RECOVERY - ps1-a2-eqiad-infeed-load-tower-A-phase-X on ps1-a2-eqiad is OK: ps1-a2-eqiad-infeed-load-tower-A-phase-X OK - 613 [21:25:35] RECOVERY - ps1-a5-eqiad-infeed-load-tower-A-phase-X on ps1-a5-eqiad is OK: ps1-a5-eqiad-infeed-load-tower-A-phase-X OK - 900 [21:25:36] RECOVERY - ps1-a2-eqiad-infeed-load-tower-B-phase-Z on ps1-a2-eqiad is OK: ps1-a2-eqiad-infeed-load-tower-B-phase-Z OK - 600 [21:25:36] RECOVERY - ps1-a5-eqiad-infeed-load-tower-B-phase-Z on ps1-a5-eqiad is OK: ps1-a5-eqiad-infeed-load-tower-B-phase-Z OK - 1088 [21:26:5] RECOVERY - SSH on srv190 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:26:5] RECOVERY - DPKG on srv190 is OK: All packages OK [21:26:25] RECOVERY - SSH on srv243 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:26:25] RECOVERY - Disk space on srv245 is OK: DISK OK [21:26:25] RECOVERY - Apache HTTP on srv246 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [21:26:25] RECOVERY - Apache HTTP on srv241 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.028 second response time [21:26:25] RECOVERY - DPKG on srv243 is OK: All packages OK [21:26:26] RECOVERY - RAID on srv241 is OK: OK: no RAID installed [21:26:35] RECOVERY - RAID on mw21 is OK: OK: no RAID installed [21:26:55] RECOVERY - SSH on mw24 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:26:55] RECOVERY - Apache HTTP on mw36 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [21:27:15] RECOVERY - Disk space on mw20 is OK: DISK OK [21:27:15] RECOVERY - Apache HTTP on mw31 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.019 second response time [21:27:15] RECOVERY - Apache HTTP on mw17 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.031 second response time [21:27:15] RECOVERY - DPKG on mw19 is OK: All packages OK [21:28:25] RECOVERY - RAID on mw36 is OK: OK: no RAID installed [21:29:15] RECOVERY - Apache HTTP on mw12 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.034 second response time [21:29:35] RECOVERY - SSH on mw5 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:29:35] RECOVERY - Apache HTTP on mw48 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.030 second response time [21:29:35] RECOVERY - Apache HTTP on mw53 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [21:29:35] RECOVERY - RAID on mw48 is OK: OK: no RAID installed [21:29:35] RECOVERY - Disk space on mw52 is OK: DISK OK [21:29:36] RECOVERY - DPKG on mw5 is OK: All packages OK [21:29:45] RECOVERY - DPKG on srv193 is OK: All packages OK [21:30:5] RECOVERY - ps1-b5-sdtpa-infeed-load-tower-A-phase-Y on ps1-b5-sdtpa is OK: ps1-b5-sdtpa-infeed-load-tower-A-phase-Y OK - 1900 [21:30:5] RECOVERY - ps1-c1-sdtpa-infeed-load-tower-B-phase-Z on ps1-c1-sdtpa is OK: ps1-c1-sdtpa-infeed-load-tower-B-phase-Z OK - 638 [21:30:5] RECOVERY - ps1-c1-sdtpa-infeed-load-tower-A-phase-X on ps1-c1-sdtpa is OK: ps1-c1-sdtpa-infeed-load-tower-A-phase-X OK - 525 [21:30:5] RECOVERY - ps1-b8-eqiad-infeed-load-tower-A-phase-Y on ps1-b8-eqiad is OK: ps1-b8-eqiad-infeed-load-tower-A-phase-Y OK - 588 [21:30:5] RECOVERY - ps1-b7-eqiad-infeed-load-tower-A-phase-Z on ps1-b7-eqiad is OK: ps1-b7-eqiad-infeed-load-tower-A-phase-Z OK - 650 [21:30:6] RECOVERY - ps1-b5-eqiad-infeed-load-tower-A-phase-Z on ps1-b5-eqiad is OK: ps1-b5-eqiad-infeed-load-tower-A-phase-Z OK - 638 [21:30:6] RECOVERY - ps1-b6-eqiad-infeed-load-tower-B-phase-X on ps1-b6-eqiad is OK: ps1-b6-eqiad-infeed-load-tower-B-phase-X OK - 200 [21:30:35] RECOVERY - SSH on mw57 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:30:35] RECOVERY - SSH on mw8 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:30:35] RECOVERY - ps1-d3-pmtpa-infeed-load-tower-B-phase-Z on ps1-d3-pmtpa is OK: ps1-d3-pmtpa-infeed-load-tower-B-phase-Z OK - 25 [21:30:35] RECOVERY - RAID on mw40 is OK: OK: no RAID installed [21:30:35] RECOVERY - DPKG on mw52 is OK: All packages OK [21:31:5] RECOVERY - DPKG on mw8 is OK: All packages OK [21:31:25] RECOVERY - SSH on srv231 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:31:25] RECOVERY - Disk space on srv233 is OK: DISK OK [21:31:25] RECOVERY - Apache HTTP on srv234 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.027 second response time [21:31:25] RECOVERY - RAID on srv234 is OK: OK: no RAID installed [21:31:25] RECOVERY - DPKG on srv231 is OK: All packages OK [21:31:35] RECOVERY - Disk space on srv265 is OK: DISK OK [21:31:35] RECOVERY - Apache HTTP on srv267 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.019 second response time [21:31:55] RECOVERY - RAID on srv267 is OK: OK: no RAID installed [21:32:25] RECOVERY - SSH on mw42 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:32:45] RECOVERY - Disk space on srv235 is OK: DISK OK [21:32:55] RECOVERY - Disk space on srv262 is OK: DISK OK [21:32:55] RECOVERY - DPKG on srv260 is OK: All packages OK [21:33:24] RECOVERY - ps1-a6-eqiad-infeed-load-tower-A-phase-Y on ps1-a6-eqiad is OK: ps1-a6-eqiad-infeed-load-tower-A-phase-Y OK - 575 [21:33:25] RECOVERY - ps1-a8-eqiad-infeed-load-tower-B-phase-Y on ps1-a8-eqiad is OK: ps1-a8-eqiad-infeed-load-tower-B-phase-Y OK - 392 [21:33:35] RECOVERY - Apache HTTP on srv226 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [21:33:35] RECOVERY - RAID on srv279 is OK: OK: no RAID installed [21:34:5] RECOVERY - ps1-d1-sdtpa-infeed-load-tower-A-phase-X on ps1-d1-sdtpa is OK: ps1-d1-sdtpa-infeed-load-tower-A-phase-X OK - 1838 [21:34:5] RECOVERY - ps1-a1-sdtpa-infeed-load-tower-A-phase-Y on ps1-a1-sdtpa is OK: ps1-a1-sdtpa-infeed-load-tower-A-phase-Y OK - 538 [21:34:35] RECOVERY - SSH on mw17 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:35:35] RECOVERY - Disk space on mw17 is OK: DISK OK [21:35:35] RECOVERY - SSH on mw15 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:35:35] RECOVERY - Apache HTTP on mw13 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.017 second response time [21:35:35] RECOVERY - Apache HTTP on mw18 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [21:35:35] RECOVERY - RAID on mw13 is OK: OK: no RAID installed [21:35:36] RECOVERY - DPKG on mw15 is OK: All packages OK [21:35:45] RECOVERY - ps1-a5-sdtpa-infeed-load-tower-A-phase-X on ps1-a5-sdtpa is OK: ps1-a5-sdtpa-infeed-load-tower-A-phase-X OK - 2125 [21:35:45] RECOVERY - ps1-a8-eqiad-infeed-load-tower-B-phase-Z on ps1-a8-eqiad is OK: ps1-a8-eqiad-infeed-load-tower-B-phase-Z OK - 334 [21:35:45] RECOVERY - ps1-a4-eqiad-infeed-load-tower-B-phase-Z on ps1-a4-eqiad is OK: ps1-a4-eqiad-infeed-load-tower-B-phase-Z OK - 588 [21:35:45] RECOVERY - ps1-a6-eqiad-infeed-load-tower-A-phase-Z on ps1-a6-eqiad is OK: ps1-a6-eqiad-infeed-load-tower-A-phase-Z OK - 638 [21:35:55] RECOVERY - SSH on srv263 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:35:55] RECOVERY - RAID on srv244 is OK: OK: no RAID installed [21:36:5] RECOVERY - SSH on mw34 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:36:5] RECOVERY - Disk space on mw36 is OK: DISK OK [21:36:5] RECOVERY - Apache HTTP on mw37 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.027 second response time [21:36:5] RECOVERY - DPKG on mw34 is OK: All packages OK [21:36:5] RECOVERY - RAID on mw32 is OK: OK: no RAID installed [21:36:6] RECOVERY - RAID on mw37 is OK: OK: no RAID installed [21:36:15] RECOVERY - Apache HTTP on mw21 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [21:36:25] RECOVERY - Disk space on srv193 is OK: DISK OK [21:36:25] RECOVERY - DPKG on srv196 is OK: All packages OK [21:36:35] RECOVERY - SSH on srv193 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:36:45] RECOVERY - ps1-a2-sdtpa-infeed-load-tower-A-phase-Y on ps1-a2-sdtpa is OK: ps1-a2-sdtpa-infeed-load-tower-A-phase-Y OK - 1113 [21:36:45] RECOVERY - ps1-a1-sdtpa-infeed-load-tower-B-phase-X on ps1-a1-sdtpa is OK: ps1-a1-sdtpa-infeed-load-tower-B-phase-X OK - 600 [21:36:45] RECOVERY - ps1-a3-eqiad-infeed-load-tower-A-phase-X on ps1-a3-eqiad is OK: ps1-a3-eqiad-infeed-load-tower-A-phase-X OK - 700 [21:36:45] RECOVERY - ps1-a2-eqiad-infeed-load-tower-A-phase-Z on ps1-a2-eqiad is OK: ps1-a2-eqiad-infeed-load-tower-A-phase-Z OK - 613 [21:36:45] RECOVERY - ps1-a3-eqiad-infeed-load-tower-B-phase-Z on ps1-a3-eqiad is OK: ps1-a3-eqiad-infeed-load-tower-B-phase-Z OK - 738 [21:36:46] RECOVERY - ps1-a1-eqiad-infeed-load-tower-B-phase-Y on ps1-a1-eqiad is OK: ps1-a1-eqiad-infeed-load-tower-B-phase-Y OK - 401 [21:36:55] RECOVERY - RAID on mw9 is OK: OK: no RAID installed [21:36:55] RECOVERY - ps1-a8-eqiad-infeed-load-tower-A-phase-X on ps1-a8-eqiad is OK: ps1-a8-eqiad-infeed-load-tower-A-phase-X OK - 399 [21:36:55] RECOVERY - ps1-b3-eqiad-infeed-load-tower-A-phase-X on ps1-b3-eqiad is OK: ps1-b3-eqiad-infeed-load-tower-A-phase-X OK - 950 [21:37:5] RECOVERY - Disk space on srv243 is OK: DISK OK [21:37:5] RECOVERY - Apache HTTP on srv261 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [21:37:15] RECOVERY - DPKG on mw27 is OK: All packages OK [21:37:35] RECOVERY - RAID on srv286 is OK: OK: no RAID installed [21:37:45] RECOVERY - DPKG on mw14 is OK: All packages OK [21:38:5] RECOVERY - SSH on srv210 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:38:5] RECOVERY - Disk space on srv212 is OK: DISK OK [21:38:5] RECOVERY - Apache HTTP on srv213 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [21:38:5] RECOVERY - RAID on srv213 is OK: OK: no RAID installed [21:38:5] RECOVERY - DPKG on srv210 is OK: All packages OK [21:38:25] RECOVERY - Disk space on mw8 is OK: DISK OK [21:38:35] RECOVERY - RAID on mw45 is OK: OK: no RAID installed [21:38:35] RECOVERY - ps1-b2-eqiad-infeed-load-tower-A-phase-Y on ps1-b2-eqiad is OK: ps1-b2-eqiad-infeed-load-tower-A-phase-Y OK - 713 [21:38:55] RECOVERY - Apache HTTP on mw45 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [21:38:55] RECOVERY - RAID on srv277 is OK: OK: no RAID installed [21:39:5] RECOVERY - RAID on mw34 is OK: OK: no RAID installed [21:39:15] RECOVERY - Disk space on mw57 is OK: DISK OK [21:39:15] RECOVERY - Disk space on mw12 is OK: DISK OK [21:39:15] RECOVERY - Disk space on srv273 is OK: DISK OK [21:39:15] RECOVERY - RAID on mw18 is OK: OK: no RAID installed [21:39:45] RECOVERY - Apache HTTP on srv236 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [21:39:45] RECOVERY - ps1-a5-eqiad-infeed-load-tower-A-phase-Y on ps1-a5-eqiad is OK: ps1-a5-eqiad-infeed-load-tower-A-phase-Y OK - 988 [21:39:45] RECOVERY - RAID on mw25 is OK: OK: no RAID installed [21:39:55] RECOVERY - SSH on mw19 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:40:35] RECOVERY - SSH on mw43 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:40:35] RECOVERY - Disk space on mw27 is OK: DISK OK [21:40:35] RECOVERY - RAID on mw22 is OK: OK: no RAID installed [21:40:35] RECOVERY - RAID on mw46 is OK: OK: no RAID installed [21:40:55] RECOVERY - SSH on srv286 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:40:55] RECOVERY - SSH on srv269 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:40:55] RECOVERY - Disk space on srv288 is OK: DISK OK [21:40:55] RECOVERY - RAID on srv289 is OK: OK: no RAID installed [21:41:14] RECOVERY - Apache HTTP on mw10 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [21:41:54] RECOVERY - DPKG on mw12 is OK: All packages OK [21:42:35] RECOVERY - Apache HTTP on srv201 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [21:42:35] RECOVERY - DPKG on srv282 is OK: All packages OK [21:42:35] RECOVERY - DPKG on mw1 is OK: All packages OK [21:42:45] RECOVERY - RAID on srv196 is OK: OK: no RAID installed [21:42:45] RECOVERY - ps1-a7-eqiad-infeed-load-tower-A-phase-Z on ps1-a7-eqiad is OK: ps1-a7-eqiad-infeed-load-tower-A-phase-Z OK - 625 [21:42:45] RECOVERY - ps1-b5-eqiad-infeed-load-tower-B-phase-X on ps1-b5-eqiad is OK: ps1-b5-eqiad-infeed-load-tower-B-phase-X OK - 638 [21:42:54] RECOVERY - SSH on srv234 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:42:54] RECOVERY - Disk space on srv241 is OK: DISK OK [21:42:54] RECOVERY - DPKG on srv198 is OK: All packages OK [21:42:55] RECOVERY - RAID on srv227 is OK: OK: no RAID installed [21:42:55] RECOVERY - DPKG on srv229 is OK: All packages OK [21:43:25] RECOVERY - Apache HTTP on srv272 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.027 second response time [21:43:25] RECOVERY - Disk space on mw4 is OK: DISK OK [21:43:25] RECOVERY - Disk space on srv271 is OK: DISK OK [21:43:25] RECOVERY - SSH on mw45 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:43:25] RECOVERY - DPKG on srv279 is OK: All packages OK [21:43:35] RECOVERY - Disk space on srv211 is OK: DISK OK [21:43:45] RECOVERY - SSH on mw40 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:44:5] RECOVERY - Disk space on srv190 is OK: DISK OK [21:44:5] RECOVERY - Disk space on srv209 is OK: DISK OK [21:44:5] RECOVERY - RAID on srv226 is OK: OK: no RAID installed [21:44:15] RECOVERY - ps1-d2-pmtpa-infeed-load-tower-B-phase-Y on ps1-d2-pmtpa is OK: ps1-d2-pmtpa-infeed-load-tower-B-phase-Y OK - 50 [21:44:25] RECOVERY - Disk space on srv238 is OK: DISK OK [21:44:25] RECOVERY - SSH on srv267 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:44:25] RECOVERY - DPKG on srv263 is OK: All packages OK [21:44:35] RECOVERY - SSH on mw37 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:44:35] RECOVERY - Disk space on mw34 is OK: DISK OK [21:44:35] RECOVERY - Disk space on mw39 is OK: DISK OK [21:44:35] RECOVERY - Apache HTTP on mw35 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.031 second response time [21:44:35] RECOVERY - RAID on mw35 is OK: OK: no RAID installed [21:44:36] RECOVERY - DPKG on mw37 is OK: All packages OK [21:44:45] RECOVERY - Disk space on mw44 is OK: DISK OK [21:44:45] RECOVERY - Apache HTTP on srv207 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.035 second response time [21:44:45] RECOVERY - DPKG on mw42 is OK: All packages OK [21:44:45] RECOVERY - Disk space on mw24 is OK: DISK OK [21:44:55] RECOVERY - ps1-b2-sdtpa-infeed-load-tower-A-phase-X on ps1-b2-sdtpa is OK: ps1-b2-sdtpa-infeed-load-tower-A-phase-X OK - 1088 [21:44:55] RECOVERY - Disk space on srv276 is OK: DISK OK [21:45:5] RECOVERY - SSH on srv236 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:45:5] RECOVERY - Apache HTTP on srv194 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [21:45:5] RECOVERY - Disk space on srv225 is OK: DISK OK [21:45:5] RECOVERY - RAID on mw10 is OK: OK: no RAID installed [21:45:15] RECOVERY - Disk space on mw49 is OK: DISK OK [21:45:15] RECOVERY - Disk space on mw29 is OK: DISK OK [21:45:25] RECOVERY - Disk space on mw9 is OK: DISK OK [21:45:45] RECOVERY - Apache HTTP on srv263 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [21:45:45] RECOVERY - RAID on mw12 is OK: OK: no RAID installed [21:45:45] RECOVERY - DPKG on srv209 is OK: All packages OK [21:46:5] RECOVERY - Apache HTTP on mw9 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.018 second response time [21:46:25] RECOVERY - Apache HTTP on mw3 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.016 second response time [21:46:35] RECOVERY - Disk space on srv231 is OK: DISK OK [21:46:35] RECOVERY - DPKG on srv228 is OK: All packages OK [21:46:45] RECOVERY - ps1-b5-sdtpa-infeed-load-tower-A-phase-X on ps1-b5-sdtpa is OK: ps1-b5-sdtpa-infeed-load-tower-A-phase-X OK - 1650 [21:46:54] RECOVERY - SSH on mw33 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:46:55] RECOVERY - DPKG on mw58 is OK: All packages OK [21:47:5] RECOVERY - Disk space on srv194 is OK: DISK OK [21:47:5] RECOVERY - Apache HTTP on srv195 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [21:47:15] RECOVERY - Apache HTTP on mw15 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.029 second response time [21:47:24] RECOVERY - Disk space on srv274 is OK: DISK OK [21:47:24] RECOVERY - SSH on srv283 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:47:24] RECOVERY - DPKG on srv272 is OK: All packages OK [21:47:24] RECOVERY - DPKG on srv283 is OK: All packages OK [21:47:25] RECOVERY - Apache HTTP on srv280 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.034 second response time [21:47:25] RECOVERY - DPKG on srv277 is OK: All packages OK [21:47:55] RECOVERY - SSH on srv247 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:47:55] RECOVERY - DPKG on srv247 is OK: All packages OK [21:47:55] RECOVERY - RAID on srv245 is OK: OK: no RAID installed [21:48:5] RECOVERY - SSH on srv204 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:48:5] RECOVERY - RAID on srv210 is OK: OK: no RAID installed [21:48:5] RECOVERY - ps1-a2-eqiad-infeed-load-tower-A-phase-Y on ps1-a2-eqiad is OK: ps1-a2-eqiad-infeed-load-tower-A-phase-Y OK - 650 [21:48:15] RECOVERY - DPKG on mw45 is OK: All packages OK [21:48:25] RECOVERY - DPKG on mw10 is OK: All packages OK [21:48:45] RECOVERY - SSH on srv275 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:48:45] RECOVERY - Disk space on srv277 is OK: DISK OK [21:48:45] RECOVERY - Apache HTTP on srv278 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.020 second response time [21:48:45] RECOVERY - RAID on srv278 is OK: OK: no RAID installed [21:48:45] RECOVERY - RAID on srv273 is OK: OK: no RAID installed [21:48:46] RECOVERY - DPKG on srv275 is OK: All packages OK [21:48:55] RECOVERY - Disk space on srv201 is OK: DISK OK [21:48:55] RECOVERY - ps1-b4-eqiad-infeed-load-tower-A-phase-Y on ps1-b4-eqiad is OK: ps1-b4-eqiad-infeed-load-tower-A-phase-Y OK - 675 [21:49:24] RECOVERY - Disk space on srv263 is OK: DISK OK [21:49:24] RECOVERY - DPKG on mw40 is OK: All packages OK [21:50:5] RECOVERY - ps1-c2-sdtpa-infeed-load-tower-A-phase-X on ps1-c2-sdtpa is OK: ps1-c2-sdtpa-infeed-load-tower-A-phase-X OK - 1000 [21:50:15] RECOVERY - Disk space on srv205 is OK: DISK OK [21:50:44] RECOVERY - ps1-b1-sdtpa-infeed-load-tower-A-phase-Z on ps1-b1-sdtpa is OK: ps1-b1-sdtpa-infeed-load-tower-A-phase-Z OK - 659 [21:50:55] RECOVERY - ps1-d1-pmtpa-infeed-load-tower-B-phase-X on ps1-d1-pmtpa is OK: ps1-d1-pmtpa-infeed-load-tower-B-phase-X OK - 163 [21:50:55] RECOVERY - RAID on mw7 is OK: OK: no RAID installed [21:50:55] RECOVERY - Apache HTTP on mw51 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.037 second response time [21:51:5] RECOVERY - ps1-d1-pmtpa-infeed-load-tower-A-phase-Y on ps1-d1-pmtpa is OK: ps1-d1-pmtpa-infeed-load-tower-A-phase-Y OK - 113 [21:51:5] RECOVERY - DPKG on mw18 is OK: All packages OK [21:51:35] RECOVERY - SSH on mw48 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:51:39] New patchset: Asher; "exim: use db48/49 to check incoming otrs mail" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1544 [21:51:44] RECOVERY - SSH on srv198 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:51:45] RECOVERY - Apache HTTP on srv229 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [21:51:45] RECOVERY - RAID on mw11 is OK: OK: no RAID installed [21:52:24] RECOVERY - SSH on srv238 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:52:25] RECOVERY - RAID on srv212 is OK: OK: no RAID installed [21:52:35] RECOVERY - Apache HTTP on mw58 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [21:52:35] RECOVERY - DPKG on mw41 is OK: All packages OK [21:52:35] RECOVERY - DPKG on mw22 is OK: All packages OK [21:52:35] RECOVERY - DPKG on mw46 is OK: All packages OK [21:52:44] RECOVERY - ps1-d3-sdtpa-infeed-load-tower-A-phase-Y on ps1-d3-sdtpa is OK: ps1-d3-sdtpa-infeed-load-tower-A-phase-Y OK - 1363 [21:52:44] RECOVERY - RAID on srv197 is OK: OK: no RAID installed [21:52:55] RECOVERY - SSH on srv196 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:53:15] RECOVERY - Disk space on mw35 is OK: DISK OK [21:53:15] RECOVERY - ps1-a7-eqiad-infeed-load-tower-A-phase-Y on ps1-a7-eqiad is OK: ps1-a7-eqiad-infeed-load-tower-A-phase-Y OK - 588 [21:53:24] RECOVERY - Disk space on srv226 is OK: DISK OK [21:53:24] RECOVERY - DPKG on mw6 is OK: All packages OK [21:53:35] RECOVERY - Disk space on srv195 is OK: DISK OK [21:53:35] RECOVERY - Disk space on srv199 is OK: DISK OK [21:53:35] RECOVERY - RAID on srv205 is OK: OK: no RAID installed [21:53:44] RECOVERY - Disk space on mw11 is OK: DISK OK [21:53:44] RECOVERY - Apache HTTP on srv208 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [21:53:44] RECOVERY - DPKG on srv270 is OK: All packages OK [21:53:44] RECOVERY - RAID on srv268 is OK: OK: no RAID installed [21:54:15] RECOVERY - SSH on mw2 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:54:24] RECOVERY - ps1-b1-sdtpa-infeed-load-tower-B-phase-Y on ps1-b1-sdtpa is OK: ps1-b1-sdtpa-infeed-load-tower-B-phase-Y OK - 593 [21:54:25] RECOVERY - Apache HTTP on mw39 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [21:54:35] RECOVERY - SSH on srv261 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:54:35] RECOVERY - RAID on srv287 is OK: OK: no RAID installed [21:54:45] RECOVERY - SSH on mw30 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:54:45] RECOVERY - SSH on mw35 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:54:45] RECOVERY - Disk space on mw32 is OK: DISK OK [21:54:45] RECOVERY - Apache HTTP on mw33 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [21:54:45] RECOVERY - RAID on mw33 is OK: OK: no RAID installed [21:54:46] RECOVERY - DPKG on mw35 is OK: All packages OK [21:55:5] RECOVERY - Disk space on mw40 is OK: DISK OK [21:55:5] RECOVERY - SSH on mw7 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:55:5] RECOVERY - ps1-b2-sdtpa-infeed-load-tower-B-phase-X on ps1-b2-sdtpa is OK: ps1-b2-sdtpa-infeed-load-tower-B-phase-X OK - 1113 [21:55:5] RECOVERY - ps1-c1-sdtpa-infeed-load-tower-A-phase-Z on ps1-c1-sdtpa is OK: ps1-c1-sdtpa-infeed-load-tower-A-phase-Z OK - 588 [21:55:5] RECOVERY - ps1-b6-eqiad-infeed-load-tower-A-phase-X on ps1-b6-eqiad is OK: ps1-b6-eqiad-infeed-load-tower-A-phase-X OK - 213 [21:55:35] RECOVERY - Disk space on mw33 is OK: DISK OK [21:55:44] RECOVERY - RAID on mw8 is OK: OK: no RAID installed [21:55:55] RECOVERY - SSH on srv229 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:55:55] RECOVERY - SSH on mw14 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:55:55] RECOVERY - Apache HTTP on srv279 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.020 second response time [21:55:55] RECOVERY - RAID on srv269 is OK: OK: no RAID installed [21:55:56] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1544 [21:55:56] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1544 [21:56:5] RECOVERY - Disk space on mw6 is OK: DISK OK [21:56:5] RECOVERY - Apache HTTP on srv197 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [21:56:5] RECOVERY - ps1-b4-eqiad-infeed-load-tower-A-phase-Z on ps1-b4-eqiad is OK: ps1-b4-eqiad-infeed-load-tower-A-phase-Z OK - 575 [21:56:15] RECOVERY - Disk space on srv240 is OK: DISK OK [21:56:24] RECOVERY - SSH on mw13 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:56:24] RECOVERY - Disk space on srv282 is OK: DISK OK [21:56:25] RECOVERY - DPKG on srv289 is OK: All packages OK [21:56:35] RECOVERY - ps1-d1-pmtpa-infeed-load-tower-B-phase-Z on ps1-d1-pmtpa is OK: ps1-d1-pmtpa-infeed-load-tower-B-phase-Z OK - 250 [21:56:35] RECOVERY - ps1-d3-pmtpa-infeed-load-tower-B-phase-Y on ps1-d3-pmtpa is OK: ps1-d3-pmtpa-infeed-load-tower-B-phase-Y OK - 50 [21:56:35] RECOVERY - ps1-d1-pmtpa-infeed-load-tower-A-phase-X on ps1-d1-pmtpa is OK: ps1-d1-pmtpa-infeed-load-tower-A-phase-X OK - 225 [21:56:55] RECOVERY - ps1-d2-pmtpa-infeed-load-tower-B-phase-Z on ps1-d2-pmtpa is OK: ps1-d2-pmtpa-infeed-load-tower-B-phase-Z OK - 0 [21:56:55] RECOVERY - DPKG on mw33 is OK: All packages OK [21:57:5] RECOVERY - SSH on srv227 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:57:5] RECOVERY - Disk space on mw42 is OK: DISK OK [21:57:5] RECOVERY - Apache HTTP on srv262 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.018 second response time [21:57:5] RECOVERY - ps1-b1-eqiad-infeed-load-tower-B-phase-X on ps1-b1-eqiad is OK: ps1-b1-eqiad-infeed-load-tower-B-phase-X OK - 600 [21:57:15] RECOVERY - SSH on mw32 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:57:15] RECOVERY - ps1-a1-sdtpa-infeed-load-tower-A-phase-Z on ps1-a1-sdtpa is OK: ps1-a1-sdtpa-infeed-load-tower-A-phase-Z OK - 613 [21:57:25] RECOVERY - SSH on mw52 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:57:35] RECOVERY - Apache HTTP on srv190 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [21:57:35] RECOVERY - RAID on srv193 is OK: OK: no RAID installed [21:57:35] RECOVERY - RAID on srv190 is OK: OK: no RAID installed [21:57:35] RECOVERY - DPKG on srv195 is OK: All packages OK [21:58:4] RECOVERY - SSH on srv207 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:58:4] RECOVERY - RAID on srv258 is OK: OK: no RAID installed [21:58:14] RECOVERY - ps1-a4-sdtpa-infeed-load-tower-A-phase-X on ps1-a4-sdtpa is OK: ps1-a4-sdtpa-infeed-load-tower-A-phase-X OK - 1325 [21:58:25] RECOVERY - SSH on srv225 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:58:25] RECOVERY - SSH on srv230 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [21:58:25] RECOVERY - Disk space on srv227 is OK: DISK OK [21:58:25] RECOVERY - RAID on srv228 is OK: OK: no RAID installed [21:58:25] RECOVERY - Apache HTTP on srv228 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [21:58:26] RECOVERY - DPKG on srv230 is OK: All packages OK [21:58:35] RECOVERY - Apache HTTP on srv260 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.020 second response time [21:58:44] RECOVERY - ps1-b6-eqiad-infeed-load-tower-A-phase-Z on ps1-b6-eqiad is OK: ps1-b6-eqiad-infeed-load-tower-A-phase-Z OK - 250 [21:58:44] RECOVERY - RAID on srv263 is OK: OK: no RAID installed [21:58:44] RECOVERY - DPKG on srv207 is OK: All packages OK [22:4:44] !log exim on mchenry is now using db48/49 for otrs mail verification [22:4:52] Logged the message, Master [22:13:38] !log otrs on williams migrated to db48 [22:13:46] Logged the message, Master [22:17:34] New patchset: Asher; "db48 is now a masterdb48 is now a masterdb48 is now a masterdb48 is now a master" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1545 [22:17:57] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1545 [22:17:57] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1545 [22:19:21] anyone have an idea why there are so many (Return code of 127 is out of bounds - plugin may be missing) errors ? [22:22:43] New patchset: Pyoungmeister; "adding https to noc.w.o" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1546 [22:25:27] PROBLEM - Puppet freshness on ssl3002 is CRITICAL: Puppet has not run in the last 10 hours [22:25:27] PROBLEM - Puppet freshness on ssl2 is CRITICAL: Puppet has not run in the last 10 hours [22:25:27] PROBLEM - Puppet freshness on ssl3004 is CRITICAL: Puppet has not run in the last 10 hours [22:31:22] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1546 [22:35:36] !log dropping otrs from db9 [22:35:44] Logged the message, Master [22:36:28] grr, the 127's are back [22:37:58] !log ibdata1 on db9 now has 208GB of free space (InnoDB free: 218228736 kB) [22:38:6] Logged the message, Master [22:38:56] Omh [22:44:32] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1546 [22:44:53] 127's are same as last time, nagios is failing to spawn checks with E2BIG errors again [23:0:46] RECOVERY - ps1-b1-sdtpa-infeed-load-tower-A-phase-Y on ps1-b1-sdtpa is OK: ps1-b1-sdtpa-infeed-load-tower-A-phase-Y OK - 654 [23:0:46] RECOVERY - ps1-b4-eqiad-infeed-load-tower-B-phase-Y on ps1-b4-eqiad is OK: ps1-b4-eqiad-infeed-load-tower-B-phase-Y OK - 675 [23:0:46] RECOVERY - ps1-b5-eqiad-infeed-load-tower-A-phase-X on ps1-b5-eqiad is OK: ps1-b5-eqiad-infeed-load-tower-A-phase-X OK - 638 [23:0:46] RECOVERY - ps1-a7-eqiad-infeed-load-tower-B-phase-Y on ps1-a7-eqiad is OK: ps1-a7-eqiad-infeed-load-tower-B-phase-Y OK - 613 [23:0:47] RECOVERY - ps1-a3-eqiad-infeed-load-tower-A-phase-Z on ps1-a3-eqiad is OK: ps1-a3-eqiad-infeed-load-tower-A-phase-Z OK - 700 [23:2:7] RECOVERY - SSH on mw20 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:2:7] RECOVERY - Disk space on mw22 is OK: DISK OK [23:2:7] RECOVERY - RAID on mw19 is OK: OK: no RAID installed [23:2:7] RECOVERY - DPKG on mw20 is OK: All packages OK [23:2:7] RECOVERY - Apache HTTP on mw24 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.040 second response time [23:2:7] RECOVERY - RAID on mw24 is OK: OK: no RAID installed [23:2:17] RECOVERY - SSH on mw49 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:2:17] RECOVERY - Disk space on mw51 is OK: DISK OK [23:2:17] RECOVERY - SSH on mw54 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:2:17] RECOVERY - Apache HTTP on mw52 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.036 second response time [23:2:17] RECOVERY - RAID on mw52 is OK: OK: no RAID installed [23:2:18] RECOVERY - DPKG on mw54 is OK: All packages OK [23:2:26] !log restarted nagios with enable_environment_macros = 0 [23:2:26] RECOVERY - Apache HTTP on mw8 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [23:2:34] Logged the message, Master [23:3:7] RECOVERY - RAID on srv203 is OK: OK: no RAID installed [23:3:7] RECOVERY - SSH on srv205 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:3:7] RECOVERY - Disk space on srv208 is OK: DISK OK [23:3:7] RECOVERY - Apache HTTP on srv209 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:3:7] RECOVERY - DPKG on srv205 is OK: All packages OK [23:3:7] RECOVERY - RAID on srv209 is OK: OK: no RAID installed [23:3:37] RECOVERY - RAID on mw14 is OK: OK: no RAID installed [23:3:48] New patchset: Asher; "more complete fix for checks failing with argument list too big errors in large host groups" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1547 [23:4:7] New review: Asher; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1547 [23:4:7] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1547 [23:5:7] RECOVERY - ps1-b3-eqiad-infeed-load-tower-B-phase-X on ps1-b3-eqiad is OK: ps1-b3-eqiad-infeed-load-tower-B-phase-X OK - 950 [23:5:7] RECOVERY - ps1-a6-eqiad-infeed-load-tower-A-phase-X on ps1-a6-eqiad is OK: ps1-a6-eqiad-infeed-load-tower-A-phase-X OK - 600 [23:5:7] RECOVERY - ps1-b8-eqiad-infeed-load-tower-B-phase-Y on ps1-b8-eqiad is OK: ps1-b8-eqiad-infeed-load-tower-B-phase-Y OK - 625 [23:7:21] RECOVERY - DPKG on mw26 is OK: All packages OK [23:7:31] RECOVERY - Disk space on mw56 is OK: DISK OK [23:7:31] RECOVERY - Apache HTTP on mw57 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [23:8:21] RECOVERY - Disk space on srv213 is OK: DISK OK [23:8:31] RECOVERY - Apache HTTP on srv233 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [23:8:31] RECOVERY - SSH on srv235 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:8:31] RECOVERY - RAID on srv233 is OK: OK: no RAID installed [23:8:31] RECOVERY - Apache HTTP on srv238 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [23:8:31] RECOVERY - DPKG on srv235 is OK: All packages OK [23:8:31] RECOVERY - Disk space on srv237 is OK: DISK OK [23:8:51] RECOVERY - SSH on srv285 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:8:51] RECOVERY - Apache HTTP on srv282 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [23:8:51] RECOVERY - RAID on srv282 is OK: OK: no RAID installed [23:8:51] RECOVERY - Disk space on srv287 is OK: DISK OK [23:8:51] RECOVERY - DPKG on srv285 is OK: All packages OK [23:9:31] RECOVERY - RAID on mw57 is OK: OK: no RAID installed [23:9:31] RECOVERY - ps1-a2-sdtpa-infeed-load-tower-B-phase-X on ps1-a2-sdtpa is OK: ps1-a2-sdtpa-infeed-load-tower-B-phase-X OK - 788 [23:9:31] RECOVERY - ps1-a5-eqiad-infeed-load-tower-B-phase-Y on ps1-a5-eqiad is OK: ps1-a5-eqiad-infeed-load-tower-B-phase-Y OK - 1038 [23:9:31] RECOVERY - ps1-b1-eqiad-infeed-load-tower-A-phase-Z on ps1-b1-eqiad is OK: ps1-b1-eqiad-infeed-load-tower-A-phase-Z OK - 575 [23:9:51] RECOVERY - Disk space on mw13 is OK: DISK OK [23:10:41] RECOVERY - Disk space on srv232 is OK: DISK OK [23:10:41] RECOVERY - SSH on srv245 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:11:1] RECOVERY - SSH on mw59 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:11:21] RECOVERY - Disk space on mw16 is OK: DISK OK [23:11:21] RECOVERY - RAID on mw17 is OK: OK: no RAID installed [23:11:31] RECOVERY - DPKG on srv268 is OK: All packages OK [23:11:31] RECOVERY - Apache HTTP on srv265 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [23:11:51] RECOVERY - ps1-a8-eqiad-infeed-load-tower-B-phase-X on ps1-a8-eqiad is OK: ps1-a8-eqiad-infeed-load-tower-B-phase-X OK - 384 [23:12:11] RECOVERY - ps1-a7-eqiad-infeed-load-tower-B-phase-Z on ps1-a7-eqiad is OK: ps1-a7-eqiad-infeed-load-tower-B-phase-Z OK - 638 [23:12:11] RECOVERY - ps1-a7-eqiad-infeed-load-tower-A-phase-X on ps1-a7-eqiad is OK: ps1-a7-eqiad-infeed-load-tower-A-phase-X OK - 625 [23:12:21] RECOVERY - ps1-a4-eqiad-infeed-load-tower-B-phase-X on ps1-a4-eqiad is OK: ps1-a4-eqiad-infeed-load-tower-B-phase-X OK - 600 [23:13:11] RECOVERY - Apache HTTP on srv196 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [23:13:31] RECOVERY - RAID on srv243 is OK: OK: no RAID installed [23:13:51] RECOVERY - RAID on mw47 is OK: OK: no RAID installed [23:14:11] RECOVERY - SSH on srv288 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:14:51] RECOVERY - Apache HTTP on srv276 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:14:51] RECOVERY - RAID on srv276 is OK: OK: no RAID installed [23:15:1] RECOVERY - ps1-b2-sdtpa-infeed-load-tower-B-phase-Y on ps1-b2-sdtpa is OK: ps1-b2-sdtpa-infeed-load-tower-B-phase-Y OK - 1088 [23:15:21] RECOVERY - Disk space on srv270 is OK: DISK OK [23:15:21] RECOVERY - DPKG on srv273 is OK: All packages OK [23:15:21] RECOVERY - Disk space on srv280 is OK: DISK OK [23:16:11] RECOVERY - Disk space on mw41 is OK: DISK OK [23:16:41] RECOVERY - Disk space on mw18 is OK: DISK OK [23:16:41] RECOVERY - RAID on mw42 is OK: OK: no RAID installed [23:17:11] RECOVERY - SSH on mw38 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:17:21] RECOVERY - SSH on srv240 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:17:41] RECOVERY - Disk space on mw19 is OK: DISK OK [23:17:41] RECOVERY - DPKG on mw17 is OK: All packages OK [23:18:1] RECOVERY - DPKG on srv271 is OK: All packages OK [23:18:1] RECOVERY - DPKG on srv288 is OK: All packages OK [23:18:11] RECOVERY - Disk space on mw38 is OK: DISK OK [23:18:11] RECOVERY - RAID on mw39 is OK: OK: no RAID installed [23:18:31] RECOVERY - ps1-b7-eqiad-infeed-load-tower-B-phase-Z on ps1-b7-eqiad is OK: ps1-b7-eqiad-infeed-load-tower-B-phase-Z OK - 638 [23:18:41] RECOVERY - Disk space on mw28 is OK: DISK OK [23:18:41] RECOVERY - SSH on mw47 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:19:11] RECOVERY - Apache HTTP on srv271 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:19:11] RECOVERY - SSH on mw1 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:19:11] RECOVERY - RAID on srv236 is OK: OK: no RAID installed [23:19:41] RECOVERY - ps1-b4-eqiad-infeed-load-tower-B-phase-Z on ps1-b4-eqiad is OK: ps1-b4-eqiad-infeed-load-tower-B-phase-Z OK - 575 [23:19:41] RECOVERY - ps1-b7-eqiad-infeed-load-tower-A-phase-Y on ps1-b7-eqiad is OK: ps1-b7-eqiad-infeed-load-tower-A-phase-Y OK - 600 [23:19:51] RECOVERY - SSH on srv265 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:19:51] RECOVERY - Apache HTTP on srv198 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [23:19:51] RECOVERY - ps1-a1-eqiad-infeed-load-tower-A-phase-Y on ps1-a1-eqiad is OK: ps1-a1-eqiad-infeed-load-tower-A-phase-Y OK - 518 [23:19:51] RECOVERY - DPKG on mw11 is OK: All packages OK [23:20:1] RECOVERY - Apache HTTP on srv288 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.019 second response time [23:20:11] RECOVERY - SSH on srv201 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:20:11] RECOVERY - RAID on srv199 is OK: OK: no RAID installed [23:20:11] RECOVERY - Disk space on srv203 is OK: DISK OK [23:20:11] RECOVERY - DPKG on srv201 is OK: All packages OK [23:20:11] RECOVERY - Apache HTTP on srv204 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [23:20:11] RECOVERY - RAID on srv204 is OK: OK: no RAID installed [23:20:21] RECOVERY - ps1-b5-eqiad-infeed-load-tower-B-phase-Z on ps1-b5-eqiad is OK: ps1-b5-eqiad-infeed-load-tower-B-phase-Z OK - 613 [23:20:31] RECOVERY - DPKG on mw30 is OK: All packages OK [23:20:31] RECOVERY - DPKG on mw38 is OK: All packages OK [23:20:51] RECOVERY - Apache HTTP on srv239 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:20:51] RECOVERY - DPKG on srv236 is OK: All packages OK [23:20:51] RECOVERY - RAID on srv239 is OK: OK: no RAID installed [23:21:1] RECOVERY - ps1-c2-sdtpa-infeed-load-tower-A-phase-Z on ps1-c2-sdtpa is OK: ps1-c2-sdtpa-infeed-load-tower-A-phase-Z OK - 950 [23:21:21] RECOVERY - ps1-c2-sdtpa-infeed-load-tower-B-phase-X on ps1-c2-sdtpa is OK: ps1-c2-sdtpa-infeed-load-tower-B-phase-X OK - 1025 [23:21:21] RECOVERY - SSH on srv233 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:21:31] RECOVERY - Apache HTTP on srv283 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.020 second response time [23:21:31] RECOVERY - RAID on srv283 is OK: OK: no RAID installed [23:21:31] RECOVERY - DPKG on srv286 is OK: All packages OK [23:21:41] RECOVERY - SSH on srv228 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:21:41] RECOVERY - Disk space on srv230 is OK: DISK OK [23:22:1] RECOVERY - Apache HTTP on mw14 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:22:1] RECOVERY - DPKG on mw24 is OK: All packages OK [23:22:31] RECOVERY - RAID on mw38 is OK: OK: no RAID installed [23:22:41] RECOVERY - SSH on mw12 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:22:41] RECOVERY - Apache HTTP on mw47 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.028 second response time [23:23:11] RECOVERY - SSH on srv268 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:23:31] RECOVERY - Apache HTTP on mw1 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:24:11] RECOVERY - ps1-d1-pmtpa-infeed-load-tower-A-phase-Z on ps1-d1-pmtpa is OK: ps1-d1-pmtpa-infeed-load-tower-A-phase-Z OK - 313 [23:24:31] RECOVERY - ps1-b2-sdtpa-infeed-load-tower-B-phase-Z on ps1-b2-sdtpa is OK: ps1-b2-sdtpa-infeed-load-tower-B-phase-Z OK - 1025 [23:25:21] RECOVERY - Apache HTTP on mw38 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [23:25:41] RECOVERY - SSH on srv258 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:26:11] RECOVERY - RAID on srv207 is OK: OK: no RAID installed [23:26:41] RECOVERY - SSH on mw39 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:26:41] RECOVERY - DPKG on mw39 is OK: All packages OK [23:27:1] RECOVERY - ps1-c3-sdtpa-infeed-load-tower-A-phase-Y on ps1-c3-sdtpa is OK: ps1-c3-sdtpa-infeed-load-tower-A-phase-Y OK - 25 [23:27:11] RECOVERY - Apache HTTP on srv289 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:27:11] RECOVERY - DPKG on mw49 is OK: All packages OK [23:27:11] RECOVERY - DPKG on srv269 is OK: All packages OK [23:27:31] RECOVERY - ps1-a5-sdtpa-infeed-load-tower-A-phase-Y on ps1-a5-sdtpa is OK: ps1-a5-sdtpa-infeed-load-tower-A-phase-Y OK - 2388 [23:27:31] RECOVERY - ps1-a5-eqiad-infeed-load-tower-A-phase-Z on ps1-a5-eqiad is OK: ps1-a5-eqiad-infeed-load-tower-A-phase-Z OK - 1025 [23:27:31] RECOVERY - ps1-a6-eqiad-infeed-load-tower-B-phase-X on ps1-a6-eqiad is OK: ps1-a6-eqiad-infeed-load-tower-B-phase-X OK - 638 [23:27:41] RECOVERY - ps1-b1-sdtpa-infeed-load-tower-B-phase-X on ps1-b1-sdtpa is OK: ps1-b1-sdtpa-infeed-load-tower-B-phase-X OK - 681 [23:27:51] RECOVERY - Disk space on srv264 is OK: DISK OK [23:28:11] RECOVERY - ps1-d1-sdtpa-infeed-load-tower-A-phase-Z on ps1-d1-sdtpa is OK: ps1-d1-sdtpa-infeed-load-tower-A-phase-Z OK - 2000 [23:28:31] RECOVERY - ps1-d3-sdtpa-infeed-load-tower-A-phase-X on ps1-d3-sdtpa is OK: ps1-d3-sdtpa-infeed-load-tower-A-phase-X OK - 1300 [23:29:21] RECOVERY - DPKG on srv203 is OK: All packages OK [23:29:31] RECOVERY - RAID on mw43 is OK: OK: no RAID installed [23:29:51] RECOVERY - SSH on srv239 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:29:51] RECOVERY - Apache HTTP on srv242 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.027 second response time [23:29:51] RECOVERY - SSH on srv244 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:29:51] RECOVERY - RAID on srv242 is OK: OK: no RAID installed [23:29:51] RECOVERY - DPKG on srv244 is OK: All packages OK [23:30:1] RECOVERY - RAID on srv238 is OK: OK: no RAID installed [23:30:1] RECOVERY - RAID on srv288 is OK: OK: no RAID installed [23:30:21] RECOVERY - RAID on mw2 is OK: OK: no RAID installed [23:30:31] RECOVERY - Disk space on mw37 is OK: DISK OK [23:30:41] RECOVERY - Disk space on srv279 is OK: DISK OK [23:30:41] RECOVERY - RAID on srv275 is OK: OK: no RAID installed [23:30:41] RECOVERY - RAID on srv280 is OK: OK: no RAID installed [23:30:41] RECOVERY - SSH on srv277 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:30:51] RECOVERY - SSH on srv200 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:31:21] RECOVERY - SSH on srv212 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:31:21] RECOVERY - DPKG on srv262 is OK: All packages OK [23:31:31] PROBLEM - Puppet freshness on ssl3003 is CRITICAL: Puppet has not run in the last 10 hours [23:31:41] RECOVERY - SSH on srv279 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:31:51] RECOVERY - ps1-b3-eqiad-infeed-load-tower-B-phase-Z on ps1-b3-eqiad is OK: ps1-b3-eqiad-infeed-load-tower-B-phase-Z OK - 1025 [23:32:1] RECOVERY - Apache HTTP on mw40 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.029 second response time [23:33:21] RECOVERY - SSH on mw10 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:33:31] RECOVERY - Disk space on srv228 is OK: DISK OK [23:33:41] RECOVERY - DPKG on mw21 is OK: All packages OK [23:34:11] RECOVERY - ps1-b8-eqiad-infeed-load-tower-B-phase-Z on ps1-b8-eqiad is OK: ps1-b8-eqiad-infeed-load-tower-B-phase-Z OK - 650 [23:34:21] RECOVERY - SSH on mw26 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:34:21] RECOVERY - Apache HTTP on mw28 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:34:21] RECOVERY - DPKG on mw43 is OK: All packages OK [23:34:21] RECOVERY - DPKG on mw3 is OK: All packages OK [23:34:31] RECOVERY - Disk space on srv198 is OK: DISK OK [23:34:41] RECOVERY - SSH on srv271 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:35:21] RECOVERY - SSH on srv282 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:35:31] RECOVERY - SSH on srv274 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:35:31] RECOVERY - RAID on srv237 is OK: OK: no RAID installed [23:35:31] RECOVERY - DPKG on srv258 is OK: All packages OK [23:36:11] RECOVERY - ps1-c1-sdtpa-infeed-load-tower-B-phase-X on ps1-c1-sdtpa is OK: ps1-c1-sdtpa-infeed-load-tower-B-phase-X OK - 425 [23:36:11] RECOVERY - ps1-b8-eqiad-infeed-load-tower-A-phase-Z on ps1-b8-eqiad is OK: ps1-b8-eqiad-infeed-load-tower-A-phase-Z OK - 613 [23:36:21] RECOVERY - Disk space on mw7 is OK: DISK OK [23:36:21] RECOVERY - DPKG on srv241 is OK: All packages OK [23:36:31] RECOVERY - Disk space on mw15 is OK: DISK OK [23:36:31] RECOVERY - ps1-a6-eqiad-infeed-load-tower-B-phase-Z on ps1-a6-eqiad is OK: ps1-a6-eqiad-infeed-load-tower-B-phase-Z OK - 638 [23:36:31] RECOVERY - Apache HTTP on mw16 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:36:41] RECOVERY - Disk space on mw43 is OK: DISK OK [23:36:41] RECOVERY - SSH on mw46 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:36:41] RECOVERY - Apache HTTP on mw44 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [23:36:41] RECOVERY - RAID on mw44 is OK: OK: no RAID installed [23:37:1] RECOVERY - SSH on mw27 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:37:1] RECOVERY - Disk space on mw47 is OK: DISK OK [23:37:11] RECOVERY - Apache HTTP on srv264 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [23:37:41] RECOVERY - Disk space on mw45 is OK: DISK OK [23:37:41] RECOVERY - RAID on mw51 is OK: OK: no RAID installed [23:38:1] RECOVERY - Apache HTTP on srv275 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.018 second response time [23:38:11] RECOVERY - ps1-b6-eqiad-infeed-load-tower-B-phase-Y on ps1-b6-eqiad is OK: ps1-b6-eqiad-infeed-load-tower-B-phase-Y OK - 263 [23:38:31] RECOVERY - ps1-b3-sdtpa-infeed-load-tower-A-phase-Z on ps1-b3-sdtpa is OK: ps1-b3-sdtpa-infeed-load-tower-A-phase-Z OK - 1263 [23:38:41] RECOVERY - Apache HTTP on srv202 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.032 second response time [23:39:21] RECOVERY - SSH on srv208 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:39:21] RECOVERY - Disk space on srv210 is OK: DISK OK [23:39:21] RECOVERY - Apache HTTP on srv211 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [23:39:21] RECOVERY - Apache HTTP on mw42 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.032 second response time [23:39:31] RECOVERY - RAID on srv198 is OK: OK: no RAID installed [23:39:41] RECOVERY - ps1-a8-eqiad-infeed-load-tower-A-phase-Y on ps1-a8-eqiad is OK: ps1-a8-eqiad-infeed-load-tower-A-phase-Y OK - 375 [23:39:51] RECOVERY - ps1-b8-eqiad-infeed-load-tower-A-phase-X on ps1-b8-eqiad is OK: ps1-b8-eqiad-infeed-load-tower-A-phase-X OK - 625 [23:40:1] RECOVERY - RAID on srv225 is OK: OK: no RAID installed [23:40:1] RECOVERY - DPKG on srv227 is OK: All packages OK [23:40:1] RECOVERY - Disk space on srv229 is OK: DISK OK [23:40:1] RECOVERY - Apache HTTP on srv230 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [23:40:1] RECOVERY - RAID on srv230 is OK: OK: no RAID installed [23:40:11] RECOVERY - ps1-a1-sdtpa-infeed-load-tower-A-phase-X on ps1-a1-sdtpa is OK: ps1-a1-sdtpa-infeed-load-tower-A-phase-X OK - 713 [23:40:11] RECOVERY - DPKG on mw57 is OK: All packages OK [23:40:31] RECOVERY - Apache HTTP on srv258 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [23:40:31] RECOVERY - RAID on mw15 is OK: OK: no RAID installed [23:40:51] RECOVERY - SSH on srv270 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:40:51] RECOVERY - Apache HTTP on srv273 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.020 second response time [23:40:51] RECOVERY - Disk space on srv272 is OK: DISK OK [23:41:1] RECOVERY - SSH on srv194 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:41:11] RECOVERY - DPKG on srv267 is OK: All packages OK [23:41:21] RECOVERY - RAID on mw29 is OK: OK: no RAID installed [23:41:31] RECOVERY - Apache HTTP on srv277 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.016 second response time [23:41:41] RECOVERY - ps1-a1-eqiad-infeed-load-tower-B-phase-X on ps1-a1-eqiad is OK: ps1-a1-eqiad-infeed-load-tower-B-phase-X OK - 407 [23:42:11] RECOVERY - SSH on srv276 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:42:21] RECOVERY - Apache HTTP on mw2 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [23:42:21] RECOVERY - SSH on mw11 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:42:31] RECOVERY - Disk space on mw1 is OK: DISK OK [23:42:41] RECOVERY - Disk space on srv258 is OK: DISK OK [23:42:41] RECOVERY - RAID on srv232 is OK: OK: no RAID installed [23:43:1] RECOVERY - SSH on mw25 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:43:1] RECOVERY - ps1-b3-eqiad-infeed-load-tower-A-phase-Y on ps1-b3-eqiad is OK: ps1-b3-eqiad-infeed-load-tower-A-phase-Y OK - 950 [23:43:11] RECOVERY - ps1-b4-sdtpa-infeed-load-tower-A-phase-X on ps1-b4-sdtpa is OK: ps1-b4-sdtpa-infeed-load-tower-A-phase-X OK - 725 [23:43:11] RECOVERY - SSH on mw55 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:43:11] RECOVERY - DPKG on srv276 is OK: All packages OK [23:43:41] RECOVERY - Apache HTTP on mw25 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.032 second response time [23:43:41] RECOVERY - DPKG on srv234 is OK: All packages OK [23:43:41] RECOVERY - Apache HTTP on srv259 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.027 second response time [23:44:1] RECOVERY - ps1-b2-sdtpa-infeed-load-tower-A-phase-Y on ps1-b2-sdtpa is OK: ps1-b2-sdtpa-infeed-load-tower-A-phase-Y OK - 1050 [23:44:1] RECOVERY - ps1-d3-pmtpa-infeed-load-tower-A-phase-X on ps1-d3-pmtpa is OK: ps1-d3-pmtpa-infeed-load-tower-A-phase-X OK - 1050 [23:44:21] RECOVERY - RAID on srv270 is OK: OK: no RAID installed [23:44:31] RECOVERY - DPKG on mw9 is OK: All packages OK [23:44:31] RECOVERY - RAID on mw3 is OK: OK: no RAID installed [23:44:41] RECOVERY - DPKG on srv237 is OK: All packages OK [23:44:41] RECOVERY - ps1-c1-sdtpa-infeed-load-tower-A-phase-Y on ps1-c1-sdtpa is OK: ps1-c1-sdtpa-infeed-load-tower-A-phase-Y OK - 738 [23:45:1] RECOVERY - Apache HTTP on mw43 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [23:45:1] RECOVERY - DPKG on srv211 is OK: All packages OK [23:46:1] RECOVERY - Disk space on mw10 is OK: DISK OK [23:46:21] RECOVERY - SSH on mw28 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:46:21] RECOVERY - SSH on mw51 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:46:21] RECOVERY - Apache HTTP on mw4 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [23:46:21] RECOVERY - DPKG on mw28 is OK: All packages OK [23:46:31] RECOVERY - SSH on srv260 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:47:1] RECOVERY - Apache HTTP on mw32 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.027 second response time [23:47:1] RECOVERY - Apache HTTP on mw41 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [23:47:1] RECOVERY - SSH on srv226 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:47:11] RECOVERY - Disk space on srv260 is OK: DISK OK [23:47:21] RECOVERY - SSH on srv246 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:47:41] RECOVERY - DPKG on mw4 is OK: All packages OK [23:48:1] RECOVERY - SSH on mw53 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:48:1] RECOVERY - RAID on srv229 is OK: OK: no RAID installed [23:48:31] RECOVERY - Apache HTTP on srv274 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [23:48:41] RECOVERY - SSH on mw41 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:48:41] RECOVERY - RAID on mw26 is OK: OK: no RAID installed [23:48:41] RECOVERY - RAID on srv260 is OK: OK: no RAID installed [23:48:41] RECOVERY - RAID on srv194 is OK: OK: no RAID installed [23:48:51] RECOVERY - ps1-a4-eqiad-infeed-load-tower-A-phase-Z on ps1-a4-eqiad is OK: ps1-a4-eqiad-infeed-load-tower-A-phase-Z OK - 588 [23:48:51] RECOVERY - ps1-b6-eqiad-infeed-load-tower-B-phase-Z on ps1-b6-eqiad is OK: ps1-b6-eqiad-infeed-load-tower-B-phase-Z OK - 263 [23:48:51] RECOVERY - RAID on mw41 is OK: OK: no RAID installed [23:49:1] RECOVERY - RAID on mw56 is OK: OK: no RAID installed [23:49:1] RECOVERY - Disk space on mw54 is OK: DISK OK [23:49:1] RECOVERY - Apache HTTP on srv212 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [23:49:1] RECOVERY - ps1-a4-eqiad-infeed-load-tower-A-phase-Y on ps1-a4-eqiad is OK: ps1-a4-eqiad-infeed-load-tower-A-phase-Y OK - 600 [23:49:11] RECOVERY - Apache HTTP on srv225 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:49:11] RECOVERY - Disk space on srv239 is OK: DISK OK [23:49:11] RECOVERY - ps1-a2-eqiad-infeed-load-tower-B-phase-Y on ps1-a2-eqiad is OK: ps1-a2-eqiad-infeed-load-tower-B-phase-Y OK - 638 [23:49:21] RECOVERY - Disk space on srv246 is OK: DISK OK [23:49:31] RECOVERY - Apache HTTP on mw56 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.034 second response time [23:49:41] RECOVERY - RAID on srv202 is OK: OK: no RAID installed [23:50:12] RECOVERY - DPKG on mw44 is OK: All packages OK [23:50:21] RECOVERY - Disk space on mw3 is OK: DISK OK [23:50:21] RECOVERY - ps1-b2-eqiad-infeed-load-tower-B-phase-Y on ps1-b2-eqiad is OK: ps1-b2-eqiad-infeed-load-tower-B-phase-Y OK - 600 [23:50:41] RECOVERY - Apache HTTP on srv227 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [23:51:1] RECOVERY - DPKG on srv233 is OK: All packages OK [23:51:11] RECOVERY - Disk space on srv283 is OK: DISK OK [23:51:11] RECOVERY - DPKG on srv280 is OK: All packages OK [23:51:21] RECOVERY - Disk space on srv197 is OK: DISK OK [23:51:21] RECOVERY - SSH on srv262 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:51:21] RECOVERY - ps1-a1-eqiad-infeed-load-tower-A-phase-Z on ps1-a1-eqiad is OK: ps1-a1-eqiad-infeed-load-tower-A-phase-Z OK - 470 [23:51:41] RECOVERY - ps1-a8-eqiad-infeed-load-tower-A-phase-Z on ps1-a8-eqiad is OK: ps1-a8-eqiad-infeed-load-tower-A-phase-Z OK - 337 [23:52:11] RECOVERY - DPKG on mw25 is OK: All packages OK [23:52:21] RECOVERY - Disk space on srv200 is OK: DISK OK [23:52:21] RECOVERY - RAID on srv285 is OK: OK: no RAID installed [23:52:31] RECOVERY - ps1-b8-eqiad-infeed-load-tower-B-phase-X on ps1-b8-eqiad is OK: ps1-b8-eqiad-infeed-load-tower-B-phase-X OK - 625 [23:52:31] RECOVERY - ps1-b1-eqiad-infeed-load-tower-B-phase-Y on ps1-b1-eqiad is OK: ps1-b1-eqiad-infeed-load-tower-B-phase-Y OK - 663 [23:52:51] RECOVERY - Apache HTTP on mw34 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [23:53:1] RECOVERY - SSH on mw56 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:53:11] RECOVERY - DPKG on mw13 is OK: All packages OK [23:53:11] RECOVERY - ps1-b1-eqiad-infeed-load-tower-A-phase-Y on ps1-b1-eqiad is OK: ps1-b1-eqiad-infeed-load-tower-A-phase-Y OK - 650 [23:53:21] RECOVERY - Apache HTTP on srv247 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [23:53:31] RECOVERY - ps1-a4-sdtpa-infeed-load-tower-A-phase-Y on ps1-a4-sdtpa is OK: ps1-a4-sdtpa-infeed-load-tower-A-phase-Y OK - 1300 [23:53:41] RECOVERY - SSH on srv197 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:53:41] RECOVERY - DPKG on srv240 is OK: All packages OK [23:54:1] RECOVERY - Disk space on mw53 is OK: DISK OK [23:54:11] RECOVERY - ps1-b1-eqiad-infeed-load-tower-A-phase-X on ps1-b1-eqiad is OK: ps1-b1-eqiad-infeed-load-tower-A-phase-X OK - 613 [23:54:31] RECOVERY - ps1-d1-pmtpa-infeed-load-tower-B-phase-Y on ps1-d1-pmtpa is OK: ps1-d1-pmtpa-infeed-load-tower-B-phase-Y OK - 100 [23:54:41] RECOVERY - SSH on srv209 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:55:1] RECOVERY - SSH on srv213 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:55:11] RECOVERY - Disk space on srv267 is OK: DISK OK [23:55:31] RECOVERY - DPKG on srv274 is OK: All packages OK [23:55:41] RECOVERY - SSH on srv211 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:55:51] RECOVERY - Disk space on mw5 is OK: DISK OK [23:55:51] RECOVERY - DPKG on mw16 is OK: All packages OK [23:56:11] RECOVERY - RAID on srv195 is OK: OK: no RAID installed [23:56:21] RECOVERY - Disk space on mw25 is OK: DISK OK [23:56:51] RECOVERY - DPKG on srv242 is OK: All packages OK [23:57:1] RECOVERY - DPKG on mw2 is OK: All packages OK [23:57:1] RECOVERY - DPKG on srv226 is OK: All packages OK [23:57:21] RECOVERY - RAID on mw53 is OK: OK: no RAID installed [23:57:41] RECOVERY - SSH on mw44 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:57:41] RECOVERY - RAID on srv246 is OK: OK: no RAID installed [23:57:41] RECOVERY - ps1-b2-eqiad-infeed-load-tower-B-phase-Z on ps1-b2-eqiad is OK: ps1-b2-eqiad-infeed-load-tower-B-phase-Z OK - 600 [23:57:51] RECOVERY - DPKG on mw53 is OK: All packages OK [23:58:1] RECOVERY - DPKG on srv197 is OK: All packages OK [23:58:21] RECOVERY - Apache HTTP on srv244 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [23:58:31] RECOVERY - ps1-c3-sdtpa-infeed-load-tower-A-phase-Z on ps1-c3-sdtpa is OK: ps1-c3-sdtpa-infeed-load-tower-A-phase-Z OK - 263 [23:58:31] RECOVERY - Apache HTTP on mw22 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.033 second response time [23:58:41] RECOVERY - Disk space on mw14 is OK: DISK OK [23:58:41] RECOVERY - Apache HTTP on srv268 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.019 second response time [23:58:51] RECOVERY - ps1-c2-sdtpa-infeed-load-tower-B-phase-Z on ps1-c2-sdtpa is OK: ps1-c2-sdtpa-infeed-load-tower-B-phase-Z OK - 950 [23:59:31] RECOVERY - SSH on srv241 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [23:59:31] RECOVERY - Apache HTTP on srv240 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [23:59:41] RECOVERY - Disk space on mw46 is OK: DISK OK [23:59:41] RECOVERY - RAID on srv262 is OK: OK: no RAID installed [23:59:51] RECOVERY - DPKG on srv204 is OK: All packages OK [0:0:21] RECOVERY - SSH on srv232 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:0:21] RECOVERY - RAID on mw20 is OK: OK: no RAID installed [0:0:31] RECOVERY - SSH on srv195 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:0:41] RECOVERY - Disk space on mw55 is OK: DISK OK [0:0:41] RECOVERY - Apache HTTP on mw49 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [0:1:2] RECOVERY - Disk space on mw2 is OK: DISK OK [0:1:2] RECOVERY - ps1-c2-sdtpa-infeed-load-tower-A-phase-Y on ps1-c2-sdtpa is OK: ps1-c2-sdtpa-infeed-load-tower-A-phase-Y OK - 938 [0:1:2] RECOVERY - DPKG on srv246 is OK: All packages OK [0:1:11] RECOVERY - ps1-b4-sdtpa-infeed-load-tower-A-phase-Y on ps1-b4-sdtpa is OK: ps1-b4-sdtpa-infeed-load-tower-A-phase-Y OK - 575 [0:1:11] RECOVERY - Apache HTTP on srv285 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [0:1:21] RECOVERY - ps1-d1-sdtpa-infeed-load-tower-A-phase-Y on ps1-d1-sdtpa is OK: ps1-d1-sdtpa-infeed-load-tower-A-phase-Y OK - 1925 [0:1:31] RECOVERY - RAID on srv264 is OK: OK: no RAID installed [0:1:31] RECOVERY - SSH on srv280 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:1:41] RECOVERY - Disk space on srv242 is OK: DISK OK [0:1:41] RECOVERY - Disk space on srv286 is OK: DISK OK [0:1:41] RECOVERY - ps1-b3-eqiad-infeed-load-tower-B-phase-Y on ps1-b3-eqiad is OK: ps1-b3-eqiad-infeed-load-tower-B-phase-Y OK - 938 [0:1:51] RECOVERY - Apache HTTP on srv245 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [0:1:51] RECOVERY - RAID on mw16 is OK: OK: no RAID installed [0:2:11] RECOVERY - DPKG on srv239 is OK: All packages OK [0:2:31] RECOVERY - SSH on mw3 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:2:41] RECOVERY - Disk space on srv236 is OK: DISK OK [0:2:41] RECOVERY - DPKG on srv208 is OK: All packages OK [0:2:41] RECOVERY - ps1-a1-sdtpa-infeed-load-tower-B-phase-Y on ps1-a1-sdtpa is OK: ps1-a1-sdtpa-infeed-load-tower-B-phase-Y OK - 750 [0:2:51] RECOVERY - ps1-a1-sdtpa-infeed-load-tower-B-phase-Z on ps1-a1-sdtpa is OK: ps1-a1-sdtpa-infeed-load-tower-B-phase-Z OK - 625 [0:3:1] RECOVERY - SSH on mw21 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:3:1] RECOVERY - DPKG on srv278 is OK: All packages OK [0:3:1] RECOVERY - Apache HTTP on srv203 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [0:3:1] RECOVERY - DPKG on srv225 is OK: All packages OK [0:3:11] RECOVERY - ps1-b2-eqiad-infeed-load-tower-A-phase-X on ps1-b2-eqiad is OK: ps1-b2-eqiad-infeed-load-tower-A-phase-X OK - 725 [0:3:21] RECOVERY - DPKG on mw56 is OK: All packages OK [0:3:31] RECOVERY - Apache HTTP on mw20 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [0:3:31] RECOVERY - Apache HTTP on mw26 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [0:3:41] RECOVERY - DPKG on mw51 is OK: All packages OK [0:3:41] RECOVERY - Apache HTTP on srv232 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [0:3:51] RECOVERY - Apache HTTP on mw59 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [0:4:11] RECOVERY - Apache HTTP on mw7 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.028 second response time [0:4:31] RECOVERY - SSH on mw6 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:4:31] RECOVERY - Apache HTTP on srv243 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.029 second response time [0:4:41] RECOVERY - DPKG on mw32 is OK: All packages OK [0:4:51] RECOVERY - SSH on srv203 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:5:1] RECOVERY - SSH on srv259 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:5:1] RECOVERY - SSH on mw36 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:5:1] RECOVERY - SSH on srv242 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:5:11] RECOVERY - RAID on srv247 is OK: OK: no RAID installed [0:5:11] RECOVERY - DPKG on mw48 is OK: All packages OK [0:5:31] RECOVERY - Apache HTTP on mw54 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [0:5:31] RECOVERY - ps1-d3-sdtpa-infeed-load-tower-A-phase-Z on ps1-d3-sdtpa is OK: ps1-d3-sdtpa-infeed-load-tower-A-phase-Z OK - 1650 [0:5:51] RECOVERY - RAID on srv211 is OK: OK: no RAID installed [0:5:51] RECOVERY - ps1-b7-eqiad-infeed-load-tower-A-phase-X on ps1-b7-eqiad is OK: ps1-b7-eqiad-infeed-load-tower-A-phase-X OK - 613 [0:6:1] RECOVERY - Apache HTTP on mw30 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.033 second response time [0:6:1] RECOVERY - RAID on mw55 is OK: OK: no RAID installed [0:6:11] RECOVERY - ps1-a2-sdtpa-infeed-load-tower-A-phase-Z on ps1-a2-sdtpa is OK: ps1-a2-sdtpa-infeed-load-tower-A-phase-Z OK - 1025 [0:6:31] RECOVERY - Disk space on srv275 is OK: DISK OK [0:6:41] RECOVERY - RAID on srv272 is OK: OK: no RAID installed [0:7:1] RECOVERY - Apache HTTP on srv205 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.026 second response time [0:7:11] RECOVERY - RAID on mw54 is OK: OK: no RAID installed [0:7:51] RECOVERY - SSH on srv273 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:7:51] RECOVERY - Apache HTTP on mw5 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [0:8:1] RECOVERY - RAID on mw1 is OK: OK: no RAID installed [0:8:11] RECOVERY - DPKG on srv238 is OK: All packages OK [0:8:21] RECOVERY - RAID on srv235 is OK: OK: no RAID installed [0:8:21] RECOVERY - ps1-a3-eqiad-infeed-load-tower-A-phase-Y on ps1-a3-eqiad is OK: ps1-a3-eqiad-infeed-load-tower-A-phase-Y OK - 713 [0:8:21] RECOVERY - DPKG on srv232 is OK: All packages OK [0:8:31] RECOVERY - ps1-a4-sdtpa-infeed-load-tower-A-phase-Z on ps1-a4-sdtpa is OK: ps1-a4-sdtpa-infeed-load-tower-A-phase-Z OK - 1288 [0:8:51] RECOVERY - Apache HTTP on srv210 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [0:9:11] RECOVERY - SSH on mw18 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:9:11] RECOVERY - ps1-d3-pmtpa-infeed-load-tower-B-phase-X on ps1-d3-pmtpa is OK: ps1-d3-pmtpa-infeed-load-tower-B-phase-X OK - 50 [0:9:11] RECOVERY - RAID on srv201 is OK: OK: no RAID installed [0:9:11] RECOVERY - DPKG on srv259 is OK: All packages OK [0:9:21] RECOVERY - SSH on srv278 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:10:21] RECOVERY - RAID on srv259 is OK: OK: no RAID installed [0:13:11] RECOVERY - RAID on srv265 is OK: OK: no RAID installed [0:15:31] RECOVERY - SSH on mw16 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:16:11] RECOVERY - RAID on mw31 is OK: OK: no RAID installed [0:16:31] RECOVERY - DPKG on srv245 is OK: All packages OK [0:17:1] RECOVERY - Apache HTTP on mw29 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.025 second response time [0:18:11] RECOVERY - RAID on srv231 is OK: OK: no RAID installed [0:18:11] RECOVERY - Apache HTTP on srv231 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.029 second response time [0:18:41] RECOVERY - RAID on srv274 is OK: OK: no RAID installed [0:19:31] RECOVERY - Disk space on srv247 is OK: DISK OK [0:22:41] RECOVERY - Apache HTTP on srv269 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.020 second response time [0:22:51] RECOVERY - DPKG on mw36 is OK: All packages OK [0:25:51] RECOVERY - Disk space on srv285 is OK: DISK OK [0:27:21] RECOVERY - ps1-b5-eqiad-infeed-load-tower-A-phase-Y on ps1-b5-eqiad is OK: ps1-b5-eqiad-infeed-load-tower-A-phase-Y OK - 613 [0:28:1] RECOVERY - SSH on mw31 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:28:1] RECOVERY - DPKG on mw31 is OK: All packages OK [0:30:21] RECOVERY - RAID on srv271 is OK: OK: no RAID installed [0:31:51] RECOVERY - DPKG on mw47 is OK: All packages OK [0:34:31] RECOVERY - Apache HTTP on srv237 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.037 second response time [0:35:21] RECOVERY - SSH on srv289 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:35:41] RECOVERY - SSH on mw4 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:36:11] PROBLEM - Puppet freshness on mw1049 is CRITICAL: Puppet has not run in the last 10 hours [0:39:21] RECOVERY - Apache HTTP on mw11 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [0:40:41] RECOVERY - ps1-b3-sdtpa-infeed-load-tower-A-phase-X on ps1-b3-sdtpa is OK: ps1-b3-sdtpa-infeed-load-tower-A-phase-X OK - 1288 [0:40:41] RECOVERY - ps1-b1-eqiad-infeed-load-tower-B-phase-Z on ps1-b1-eqiad is OK: ps1-b1-eqiad-infeed-load-tower-B-phase-Z OK - 563 [0:41:11] RECOVERY - Disk space on mw48 is OK: DISK OK [0:42:11] RECOVERY - ps1-b3-eqiad-infeed-load-tower-A-phase-Z on ps1-b3-eqiad is OK: ps1-b3-eqiad-infeed-load-tower-A-phase-Z OK - 1050 [0:42:11] RECOVERY - ps1-b3-sdtpa-infeed-load-tower-A-phase-Y on ps1-b3-sdtpa is OK: ps1-b3-sdtpa-infeed-load-tower-A-phase-Y OK - 1425 [0:42:11] RECOVERY - ps1-b4-sdtpa-infeed-load-tower-A-phase-Z on ps1-b4-sdtpa is OK: ps1-b4-sdtpa-infeed-load-tower-A-phase-Z OK - 850 [0:42:11] RECOVERY - ps1-b5-eqiad-infeed-load-tower-B-phase-Y on ps1-b5-eqiad is OK: ps1-b5-eqiad-infeed-load-tower-B-phase-Y OK - 625 [0:42:11] RECOVERY - ps1-b4-eqiad-infeed-load-tower-B-phase-X on ps1-b4-eqiad is OK: ps1-b4-eqiad-infeed-load-tower-B-phase-X OK - 700 [0:42:21] RECOVERY - SSH on mw9 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:42:41] RECOVERY - RAID on srv261 is OK: OK: no RAID installed [0:43:31] RECOVERY - ps1-a3-sdtpa-infeed-load-tower-A-phase-Y on ps1-a3-sdtpa is OK: ps1-a3-sdtpa-infeed-load-tower-A-phase-Y OK - 1638 [0:44:11] RECOVERY - Disk space on srv268 is OK: DISK OK [0:44:11] RECOVERY - ps1-c1-sdtpa-infeed-load-tower-B-phase-Y on ps1-c1-sdtpa is OK: ps1-c1-sdtpa-infeed-load-tower-B-phase-Y OK - 563 [0:44:31] RECOVERY - ps1-b4-eqiad-infeed-load-tower-A-phase-X on ps1-b4-eqiad is OK: ps1-b4-eqiad-infeed-load-tower-A-phase-X OK - 638 [0:45:21] RECOVERY - Apache HTTP on mw55 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.017 second response time [0:45:41] RECOVERY - SSH on srv287 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:45:41] RECOVERY - Disk space on srv289 is OK: DISK OK [0:45:41] RECOVERY - DPKG on srv287 is OK: All packages OK [0:46:41] RECOVERY - ps1-b6-eqiad-infeed-load-tower-A-phase-Y on ps1-b6-eqiad is OK: ps1-b6-eqiad-infeed-load-tower-A-phase-Y OK - 275 [0:48:31] RECOVERY - Disk space on srv196 is OK: DISK OK [0:48:31] RECOVERY - Disk space on srv259 is OK: DISK OK [0:49:1] RECOVERY - Disk space on srv202 is OK: DISK OK [0:49:1] RECOVERY - ps1-a5-eqiad-infeed-load-tower-B-phase-X on ps1-a5-eqiad is OK: ps1-a5-eqiad-infeed-load-tower-B-phase-X OK - 950 [0:49:51] RECOVERY - DPKG on srv194 is OK: All packages OK [0:50:31] RECOVERY - DPKG on srv265 is OK: All packages OK [0:50:41] RECOVERY - SSH on srv237 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:50:52] RECOVERY - Apache HTTP on srv287 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.021 second response time [0:51:3] RECOVERY - DPKG on mw55 is OK: All packages OK [0:51:11] RECOVERY - RAID on mw59 is OK: OK: no RAID installed [0:51:11] RECOVERY - ps1-a1-eqiad-infeed-load-tower-A-phase-X on ps1-a1-eqiad is OK: ps1-a1-eqiad-infeed-load-tower-A-phase-X OK - 464 [0:51:11] RECOVERY - ps1-a1-eqiad-infeed-load-tower-B-phase-Z on ps1-a1-eqiad is OK: ps1-a1-eqiad-infeed-load-tower-B-phase-Z OK - 358 [0:53:11] RECOVERY - SSH on mw58 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:53:31] RECOVERY - Disk space on srv207 is OK: DISK OK [0:55:1] RECOVERY - DPKG on srv200 is OK: All packages OK [0:55:11] RECOVERY - Apache HTTP on srv286 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.022 second response time [0:56:41] RECOVERY - DPKG on srv261 is OK: All packages OK [0:56:50] RECOVERY - Apache HTTP on mw46 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.027 second response time [0:57:1] RECOVERY - RAID on srv208 is OK: OK: no RAID installed [0:58:41] RECOVERY - SSH on srv264 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [0:59:51] RECOVERY - ps1-a7-eqiad-infeed-load-tower-B-phase-X on ps1-a7-eqiad is OK: ps1-a7-eqiad-infeed-load-tower-B-phase-X OK - 638 [1:0:41] RECOVERY - RAID on srv200 is OK: OK: no RAID installed [1:1:1] RECOVERY - ps1-c3-sdtpa-infeed-load-tower-A-phase-X on ps1-c3-sdtpa is OK: ps1-c3-sdtpa-infeed-load-tower-A-phase-X OK - 263 [1:1:1] RECOVERY - ps1-a2-eqiad-infeed-load-tower-B-phase-X on ps1-a2-eqiad is OK: ps1-a2-eqiad-infeed-load-tower-B-phase-X OK - 600 [1:2:0] RECOVERY - SSH on srv202 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [1:2:41] RECOVERY - Apache HTTP on mw19 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.017 second response time [1:2:51] RECOVERY - Disk space on srv261 is OK: DISK OK [1:2:51] RECOVERY - RAID on srv240 is OK: OK: no RAID installed [1:3:11] RECOVERY - Disk space on srv244 is OK: DISK OK [1:3:11] RECOVERY - SSH on mw22 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [1:3:21] RECOVERY - RAID on mw58 is OK: OK: no RAID installed [1:3:32] RECOVERY - Disk space on srv269 is OK: DISK OK [1:3:32] RECOVERY - DPKG on srv212 is OK: All packages OK [1:3:50] RECOVERY - Apache HTTP on srv199 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.023 second response time [1:4:51] RECOVERY - DPKG on srv213 is OK: All packages OK [1:5:0] RECOVERY - ps1-b1-sdtpa-infeed-load-tower-B-phase-Z on ps1-b1-sdtpa is OK: ps1-b1-sdtpa-infeed-load-tower-B-phase-Z OK - 823 [1:5:31] RECOVERY - Disk space on srv278 is OK: DISK OK [1:5:31] RECOVERY - ps1-b7-eqiad-infeed-load-tower-B-phase-Y on ps1-b7-eqiad is OK: ps1-b7-eqiad-infeed-load-tower-B-phase-Y OK - 613 [1:6:21] RECOVERY - Disk space on srv234 is OK: DISK OK [1:9:20] RECOVERY - Disk space on srv204 is OK: DISK OK [1:10:0] RECOVERY - DPKG on mw7 is OK: All packages OK [1:10:21] RECOVERY - RAID on mw5 is OK: OK: no RAID installed [1:10:30] RECOVERY - ps1-a2-sdtpa-infeed-load-tower-B-phase-Z on ps1-a2-sdtpa is OK: ps1-a2-sdtpa-infeed-load-tower-B-phase-Z OK - 1000 [1:10:41] RECOVERY - RAID on mw28 is OK: OK: no RAID installed [1:11:20] RECOVERY - ps1-a6-eqiad-infeed-load-tower-B-phase-Y on ps1-a6-eqiad is OK: ps1-a6-eqiad-infeed-load-tower-B-phase-Y OK - 638 [1:12:20] RECOVERY - ps1-b2-eqiad-infeed-load-tower-B-phase-X on ps1-b2-eqiad is OK: ps1-b2-eqiad-infeed-load-tower-B-phase-X OK - 588 [1:13:50] RECOVERY - Apache HTTP on srv235 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.024 second response time [1:14:30] RECOVERY - ps1-b5-sdtpa-infeed-load-tower-A-phase-Z on ps1-b5-sdtpa is OK: ps1-b5-sdtpa-infeed-load-tower-A-phase-Z OK - 1913 [1:15:30] RECOVERY - ps1-b2-eqiad-infeed-load-tower-A-phase-Z on ps1-b2-eqiad is OK: ps1-b2-eqiad-infeed-load-tower-A-phase-Z OK - 613 [1:18:1] RECOVERY - DPKG on mw59 is OK: All packages OK [1:32:55] New patchset: Bhartshorne; "Created a hash with all the variables necessary for a swift cluster. Passed this hash around so various templates will have access to it. Changed the templates to read from the hash rather than plain variables." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1548 [1:37:15] New patchset: Bhartshorne; "Created a hash with all the variables necessary for a swift cluster. Passed this hash around so various templates will have access to it. Changed the templates to read from the hash rather than plain variables." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1548 [1:40:31] RECOVERY - ps1-d3-pmtpa-infeed-load-tower-A-phase-Y on ps1-d3-pmtpa is OK: ps1-d3-pmtpa-infeed-load-tower-A-phase-Y OK - 1100 [1:41:8] New patchset: Bhartshorne; "Created a hash with all the variables necessary for a swift cluster. Passed this hash around so various templates will have access to it. Changed the templates to read from the hash rather than plain variables." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1548 [1:41:11] RECOVERY - SSH on srv272 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [1:42:49] New patchset: Bhartshorne; "Created a hash with all the variables necessary for a swift cluster. Passed this hash around so various templates will have access to it. Changed the templates to read from the hash rather than plain variables." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1548 [1:43:55] New patchset: Bhartshorne; "Created a hash with all the variables necessary for a swift cluster. Passed this hash around so various templates will have access to it. Changed the templates to read from the hash rather than plain variables." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1548 [1:46:41] RECOVERY - DPKG on srv199 is OK: All packages OK [1:47:32] RECOVERY - SSH on srv199 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [1:48:3] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1548 [1:48:4] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1548 [1:52:53] New patchset: Bhartshorne; "inverted quote paren typo" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1549 [1:53:42] RECOVERY - RAID on mw49 is OK: OK: no RAID installed [1:53:47] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1549 [1:53:48] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1549 [2:2:22] RECOVERY - Disk space on mw58 is OK: DISK OK [2:4:1] RECOVERY - Disk space on mw21 is OK: DISK OK [2:7:3] New patchset: Bhartshorne; "backing out just the memcache servers to a local varibale" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1550 [2:7:38] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1550 [2:7:38] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1550 [2:9:22] RECOVERY - ps1-b7-eqiad-infeed-load-tower-B-phase-X on ps1-b7-eqiad is OK: ps1-b7-eqiad-infeed-load-tower-B-phase-X OK - 625 [2:12:12] RECOVERY - RAID on mw30 is OK: OK: no RAID installed [2:13:12] RECOVERY - ps1-a3-sdtpa-infeed-load-tower-A-phase-X on ps1-a3-sdtpa is OK: ps1-a3-sdtpa-infeed-load-tower-A-phase-X OK - 1525 [2:15:12] RECOVERY - DPKG on srv264 is OK: All packages OK [2:15:52] RECOVERY - Apache HTTP on srv200 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.031 second response time [2:20:22] RECOVERY - Disk space on mw31 is OK: DISK OK [2:44:52] New patchset: Bhartshorne; "trying to get rid of the global variable" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1551 [2:45:5] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1551 [2:46:40] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1551 [2:46:41] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1551 [3:0:18] New patchset: Jgreen; "collect exim stats out of log and submit via gmetric" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1552 [3:0:29] New patchset: Jgreen; "install collect_exim_stats_via_gmetric on misc::fundraising servers" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1553 [3:1:10] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1552 [3:1:10] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1552 [3:2:24] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1553 [3:2:25] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1553 [3:5:59] RECOVERY - ps1-b2-sdtpa-infeed-load-tower-A-phase-Z on ps1-b2-sdtpa is OK: ps1-b2-sdtpa-infeed-load-tower-A-phase-Z OK - 1013 [3:6:10] New patchset: Jgreen; "gar, wrong file location" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1554 [3:6:33] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1554 [3:6:33] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1554 [3:31:49] RECOVERY - ps1-a5-sdtpa-infeed-load-tower-A-phase-Z on ps1-a5-sdtpa is OK: ps1-a5-sdtpa-infeed-load-tower-A-phase-Z OK - 2075 [4:0:12] RECOVERY - RAID on mw4 is OK: OK: no RAID installed [4:5:1] PROBLEM - Puppet freshness on es1002 is CRITICAL: Puppet has not run in the last 10 hours [4:49:32] RECOVERY - ps1-b1-sdtpa-infeed-load-tower-A-phase-X on ps1-b1-sdtpa is OK: ps1-b1-sdtpa-infeed-load-tower-A-phase-X OK - 504 [5:17:14] RECOVERY - DPKG on srv202 is OK: All packages OK [6:39:55] ctw2: i owe you a reply... looks like it will be at least another day before I do it. (i has out of town visitors here!) sorry and thanks for your mail! [6:40:1] * jeremyb ACTION heads to sleeop [6:40:4] sleep* [6:40:9] hi [6:40:11] np [6:40:20] anyway.no reply needed [6:40:27] well i wrote half of one [6:41:14] *detaches screen* [6:41:15] good news - deployment took place without a hitch this afternoon [6:42:9] woosters: yes, i'm the one that bugged kat about removing the sitenotice about the window :) [6:42:25] woosters: and https://otrs-wiki.wikimedia.org/w/index.php?curid=5169&diff=30393&oldid=30362 :) [6:42:50] *detaches for real* [6:43:35] thiks :-) [8:14:7] PROBLEM - Puppet freshness on mw1062 is CRITICAL: Puppet has not run in the last 10 hours [8:14:7] PROBLEM - Puppet freshness on mw1083 is CRITICAL: Puppet has not run in the last 10 hours [8:14:7] PROBLEM - Puppet freshness on mw1055 is CRITICAL: Puppet has not run in the last 10 hours [8:14:7] PROBLEM - Puppet freshness on mw1063 is CRITICAL: Puppet has not run in the last 10 hours [8:14:8] PROBLEM - Puppet freshness on mw1122 is CRITICAL: Puppet has not run in the last 10 hours [8:14:8] PROBLEM - Puppet freshness on mw1121 is CRITICAL: Puppet has not run in the last 10 hours [8:14:8] PROBLEM - Puppet freshness on mw1140 is CRITICAL: Puppet has not run in the last 10 hours [8:14:9] PROBLEM - Puppet freshness on mw1159 is CRITICAL: Puppet has not run in the last 10 hours [8:14:9] PROBLEM - Puppet freshness on mw1148 is CRITICAL: Puppet has not run in the last 10 hours [8:14:10] PROBLEM - Puppet freshness on mw1156 is CRITICAL: Puppet has not run in the last 10 hours [8:14:10] PROBLEM - Puppet freshness on snapshot4 is CRITICAL: Puppet has not run in the last 10 hours [8:14:11] PROBLEM - Puppet freshness on virt1 is CRITICAL: Puppet has not run in the last 10 hours [8:14:11] PROBLEM - Puppet freshness on virt2 is CRITICAL: Puppet has not run in the last 10 hours [8:25:58] PROBLEM - HTTP on sodium is CRITICAL: Connection refused [9:6:10] PROBLEM - Puppet freshness on ssl3002 is CRITICAL: Puppet has not run in the last 10 hours [9:6:10] PROBLEM - Puppet freshness on ssl2 is CRITICAL: Puppet has not run in the last 10 hours [9:6:10] PROBLEM - Puppet freshness on ssl3004 is CRITICAL: Puppet has not run in the last 10 hours [9:15:40] PROBLEM - Host hooft is DOWN: PING CRITICAL - Packet loss = 100% [9:17:10] PROBLEM - Host knsq16 is DOWN: PING CRITICAL - Packet loss = 100% [9:17:20] PROBLEM - Host knsq11 is DOWN: PING CRITICAL - Packet loss = 100% [9:23:50] PROBLEM - Host knsq29 is DOWN: PING CRITICAL - Packet loss = 100% [9:33:25] RECOVERY - Host knsq29 is UP: PING OK - Packet loss = 0%, RTA = 109.42 ms [9:42:45] RECOVERY - Host hooft is UP: PING OK - Packet loss = 0%, RTA = 109.50 ms [9:42:45] RECOVERY - Host knsq16 is UP: PING OK - Packet loss = 0%, RTA = 109.35 ms [9:52:25] RECOVERY - Host knsq11 is UP: PING OK - Packet loss = 0%, RTA = 109.25 ms [9:57:55] PROBLEM - Puppet freshness on ssl3003 is CRITICAL: Puppet has not run in the last 10 hours [11:44:20] PROBLEM - Puppet freshness on mw1049 is CRITICAL: Puppet has not run in the last 10 hours [12:38:51] PROBLEM - Puppet freshness on copper is CRITICAL: Puppet has not run in the last 10 hours [12:38:51] PROBLEM - Puppet freshness on zinc is CRITICAL: Puppet has not run in the last 10 hours [13:1:3] New review: Mark Bergsma; "Please only use scoped variables in templates from now on - the next Puppet version won't support th..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1416 [13:4:2] New review: Mark Bergsma; "Please remove the template file from the repository as well (if you haven't already)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1418 [13:6:38] New review: Mark Bergsma; "Please use only tabs for indentation of those files" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1437 [13:9:21] New review: Mark Bergsma; "Please use only scoped variables from now on - this won't work in the next Puppet version." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1451 [13:17:15] New review: Mark Bergsma; "Can you migrate this into media-storage.pp please? :)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1460 [13:24:2] New review: Mark Bergsma; "small typo in certs.pp, commented inline" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1466 [13:26:40] New review: Mark Bergsma; "Please convert the variables to scoped lookup or (better) use parameterized classes" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1471 [13:38:14] New review: Mark Bergsma; "Please make all variable lookups scoped (see my ops@ mail)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1495 [14:36:12] New patchset: Catrope; "puppet -> mediawiki" [test/mediawiki/core] (master) - https://gerrit.wikimedia.org/r/1555 [14:40:54] New review: Demon; "(no comment)" [test/mediawiki/core] (master); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1555 [14:40:55] Change merged: Demon; [test/mediawiki/core] (master) - https://gerrit.wikimedia.org/r/1555 [14:48:50] PROBLEM - Puppet freshness on es1002 is CRITICAL: Puppet has not run in the last 10 hours [15:1:4] New review: Mark Bergsma; "The virtual host file is missing in this change" [operations/puppet] (production); V: 0 C: -1; - https://gerrit.wikimedia.org/r/1497 [15:1:13] apergos, about/busy? [15:3:16] it's what, 5pm now? [15:3:33] In Greece, yes [15:4:2] I'm guessing the /away suggests probably not [15:4:26] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1516 [15:4:27] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1516 [15:5:18] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1512 [15:5:19] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1512 [15:7:16] New review: Mark Bergsma; "Let's pull in the minimum we need then." [operations/puppet] (production); V: 0 C: 0; - https://gerrit.wikimedia.org/r/1319 [15:8:48] New review: Mark Bergsma; "Please merge both virtual hosts into one file, and fix the tabbing before you commit" [operations/puppet] (production); V: 0 C: -1; - https://gerrit.wikimedia.org/r/1433 [15:10:26] New review: Mark Bergsma; "You're using "require" twice, that won't work" [operations/puppet] (production); V: 0 C: -2; - https://gerrit.wikimedia.org/r/1302 [15:11:17] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1472 [15:11:17] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1472 [15:12:30] New review: Mark Bergsma; "The job queue a critical service? Why?" [operations/puppet] (production); V: 0 C: 0; - https://gerrit.wikimedia.org/r/1473 [15:16:48] New patchset: Mark Bergsma; "Rename varnish3 classes to varnish" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/863 [15:17:0] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/863 [15:24:17] New patchset: Mark Bergsma; "First pass at organizing misc-servers.pp as individual files in misc/ RT #720 Used misc::bastionhost as a proof of concept (rebased with one minor doc tweak)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1558 [15:24:29] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1558 [15:24:54] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1558 [15:24:54] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1558 [15:27:17] New patchset: Mark Bergsma; "Move misc::contint into its own file under misc/" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1559 [15:27:29] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1559 [15:27:42] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1559 [15:27:42] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1559 [15:30:30] Reedy: [15:30:33] New patchset: Mark Bergsma; "Revert "Move misc::contint into its own file under misc/"" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1560 [15:30:45] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1560 [15:30:45] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1560 [15:30:55] I'm takin frequent breaks today ... actually, not quite true. I'm taking sporadic but long breaks as I try to gt my shoulder to behave better [15:31:15] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1560 [15:31:15] so I should be here again for the next couple hours, then as it gets to the point where I can't stand it again, another break [15:31:16] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1560 [15:31:56] mark: I got an issue since you merged production in test [15:32:0] must be something minor [15:32:8] ok [15:32:11] Error 400 on SERVER: Could not parse for environment production: No file(s) found for import of '../private/manifests/mail.pp' at /etc/puppet/manifests/base.pp:13 on node i-0000009d.pmtpa.wmflabs [15:32:16] can you move misc::contint into "misc/contint.pp"? [15:32:30] ah right [15:33:2] is it because the private repo need a merge of production to test ? [15:33:8] no [15:33:9] lemme fix [15:34:50] New patchset: Mark Bergsma; "Add empty mail.pp" [labs/private] (master) - https://gerrit.wikimedia.org/r/1561 [15:34:51] New review: gerrit2; "Lint check passed." [labs/private] (master); V: 1 - https://gerrit.wikimedia.org/r/1561 [15:34:59] New review: Mark Bergsma; "(no comment)" [labs/private] (master); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1561 [15:35:0] Change merged: Mark Bergsma; [labs/private] (master) - https://gerrit.wikimedia.org/r/1561 [15:35:52] mark: looks good thanks! [15:37:56] apergos, so, I've cleaned up the rotatebot scripts a little bit and variablised stuff... I'm just wondering if you've any suggestions from where to run it on the cluster? Will need libjpeg-progs installing.. Then the other is just a set of perl scripts. Thanks [15:38:19] hmm [15:38:20] mark: migrating misc::content is in my TODO list. [15:39:24] what does it do? i.e.... wgets? imagemagick? db quereies? [15:39:42] queries [15:40:6] I think it uses fget in most places... Does do some db queries, no imagemagick, just uss exiftools and libjpeg-progs [15:40:7] New patchset: Dzahn; "fully puppetize check_job_queue with custom check interval" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1473 [15:40:32] how much memory do you think it wants? [15:40:58] in the script they're limiting it to 100 meg [15:40:58] New review: Dzahn; "changed it to critical=>false." [operations/puppet] (production); V: 1 C: 1; - https://gerrit.wikimedia.org/r/1473 [15:41:3] ok [15:41:16] so plus a bit more for the shell outs [15:41:29] how does it upload the rotated image? [15:42:27] socket calls, fsockopen, fputs etc [15:42:49] right [15:43:49] and last q, is this meant to run long-term or for how long do we think? [15:45:27] https://commons.wikimedia.org/wiki/User_talk:Rotatebot#Rotatebot_and_the_WMF [15:45:58] If you read the last comment, that gives some estimates on number of rotates to be done [15:46:6] as soon as it opens ni my browser [15:46:14] I need a quad core machine over here ;-/ [15:46:27] or maybe 6 core, that would usually cover me [15:46:44] so 5 cores can then sit idle? :D [15:46:47] no [15:46:56] I run these parallel processes when testing dump runs [15:47:24] they're quite cpu heavy [15:47:47] ahh, true [15:47:58] didn't realise you did those tests locally [15:48:4] and then it's ahrd to actually do anything else... with a 2 core system :-D [15:48:19] apergos, renice? [15:48:24] yes, I run piles of things locally before I even think about going to test on the cluster [15:48:35] Reedy, where are those scripts? [15:48:35] Platonides: I want them to finish ... [15:48:42] and reniced guys always finish last :-D [15:48:59] sure, but you could do things in between [15:49:44] /trunk/tools/rotatebot [15:50:16] updating [15:50:40] there's only around 28k images total with an orientation that mw pays attention to [15:51:8] at least if it's true that it only matters for orientation 3 6 and 8 [15:51:22] on commons of cours [15:51:23] e [15:52:4] ok so next q [15:52:32] is running out of a labs instance sensible? would we get decent network speed for it, I mean? [15:52:57] of course you should get a decent labs instance for it [15:53:4] for testing [15:53:59] if you would have the same network speed as running directly on a cluster host, that seems like the sane approach, this is exactly what the platform is for (among other things) [15:54:35] I mean, I could stick it on eg snapshot4 and run it for two weeks untl the next en run but that's a kind of crappy approach [15:54:55] labs instances are gbit connected as well [15:55:0] great [15:55:17] but why does that matter, it's for testing dumping, not actually doing production runs, right? [15:55:22] no, not for dumps [15:55:28] this is for the rotatebot [15:55:30] New patchset: Jgreen; "opening exim throttles for fundraising mail servers" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1562 [15:55:37] I don't know what that is [15:55:43] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1562 [15:55:49] a bot which rotates (images) [15:56:0] ah [15:56:2] so mw 1.18 now pays attention to exif rotation datain images [15:56:9] where is it running now? [15:56:14] New review: Jgreen; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1562 [15:56:14] Change merged: Jgreen; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1562 [15:56:25] which would be just fine but some apps that rotate image are broken about they way they handle exif data [15:56:37] <^demon|away> People should just stand upright when taking their pictures ;-) [15:56:37] so now new thumbs... are broken, some of them (for old uploads) [15:56:38] New patchset: Dzahn; "handle locales via puppet" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1302 [15:56:43] mark, externally in toolserver [15:56:49] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1302 [15:57:24] part of the point of labs was to be a toolserver-like thing, where people could run bots etc like this [15:57:25] New review: Dzahn; "dropped one "require". we require the package anyways" [operations/puppet] (production); V: 1 C: 1; - https://gerrit.wikimedia.org/r/1302 [15:58:7] ugh, it edits with the GUI interface [15:58:8] yes, but i'm not sure ryan wants to have that stuff yet [15:58:13] I don't know [15:58:14] labs may not be really ready yet [15:58:23] I guess we can ask, if it isn't, it isn't [15:58:29] I thought you were talking about the stuff you were testing locally, which you needed a quad core host for [15:58:34] THAT should be done in a labs instance [15:58:34] no :-D [15:58:57] you're good at switching subjects subtly so noone has any clue what you're talking about ;p [15:58:58] New review: Dzahn; "ok, but should i also move this into it's own file in /misc/ now?" [operations/puppet] (production); V: 0 C: 0; - https://gerrit.wikimedia.org/r/1433 [15:59:14] mysql_connect($databanknames, $userloginname, $databasepw) or suicide ("Can't connect to MySQL"); lol [15:59:26] I didn't switch topics. it was part of the conversation reedy and I were having :-P [15:59:38] so bad to rotate bots [15:59:40] bot [15:59:43] *back [15:59:49] heh [16:0:0] Platonides, I know. It's not a long term thing, so I'm not attempting to rewrite it :p [16:0:8] if ryan says he's not ready, where do we put stuff like these ? hume? [16:0:24] it doesn't eat so much memory... [16:0:25] hrmf. git diff on sockpuppet is picking up a ton of changes that I didn't pass through gerrit [16:0:30] New review: Mark Bergsma; "You can now, if you want to. We can also aggregate vhosts that are on the same server into one file ..." [operations/puppet] (production); V: 0 C: -1; - https://gerrit.wikimedia.org/r/1433 [16:0:38] that would be me jeff [16:0:41] I can merge if you want [16:0:59] sure, or I will if you know it's your wtuff [16:1:2] er stuff [16:1:38] all done [16:1:42] cool. thx [16:1:58] that bot is using a db connection to check that the user adding the rotate template is autoconfirmed [16:2:18] labs doesn't have access to the databases (yet) [16:2:50] it doesn'thave copies; I thought it had access to the production dbs [16:2:51] although it could easily be replaced with a check that it is not an anon [16:3:7] I thought it was filtered [16:3:20] another q for ryan [16:3:23] Ryan said we can use MariaDB and (at least right now) there is no "no external repo" policy in labs [16:3:25] no it doesn't [16:3:29] and won't for another while [16:3:44] so i would be adding their repo and install that on a labs instance, instead of mysql [16:4:1] mark: if not labs, could this run on hume? does that make sense? [16:4:18] yes, but hume is ops/staff only [16:4:27] so only if you manage it [16:4:31] but sure, that's what hume is for [16:4:43] I guess I'm assuming that Reedy would keep an eye on it (sorry Reedy) [16:4:51] reedy has access too I think [16:5:49] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/1302 [16:6:19] ok, i'll fix that [16:6:43] looks like he does [16:8:12] New review: Mark Bergsma; "(no comment)" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/863 [16:8:12] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/863 [16:8:22] Yeah, I've got hume acccess [16:10:37] ok, and libjpeg-progs is in place for you... [16:10:44] wanna make sure the bot runs ok? [16:11:12] New patchset: Dzahn; "handle locales via puppet" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1302 [16:11:46] New review: Dzahn; "fixed path conflict" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1302 [16:11:47] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1302 [16:13:47] Reedy: [16:13:49] New review: Demon; "(no comment)" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/1494 [16:14:8] Sorry, went to grab a snack [16:14:25] Thanks. Will do in a few, got a feeling I might need a hammar ;0 [16:14:28] ;) [16:15:50] New patchset: Dzahn; "can we have nagios-plugins-basic installed on all monitoring::hosts via base?" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1319 [16:16:1] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1319 [16:16:45] ok [16:16:49] New review: Dzahn; "changed it to nagios-plugins-basic, this does not contain the SMB check which pulls Samba stuff" [operations/puppet] (production); V: 0 C: 1; - https://gerrit.wikimedia.org/r/1319 [16:16:57] if you ping me and I'm not around it means I'm on the next shoulder break [16:24:31] OTRS is broken... [16:24:35] who's around? [16:24:55] (seems to have no new mail since the window) [16:25:3] i just sent 2 test msgs myself [16:29:53] RobH: mark: ^^^ ? [16:30:24] (binasher's window but he seems to not be here) [16:30:24] since what window? [16:30:36] 22-24 UTC yesterday [16:30:46] otrs moved off of db9/db10 [16:31:1] no mail has worked since then? [16:31:15] someone complained on list and afaict they are correct [16:31:24] the newest mail *i* can see is 20 hours old [16:31:41] ahh, 22:01 binasher: exim on mchenry is now using db48/49 for otrs mail verification [16:33:42] jeremyb: folks are lookin into it now, thank you for bringing it up =] [16:33:59] i am headed into datacenter, but folks are checkin =] [16:34:7] back online shortly [16:34:8] RobH: i see. well someone may want to respond to otrs-en-l [16:34:18] * jeremyb ACTION also has to head offline [16:34:46] !log Fixed database entries in william's exim.conf [16:34:55] Logged the message, Master [16:35:47] that seems to have done *something* [16:42:7] mark: is there a spool somewhere that can be flushed? or they were all lost you think? [16:42:30] (my first 2 test messages haven't shown up. my 3rd (after your !log) did show) [16:42:42] they're probably in a mail spool [16:43:0] exim will gradually deliver them [16:45:50] ahh, a trickle is appearing [16:50:4] mark: thanks! /me will bbl [16:54:6] Did anyone ever fix that stats page on the donation drive after brandon linked it to reddit and accidentally fried half the infrastructure? [16:55:36] New patchset: Dzahn; "start minimal bugzilla class, move to own file, existing bz config, fixed tabs" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1497 [16:55:48] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1497 [16:56:37] yes, folks fixed it up [16:57:40] awesome, I wanted to look at the stats, but by the time I saw the link, reddit was already like "OMG ZERG RUSH" and killed the server(s) it was running on [17:4:32] mark: well, some people started the bot stuff early [17:4:55] as long as people know that we're still in closed beta, and their bot infrastructure might die for any reason, it's fine if they run them there [17:5:2] fine with me ;) [17:5:14] eh we've already arranged for reedy to run it on hume [17:5:39] meh and there he goes [17:6:2] huh and suddenly my shoulder does *not hurt* wtf [17:6:28] I'm in my vulture perch position... weird [17:17:32] maplebed: I am almost done with a proposed change [17:17:39] ossm. [17:17:51] this allows me to nicely demonstrate what I mean by "role classes" [17:22:55] btw, if you want to refer to hash from a template, and use a scoped lookup, use: scope.lookupvar('swift::storage::config::cluster_settings')['hash_key'] [17:23:44] unfortunately, puppet's features for hashes and function parameters are not as extended as e.g. python with dics and keyword parameters [17:23:46] you know, that was how I had written it first, but I asked in #puppet and was told to put the key inside the scope lookup. [17:23:49] because that's what you'd really wish for here [17:24:0] that was going to be the first thincg I tried today. [17:24:22] well, with pain in my heart in my new proposal I got rid of the hash again, since this allows me to use class inheritance [17:24:32] noo!!!! [17:24:32] I'll just submit it to gerrit, let me know if you agree [17:24:35] we can discuss ;) [17:24:36] ::sigh:: [17:24:51] or we can do part of it in hashes or something [17:25:11] the thing I really liked about passing around a hash is that when I need to add another variable I don't need to chase it all the way through. add it in the node def. and it magically exists in the template. [17:25:25] well, I sorta changed that part also [17:25:30] (this as I went through 2 or 3 iterations of 'oh wait, I need that variable too.') [17:25:39] I disliked having to pass everything through [17:25:40] ok, I'll shut up till I see your diff. [17:25:57] so I'm now requiring swift::proxy::config to be done BEFORE swift::proxy, but let it be called from a role class [17:26:0] yeah, let me commit this [17:26:6] warning, untested [17:26:9] np [17:26:20] (since you can't actaully test it till you commit.) [17:26:32] indeed [17:26:39] later, in labs... ;) [17:26:59] yaeh, I saw your puppetmasterless lab conv. with jeff this morning. [17:28:38] good [17:28:46] New patchset: Mark Bergsma; "My proposal for handling swift configuration with role classes" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1563 [17:28:57] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/1563 [17:29:16] doesn't pass lint, but that's not very surprising [17:29:29] have a look at site.pp first [17:29:44] * maplebed ACTION looks [17:30:10] so instead of in a hash, you define classes for testing/production clusters as well [17:30:14] but ideally not in swift.pp, but in site.pp [17:30:23] like we do with varnish caches and squids and stuff [17:30:36] there, you pull in all required classes with the right parameters [17:30:40] now swift.pp is very generic [17:30:42] almost like a puppet module [17:30:54] almost anyone could use that manifest, it has very little wikimedia specific stuff in it [17:31:17] it just defines "I need swift::proxy::config to be run before I run" but doesn't actually pull it in, since it doesn't want to pass all parameters all the time [17:31:23] that's what the role classes do, swift-cluster::* [17:31:43] I would have used a hash like you did, if puppet supported inheritance or modifying them [17:31:46] but I don't think it does ;( [17:31:51] +1 rename role classes to role::foo [17:31:52] so I used class parameters instead [17:32:4] if puppet ever does, we can use a hash for everything again [17:33:2] so I decoupled configuration/passing of parameters from requirements, in swift.pp [17:33:15] that makes it not as annoying to use class parameters now [17:34:58] New patchset: Mark Bergsma; "My proposal for handling swift configuration with role classes" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1563 [17:35:12] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/1563 [17:35:21] i'll keep trying to fix the syntax errors in the mean time [17:35:44] mark: owa1-3 and ms1-3 both just include swift-cluster::pmtpa-test. what says that owa are proxies and ms are storage? [17:36:38] right,t hat's wrong [17:36:44] they should pull in ::proxy and ::storage respectively [17:36:46] let me change that [17:36:52] I also see another thing I need to fix... [17:37:8] you can't do default class params [17:37:11] the other thing I notice is that many more of the defaults need to be overridden in each of those classes... [17:37:30] ok, this is just a proof of concept [17:37:39] cool. [17:37:50] I don't know the swift details too well yet, so we'll have to tweak it [17:37:53] let me amend a bit more [17:38:21] can I cherry pick in the change and amend it as well? [17:38:26] i.e. can we both work on one changeset? [17:38:33] you don't need to, you can just branch off that changeset [17:38:41] I think [17:38:59] I mean, you need to branch of /refs/for/production/ [17:39:2] I don't thin k i'll try that at the moment. [17:39:10] besides, I haven't finished reading yet. [17:39:11] ok, let me fix this first then [17:39:14] I'll be gone in 20 mins anyway [17:39:18] you can then decide what to do with it [17:39:26] but it would be neat to understand whether we could work together on one changeset. [17:39:32] yeah [17:39:37] ideally we did this on a test branch etc... [17:39:40] but yeah [17:40:8] what you can do of course [17:40:8] is [17:40:13] we can restore the node definitions [17:40:21] although no [17:40:24] swift.pp has been changed [17:40:27] the old stuff no longer works ;( [17:40:29] no, let's just keep going. [17:41:58] bah [17:42:7] because of the split and shared proxy/storage stuff, i'll have to make these virtual... [17:45:42] I need to make sure that swift-cluster::base DOES define the default parameters of swift::proxy::config, but does NOT realize it [17:45:54] puppet doesn't support multiple inheritance, so I can't do that for both proxy and storage... [17:45:57] I can try virtual resources [17:46:39] I could set the noop option, although it's kind of lame [17:47:7] i'm gonna go with that until I think of something better :/ [17:48:23] New patchset: Mark Bergsma; "My proposal for handling swift configuration with role classes" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1563 [17:48:35] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1563 [17:48:46] yay [17:49:20] maplebed: a simple way to get that change is of course to merge it, quickly clone into your own local branch, then revert again in production if you need to [17:50:28] meh. [17:51:18] we can also get rid of the whole inheritance [17:51:30] and just define all parameters neatly in the role (sub)classes as shown [17:51:44] I like reusage, but if there's little reusage between cluster parameters anyway, then that doesn't matter that much [17:52:19] or we can research whether puppet does in fact allow you to add/override keys of dictionaries [17:52:21] I mean hashes [17:53:13] anyway, this setup somewhat resembles the "cache::mobile" and "cache::bits" setup of role classes higher up in site.pp [17:53:52] i hate the noop hack :P [17:54:30] I need to leave now [17:54:44] can you give me a brief what I should look at changing next? [17:54:48] to try and get it working? [17:54:53] I think you should try and see if this works at all, yes [17:54:58] it should, but it's based on the manual [17:55:1] I don't understand a lot of the stuff you're doing yet [17:55:3] it might fail in subtle ways [17:55:5] ok [17:55:9] I'll look at the cache stuff for examples. [17:55:15] just keep going over the change and try to understand it then [17:55:18] will you be back today? [17:55:20] yes [17:55:24] ok. [17:55:36] need to pick someone up at the train station now, and have food [17:55:37] thanks for your help; I'll see how much I can get done before you return. [17:55:38] will be back then [17:55:45] cool, happy to help [17:55:46] later! [17:59:5] Heya LeslieCarr, I am in eqiad. Is that labs switch good to be swapped into b3? [17:59:17] it is, how about i monitor just in case [17:59:58] ah I have 10 more mins it turns out [18:0:0] sounds good, I am going to go ahead and pull the labs switch out and relabel it [18:0:2] monitoring [18:0:6] cool [18:0:8] will ping you before i pull b3 =] [18:0:40] oh I see a nice fix [18:1:5] that allows me to get rid of the noop hack as well [18:2:6] we can use swift::proxy::config purely as a configuration object [18:2:7] it doesn't need to actually do anything [18:2:18] mark: I don't see how to vary hash_path_suffix per cluster. [18:2:21] then we can actually write out proxy.conf from swift::proxy, and only pull that in if we need it [18:2:28] maplebed: just override the base parameter below [18:2:28] I'll add an example in my next amendment [18:2:29] I'm gonna push it now [18:2:55] k. [18:4:19] New patchset: Mark Bergsma; "My proposal for handling swift configuration with role classes" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1563 [18:4:22] check [18:4:30] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/1563 [18:4:31] I haven't changed everything yet, no time now [18:4:37] but the hash path override is in there [18:5:21] it uses everything that is in swift-cluster::base, but you can override every class parameter [18:5:47] so this does not cause a conflict [18:5:54] normally this would, but it doesn't with inheritance [18:7:49] ok, leaving now [18:8:6] LeslieCarr: on racktables front, the new one seems to work fine [18:8:21] :) [18:11:9] ok, ready to kill existing asw-b3-eqiad, LeslieCarr confirm I can do this pls? [18:11:18] oh noes! poor asw-b3 ! [18:11:23] execute it [18:12:1] k, pulling it now [18:16:31] see the pull, it looks okay :) [18:20:2] New review: Bhartshorne; "first try..." [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1563 [18:20:2] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1563 [18:24:58] LeslieCarr: its all wired back up now, labs switch is now asw-b3-eqiad in racktables as well [18:25:12] so if its ok, i will go ahead and wire up the ciscos in that rack for use =] [18:25:27] maplebed: did you have an ES host in eqiad that you have not installed yet? [18:25:32] so i can test a controller in it? [18:25:40] RobH: I took es1002 down for testing. [18:25:44] you're welcome to i. [18:25:51] I don't think it successfully boots right now. [18:26:0] you mean into os? [18:26:9] yea i just wanna see if it can handle the controller card to see all the dirves right now [18:26:11] no os needed. [18:26:14] RobH: gimme a sec, it's not recognizing the chassis for some reason [18:26:19] yeah; I made a change to the raid card and the autoinstall loops. [18:26:33] (it installs the os then reboots into the installer again) [18:26:35] LeslieCarr: ok, i wont mess with it, lemme know if you need me to do something [18:26:50] maplebed: ok, snagging it now to do some testing, getting crash cart and such as well, brb [18:26:52] RobH: summary - you're welcome to do whatever you want to it, except log in right now (cuz that won't work) [18:26:54] :) [18:27:3] cool [18:27:46] jeremyb: I see mark fixed the mail issue for you in otrs, all good now? [18:28:59] !log es1002 and cp1019 offline for harddisk controller testing [18:29:8] Logged the message, RobH [18:34:32] RobH: do you see a member ID on the lcd ? [18:39:5] reads loading junos now [18:39:9] before it was blank after it loaded the os [18:39:16] oh yeah, sorry, rebooted it again [18:39:18] no info but that was afer racking, it just had es4200 something [18:39:20] and rebooted, heh [18:39:22] grrr [18:39:30] ex even [18:39:37] i even got out the docs to make sure i'm doing it correctly… :( [18:39:40] New patchset: Bhartshorne; "adding default proxy address, overridden later" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1564 [18:40:56] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1564 [18:40:56] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1564 [18:46:24] yay [18:46:26] coming up now [18:46:39] i'm gonna have to reboot it again i think but you are free to hook stuff up RobH [18:47:44] New patchset: Bhartshorne; "filling in the last of the variables with a placeholder, overridden elsewhere" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1565 [18:49:13] maplebed: bad news [18:49:27] so the card appears like it would work, except the sas cables that have to be used wont clear [18:49:27] ruh roh [18:49:32] the case is too tight to allow it [18:49:39] so i cannot close the case and actually fire it up [18:49:44] =/ [18:50:23] do you have any longer spares? [18:51:14] New review: Bhartshorne; "(no comment)" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/1565 [18:51:14] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/1565 [18:51:30] its not cable length [18:51:34] its the connector is too large [18:51:45] that's odd. [18:51:52] so where the card goes in, the raid uses a tiny sas cable and connector, the r610 has a differnet sas cable connector on the card side [18:52:2] so it wont clear all the air routing in the case by a few mm [18:52:7] scant MM [18:52:13] but, enough to stop it from working [18:52:13] just push harder. [18:52:17] :D [18:52:24] that torques the entire riser card badly [18:52:27] i tried ;] [18:52:48] is it worth while to fire it up caseless just to test the card with that many disks? [18:52:58] or would that be a waste of time... [18:56:1] PROBLEM - Puppet freshness on mw1055 is CRITICAL: Puppet has not run in the last 10 hours [18:56:1] PROBLEM - Puppet freshness on mw1062 is CRITICAL: Puppet has not run in the last 10 hours [18:56:1] PROBLEM - Puppet freshness on mw1063 is CRITICAL: Puppet has not run in the last 10 hours [18:56:1] PROBLEM - Puppet freshness on mw1083 is CRITICAL: Puppet has not run in the last 10 hours [18:56:1] PROBLEM - Puppet freshness on mw1121 is CRITICAL: Puppet has not run in the last 10 hours [18:56:1] PROBLEM - Puppet freshness on mw1140 is CRITICAL: Puppet has not run in the last 10 hours [18:56:1] PROBLEM - Puppet freshness on mw1122 is CRITICAL: Puppet has not run in the last 10 hours [18:56:2] PROBLEM - Puppet freshness on mw1148 is CRITICAL: Puppet has not run in the last 10 hours [18:56:2] PROBLEM - Puppet freshness on mw1159 is CRITICAL: Puppet has not run in the last 10 hours [18:56:3] PROBLEM - Puppet freshness on mw1156 is CRITICAL: Puppet has not run in the last 10 hours [18:56:3] PROBLEM - Puppet freshness on snapshot4 is CRITICAL: Puppet has not run in the last 10 hours [18:56:4] PROBLEM - Puppet freshness on virt2 is CRITICAL: Puppet has not run in the last 10 hours [18:56:4] PROBLEM - Puppet freshness on virt1 is CRITICAL: Puppet has not run in the last 10 hours [19:0:42] maplebed: so that sucks, i emailed dell about it, waiting on hearing back now [19:0:48] its so SO close [19:0:51] ;_; [19:0:55] I cleared the puppet yaml files for the mw's in eqiad again… looks like it is good [19:0:58] ok. [19:1:2] I have spent 15 minutes folding cables every which way, this sucks [19:1:6] =P [19:1:9] its so close. [19:1:11] RECOVERY - Puppet freshness on mw1148 is OK: puppet ran at Thu Dec 15 19:00:55 UTC 2011 [19:1:20] i wonder if i can use two of the smaller cables for sas side b [19:1:24] not sure if it will work, going to try [19:1:33] !log cp1018 also offline, stealing cables from it for testing [19:1:42] Logged the message, RobH [19:2:7] LeslieCarr: Ok, I will get to that once I finish working on this hdd thing. everything on that is cool on what you need from me right now right? [19:2:25] sounds good RobH [19:2:27] the servers have no names, so you could label the ports with the asset tags, I will email you when its all rigged up [19:2:29] with all the info [19:3:6] ok cool [19:5:11] RECOVERY - Puppet freshness on mw1156 is OK: puppet ran at Thu Dec 15 19:04:24 UTC 2011 [19:8:11] RECOVERY - Puppet freshness on mw1159 is OK: puppet ran at Thu Dec 15 19:07:21 UTC 2011 [19:9:13] i dont like how those cables [19:9:14] fi [19:9:15] t [19:9:27] its still under a bit of tension that i dont like, but seeing if it works at all anyhow [19:9:54] maplebed: if this works, did you want to snag for testing a bit, i can leave cp1014 and cp1019 offline for htis [19:10:11] RECOVERY - Puppet freshness on mw1055 is OK: puppet ran at Thu Dec 15 19:09:50 UTC 2011 [19:10:48] RobH: the only test I need to run is whether it boots and exposes its disks. [19:11:1] might be easier for you to run that then for us to hand back and forth. [19:11:11] but I'm happy to if you'd prefer.