[00:05:36] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:19:33] PROBLEM - NTP on search26 is CRITICAL: NTP CRITICAL: Offset -1.00406301 secs [00:19:42] PROBLEM - NTP on search17 is CRITICAL: NTP CRITICAL: Offset unknown [00:19:51] PROBLEM - NTP on search33 is CRITICAL: NTP CRITICAL: Offset -1.002327919 secs [00:19:51] PROBLEM - NTP on search18 is CRITICAL: NTP CRITICAL: Offset -1.003461003 secs [00:19:51] PROBLEM - NTP on search15 is CRITICAL: NTP CRITICAL: Offset -1.002972484 secs [00:19:51] PROBLEM - NTP on vanadium is CRITICAL: NTP CRITICAL: Offset -1.001446962 secs [00:19:51] PROBLEM - NTP on search22 is CRITICAL: NTP CRITICAL: Offset -1.002938747 secs [00:19:52] PROBLEM - NTP on search24 is CRITICAL: NTP CRITICAL: Offset -1.004598022 secs [00:19:52] PROBLEM - NTP on stat1 is CRITICAL: NTP CRITICAL: Offset -1.000285387 secs [00:20:00] PROBLEM - NTP on search27 is CRITICAL: NTP CRITICAL: Offset -1.002937198 secs [00:20:00] PROBLEM - NTP on search14 is CRITICAL: NTP CRITICAL: Offset -1.003344297 secs [00:20:00] PROBLEM - NTP on search35 is CRITICAL: NTP CRITICAL: Offset -1.003415585 secs [00:20:00] PROBLEM - NTP on db63 is CRITICAL: NTP CRITICAL: Offset unknown [00:20:00] PROBLEM - NTP on sockpuppet is CRITICAL: NTP CRITICAL: Offset -1.002735853 secs [00:20:01] PROBLEM - NTP on search23 is CRITICAL: NTP CRITICAL: Offset -1.009204268 secs [00:20:09] PROBLEM - NTP on ms10 is CRITICAL: NTP CRITICAL: Offset -1.002880216 secs [00:20:09] PROBLEM - NTP on virt1004 is CRITICAL: NTP CRITICAL: Offset -1.00008738 secs [00:20:09] PROBLEM - NTP on cp1041 is CRITICAL: NTP CRITICAL: Offset -1.001003861 secs [00:20:09] PROBLEM - NTP on search25 is CRITICAL: NTP CRITICAL: Offset -1.004829764 secs [00:20:09] PROBLEM - NTP on nitrogen is CRITICAL: NTP CRITICAL: Offset -1.00122726 secs [00:20:18] PROBLEM - NTP on search28 is CRITICAL: NTP CRITICAL: Offset -1.002691388 secs [00:20:18] PROBLEM - NTP on search36 is CRITICAL: NTP CRITICAL: Offset -1.003834605 secs [00:20:18] PROBLEM - NTP on search21 is CRITICAL: NTP CRITICAL: Offset -1.003881931 secs [00:20:27] PROBLEM - NTP on searchidx2 is CRITICAL: NTP CRITICAL: Offset -1.003004909 secs [00:20:36] PROBLEM - NTP on search31 is CRITICAL: NTP CRITICAL: Offset -1.001902938 secs [00:20:36] PROBLEM - NTP on virt1007 is CRITICAL: NTP CRITICAL: Offset -1.002343535 secs [00:20:36] PROBLEM - NTP on hydrogen is CRITICAL: NTP CRITICAL: Offset -1.002717495 secs [00:20:36] PROBLEM - NTP on virt1008 is CRITICAL: NTP CRITICAL: Offset -1.001419187 secs [00:20:36] PROBLEM - NTP on srv281 is CRITICAL: NTP CRITICAL: Offset -1.003268719 secs [00:20:37] PROBLEM - NTP on stafford is CRITICAL: NTP CRITICAL: Offset -1.00297296 secs [00:20:37] PROBLEM - NTP on search29 is CRITICAL: NTP CRITICAL: Offset -1.003409266 secs [00:20:38] PROBLEM - NTP on ms-be1002 is CRITICAL: NTP CRITICAL: Offset -1.001787305 secs [00:20:38] PROBLEM - NTP on ms-be1001 is CRITICAL: NTP CRITICAL: Offset -1.001975775 secs [00:20:39] PROBLEM - NTP on virt1003 is CRITICAL: NTP CRITICAL: Offset -1.000443697 secs [00:20:39] PROBLEM - NTP on search13 is CRITICAL: NTP CRITICAL: Offset -1.002311707 secs [00:20:45] PROBLEM - NTP on search34 is CRITICAL: NTP CRITICAL: Offset unknown [00:20:45] PROBLEM - NTP on virt1005 is CRITICAL: NTP CRITICAL: Offset -1.003082275 secs [00:20:54] PROBLEM - NTP on search20 is CRITICAL: NTP CRITICAL: Offset -1.003569365 secs [00:20:54] PROBLEM - NTP on manutius is CRITICAL: NTP CRITICAL: Offset -1.003699183 secs [00:20:54] PROBLEM 
- NTP on neon is CRITICAL: NTP CRITICAL: Offset -1.000853658 secs [00:20:54] PROBLEM - NTP on virt1002 is CRITICAL: NTP CRITICAL: Offset -1.003281355 secs [00:20:54] PROBLEM - NTP on search19 is CRITICAL: NTP CRITICAL: Offset -1.003387332 secs [00:21:03] PROBLEM - NTP on search16 is CRITICAL: NTP CRITICAL: Offset -1.002802253 secs [00:21:03] PROBLEM - NTP on cp1042 is CRITICAL: NTP CRITICAL: Offset -1.001209378 secs [00:21:03] PROBLEM - NTP on search30 is CRITICAL: NTP CRITICAL: Offset -1.00454247 secs [00:21:03] PROBLEM - NTP on chromium is CRITICAL: NTP CRITICAL: Offset -1.002126813 secs [00:21:03] PROBLEM - NTP on db1045 is CRITICAL: NTP CRITICAL: Offset -1.001263499 secs [00:21:12] PROBLEM - NTP on virt1001 is CRITICAL: NTP CRITICAL: Offset -1.001881003 secs [00:21:21] PROBLEM - NTP on capella is CRITICAL: NTP CRITICAL: Offset -1.00713253 secs [00:21:48] PROBLEM - NTP on ms-be1006 is CRITICAL: NTP CRITICAL: Offset -1.000199318 secs [00:24:21] PROBLEM - NTP on virt1004 is CRITICAL: NTP CRITICAL: Offset -1.001159787 secs [00:25:33] RECOVERY - NTP on db63 is OK: NTP OK: Offset -0.003840208054 secs [00:26:09] PROBLEM - NTP on ms-be1006 is CRITICAL: NTP CRITICAL: Offset -1.001355648 secs [00:31:06] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK HTTP/1.1 400 Bad Request - 336 bytes in 4.830 seconds [00:31:51] PROBLEM - NTP on ms-be1006 is CRITICAL: NTP CRITICAL: Offset -1.00000453 secs [00:32:51] New review: Krinkle; "Can we do that in our config, or does it have to happen upstream?" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/16841 [00:33:10] !log leap second event plus one month is causing an apparent 1s step in time reported by linne/dobson as seen by some clients, causing nagios errors etc. Will step. [00:33:19] Logged the message, Master [00:35:00] RECOVERY - NTP on search20 is OK: NTP OK: Offset 0.002711892128 secs [00:35:09] RECOVERY - NTP on search15 is OK: NTP OK: Offset -0.0005422830582 secs [00:35:09] PROBLEM - NTP on stat1 is CRITICAL: NTP CRITICAL: Offset -1.000931382 secs [00:35:18] RECOVERY - NTP on search18 is OK: NTP OK: Offset 0.002357125282 secs [00:35:27] RECOVERY - NTP on search14 is OK: NTP OK: Offset 0.002243876457 secs [00:35:45] RECOVERY - NTP on searchidx2 is OK: NTP OK: Offset 0.002533197403 secs [00:37:28] New review: Demon; "I'm fine with merging it--ask an opsen to do the honors :)" [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/16841 [00:38:00] PROBLEM - NTP on stat1 is CRITICAL: NTP CRITICAL: Offset -1.004558086 secs [00:38:54] RECOVERY - NTP on search34 is OK: NTP OK: Offset -0.0005459785461 secs [00:39:03] RECOVERY - NTP on search26 is OK: NTP OK: Offset 0.0005159378052 secs [00:39:03] RECOVERY - NTP on ms-be1001 is OK: NTP OK: Offset 0.003136396408 secs [00:39:12] RECOVERY - NTP on manutius is OK: NTP OK: Offset 0.002767443657 secs [00:39:12] RECOVERY - NTP on search30 is OK: NTP OK: Offset 0.002802610397 secs [00:39:12] RECOVERY - NTP on search17 is OK: NTP OK: Offset -0.001473665237 secs [00:39:21] RECOVERY - NTP on cp1042 is OK: NTP OK: Offset 0.003164887428 secs [00:39:30] RECOVERY - NTP on search33 is OK: NTP OK: Offset 0.002197980881 secs [00:39:30] RECOVERY - NTP on neon is OK: NTP OK: Offset 0.003155469894 secs [00:39:30] RECOVERY - NTP on virt1001 is OK: NTP OK: Offset 0.003191590309 secs [00:39:30] RECOVERY - NTP on search24 is OK: NTP OK: Offset 0.002688527107 secs [00:39:39] RECOVERY - NTP on ms10 is OK: NTP OK: Offset 5.280971527e-05 secs [00:39:39] RECOVERY - NTP on search25 is OK: 
NTP OK: Offset 0.002616643906 secs [00:39:48] RECOVERY - NTP on cp1041 is OK: NTP OK: Offset 0.003150343895 secs [00:39:57] RECOVERY - NTP on sockpuppet is OK: NTP OK: Offset 0.002510428429 secs [00:40:15] RECOVERY - NTP on hydrogen is OK: NTP OK: Offset 0.003090023994 secs [00:40:24] RECOVERY - NTP on ms-be1002 is OK: NTP OK: Offset 0.00305891037 secs [00:41:09] PROBLEM - NTP on virt1004 is CRITICAL: NTP CRITICAL: Offset unknown [00:41:18] RECOVERY - NTP on search23 is OK: NTP OK: Offset -0.002405881882 secs [00:41:18] RECOVERY - NTP on search21 is OK: NTP OK: Offset -0.001321434975 secs [00:42:57] RECOVERY - NTP on search13 is OK: NTP OK: Offset -0.00150513649 secs [00:42:57] RECOVERY - NTP on virt1008 is OK: NTP OK: Offset 0.003241539001 secs [00:43:06] RECOVERY - NTP on virt1007 is OK: NTP OK: Offset 0.003159165382 secs [00:44:36] RECOVERY - NTP on search16 is OK: NTP OK: Offset -0.002539992332 secs [00:46:06] RECOVERY - NTP on virt1003 is OK: NTP OK: Offset -0.0004901885986 secs [00:47:00] RECOVERY - NTP on search36 is OK: NTP OK: Offset -0.001963496208 secs [00:47:00] RECOVERY - NTP on search28 is OK: NTP OK: Offset 0.004057884216 secs [00:48:30] RECOVERY - NTP on srv281 is OK: NTP OK: Offset -0.0007045269012 secs [00:48:48] RECOVERY - NTP on search19 is OK: NTP OK: Offset -0.002653121948 secs [00:48:57] RECOVERY - NTP on search29 is OK: NTP OK: Offset 0.000510931015 secs [00:49:51] RECOVERY - NTP on search31 is OK: NTP OK: Offset -0.001265764236 secs [00:50:09] RECOVERY - NTP on stafford is OK: NTP OK: Offset -0.002719640732 secs [00:52:15] RECOVERY - NTP on virt1004 is OK: NTP OK: Offset -0.0001796483994 secs [00:53:27] RECOVERY - NTP on db1045 is OK: NTP OK: Offset -0.0007469654083 secs [00:53:45] RECOVERY - NTP on search22 is OK: NTP OK: Offset -0.001126885414 secs [00:54:03] RECOVERY - NTP on nitrogen is OK: NTP OK: Offset -0.00114107132 secs [00:56:00] RECOVERY - NTP on virt1002 is OK: NTP OK: Offset -0.002690076828 secs [00:56:09] RECOVERY - NTP on chromium is OK: NTP OK: Offset -0.001908540726 secs [00:56:18] RECOVERY - NTP on search27 is OK: NTP OK: Offset 0.003482937813 secs [00:56:27] RECOVERY - NTP on search35 is OK: NTP OK: Offset -0.006003856659 secs [00:57:21] PROBLEM - NTP on search20 is CRITICAL: NTP CRITICAL: Offset unknown [00:57:21] RECOVERY - NTP on virt1005 is OK: NTP OK: Offset -0.001603364944 secs [00:59:09] RECOVERY - NTP on capella is OK: NTP OK: Offset -0.001192092896 secs [01:01:15] PROBLEM - NTP on search26 is CRITICAL: NTP CRITICAL: Offset unknown [01:01:24] PROBLEM - NTP on hydrogen is CRITICAL: NTP CRITICAL: Offset unknown [01:01:42] PROBLEM - NTP on search15 is CRITICAL: NTP CRITICAL: Offset unknown [01:02:37] RECOVERY - NTP on search26 is OK: NTP OK: Offset 0.002385258675 secs [01:02:54] RECOVERY - NTP on search20 is OK: NTP OK: Offset 0.003555178642 secs [01:03:12] RECOVERY - NTP on search15 is OK: NTP OK: Offset 0.00204539299 secs [01:04:24] RECOVERY - NTP on stat1 is OK: NTP OK: Offset 0.0004841089249 secs [01:04:51] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:06:12] PROBLEM - NTP on virt1001 is CRITICAL: NTP CRITICAL: Offset unknown [01:08:18] PROBLEM - NTP on manutius is CRITICAL: NTP CRITICAL: Offset unknown [01:14:45] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK HTTP/1.1 400 Bad Request - 336 bytes in 0.036 seconds [01:18:21] RECOVERY - NTP on ms-be1006 is OK: NTP OK: Offset -1.037120819e-05 secs [01:20:54] RECOVERY - NTP on hydrogen is OK: NTP OK: Offset -0.0009303092957 
secs [01:24:39] RECOVERY - NTP on vanadium is OK: NTP OK: Offset -0.0007821321487 secs [01:29:27] RECOVERY - NTP on manutius is OK: NTP OK: Offset -0.0008246898651 secs [01:42:12] PROBLEM - MySQL Slave Delay on db1025 is CRITICAL: CRIT replication delay 263 seconds [01:42:48] PROBLEM - MySQL Slave Delay on storage3 is CRITICAL: CRIT replication delay 300 seconds [01:43:06] RECOVERY - NTP on virt1001 is OK: NTP OK: Offset -0.0001261234283 secs [01:44:07] New patchset: Catrope; "Fix the import sources on the Wikimania wikis" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17158 [01:48:39] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:48:57] PROBLEM - Misc_Db_Lag on storage3 is CRITICAL: CHECK MySQL REPLICATION - lag - CRITICAL - Seconds_Behind_Master : 671s [01:53:27] RECOVERY - MySQL Slave Delay on db1025 is OK: OK replication delay 6 seconds [01:54:39] RECOVERY - Misc_Db_Lag on storage3 is OK: CHECK MySQL REPLICATION - lag - OK - Seconds_Behind_Master : 18s [01:55:15] RECOVERY - MySQL Slave Delay on storage3 is OK: OK replication delay 10 seconds [01:57:03] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK HTTP/1.1 400 Bad Request - 336 bytes in 6.408 seconds [02:30:57] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:37:06] PROBLEM - Puppet freshness on srv281 is CRITICAL: Puppet has not run in the last 10 hours [02:39:21] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK HTTP/1.1 400 Bad Request - 336 bytes in 0.049 seconds [03:03:57] RECOVERY - swift-account-replicator on ms-be1006 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-replicator [03:21:03] PROBLEM - Puppet freshness on neon is CRITICAL: Puppet has not run in the last 10 hours [03:46:07] PROBLEM - Puppet freshness on ocg3 is CRITICAL: Puppet has not run in the last 10 hours [03:54:57] TimStarling: no, scap's in puppet:files/misc/scripts or something like that [03:56:01] but not scap-1 [03:57:57] http://www.mediawiki.org/wiki/Special:Code/MediaWiki/115635 [03:57:59] oh, i need to go reread scrollback for a 3rd time [06:20:09] PROBLEM - Puppet freshness on spence is CRITICAL: Puppet has not run in the last 10 hours [06:27:03] PROBLEM - Puppet freshness on nickel is CRITICAL: Puppet has not run in the last 10 hours [06:29:09] PROBLEM - Puppet freshness on hooper is CRITICAL: Puppet has not run in the last 10 hours [06:30:03] PROBLEM - Puppet freshness on virt4 is CRITICAL: Puppet has not run in the last 10 hours [06:30:03] PROBLEM - Puppet freshness on ssl1 is CRITICAL: Puppet has not run in the last 10 hours [06:31:06] PROBLEM - Puppet freshness on nfs2 is CRITICAL: Puppet has not run in the last 10 hours [06:31:06] PROBLEM - Puppet freshness on ssl1003 is CRITICAL: Puppet has not run in the last 10 hours [06:31:06] PROBLEM - Puppet freshness on ssl1001 is CRITICAL: Puppet has not run in the last 10 hours [06:31:06] PROBLEM - Puppet freshness on ssl3001 is CRITICAL: Puppet has not run in the last 10 hours [06:31:06] PROBLEM - Puppet freshness on williams is CRITICAL: Puppet has not run in the last 10 hours [06:32:09] PROBLEM - Puppet freshness on grosley is CRITICAL: Puppet has not run in the last 10 hours [06:32:09] PROBLEM - Puppet freshness on formey is CRITICAL: Puppet has not run in the last 10 hours [06:32:09] PROBLEM - Puppet freshness on sodium is CRITICAL: Puppet has not run in the last 10 hours [06:32:09] PROBLEM - Puppet freshness on kaulen 
is CRITICAL: Puppet has not run in the last 10 hours [06:32:09] PROBLEM - Puppet freshness on virt1 is CRITICAL: Puppet has not run in the last 10 hours [06:32:10] PROBLEM - Puppet freshness on virt2 is CRITICAL: Puppet has not run in the last 10 hours [06:32:10] PROBLEM - Puppet freshness on virt3 is CRITICAL: Puppet has not run in the last 10 hours [06:32:11] PROBLEM - Puppet freshness on ssl4 is CRITICAL: Puppet has not run in the last 10 hours [06:33:03] PROBLEM - Puppet freshness on aluminium is CRITICAL: Puppet has not run in the last 10 hours [06:33:03] PROBLEM - Puppet freshness on gallium is CRITICAL: Puppet has not run in the last 10 hours [06:33:03] PROBLEM - Puppet freshness on manganese is CRITICAL: Puppet has not run in the last 10 hours [06:33:03] PROBLEM - Puppet freshness on sanger is CRITICAL: Puppet has not run in the last 10 hours [06:33:03] PROBLEM - Puppet freshness on virt7 is CRITICAL: Puppet has not run in the last 10 hours [06:34:07] PROBLEM - Puppet freshness on marmontel is CRITICAL: Puppet has not run in the last 10 hours [06:34:07] PROBLEM - Puppet freshness on ssl1002 is CRITICAL: Puppet has not run in the last 10 hours [06:34:07] PROBLEM - Puppet freshness on ssl1004 is CRITICAL: Puppet has not run in the last 10 hours [06:35:09] PROBLEM - Puppet freshness on ekrem is CRITICAL: Puppet has not run in the last 10 hours [06:35:09] PROBLEM - Puppet freshness on fenari is CRITICAL: Puppet has not run in the last 10 hours [06:35:09] PROBLEM - Puppet freshness on nfs1 is CRITICAL: Puppet has not run in the last 10 hours [06:35:09] PROBLEM - Puppet freshness on ssl3 is CRITICAL: Puppet has not run in the last 10 hours [06:35:09] PROBLEM - Puppet freshness on ssl3002 is CRITICAL: Puppet has not run in the last 10 hours [06:36:03] PROBLEM - Puppet freshness on ssl2 is CRITICAL: Puppet has not run in the last 10 hours [06:38:55] PROBLEM - Puppet freshness on argon is CRITICAL: Puppet has not run in the last 10 hours [06:38:55] PROBLEM - Puppet freshness on virt6 is CRITICAL: Puppet has not run in the last 10 hours [06:38:55] PROBLEM - Puppet freshness on virt1000 is CRITICAL: Puppet has not run in the last 10 hours [06:40:07] PROBLEM - Puppet freshness on virt8 is CRITICAL: Puppet has not run in the last 10 hours [06:42:04] PROBLEM - Puppet freshness on virt5 is CRITICAL: Puppet has not run in the last 10 hours [06:42:04] PROBLEM - Puppet freshness on ssl3003 is CRITICAL: Puppet has not run in the last 10 hours [06:46:07] PROBLEM - Puppet freshness on virt0 is CRITICAL: Puppet has not run in the last 10 hours [06:46:07] PROBLEM - Puppet freshness on singer is CRITICAL: Puppet has not run in the last 10 hours [06:58:07] PROBLEM - Puppet freshness on ms-be10 is CRITICAL: Puppet has not run in the last 10 hours [07:19:07] PROBLEM - Puppet freshness on calcium is CRITICAL: Puppet has not run in the last 10 hours [08:57:31] RECOVERY - check_job_queue on spence is OK: JOBQUEUE OK - all job queues below 10,000 [08:58:07] RECOVERY - check_job_queue on neon is OK: JOBQUEUE OK - all job queues below 10,000 [10:31:25] PROBLEM - NTP peers on linne is CRITICAL: NTP CRITICAL: Server not synchronized, Offset unknown [10:34:07] RECOVERY - NTP peers on linne is OK: NTP OK: Offset 0.000828 secs [10:35:19] PROBLEM - NTP peers on dobson is CRITICAL: NTP CRITICAL: Server not synchronized, Offset unknown [10:36:40] RECOVERY - NTP peers on dobson is OK: NTP OK: Offset -0.0009 secs [11:51:42] New patchset: Mark Bergsma; "Restart the NTP client if hit by the leap second bug" 
[operations/puppet] (production) - https://gerrit.wikimedia.org/r/17176 [11:52:21] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17176 [11:52:53] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17176 [11:56:10] PROBLEM - Puppet freshness on analytics1006 is CRITICAL: Puppet has not run in the last 10 hours [11:58:07] PROBLEM - Puppet freshness on cp1032 is CRITICAL: Puppet has not run in the last 10 hours [12:38:10] PROBLEM - Puppet freshness on srv281 is CRITICAL: Puppet has not run in the last 10 hours [12:59:22] <^demon> jeremyb: I fixed All-Projects for you yesterday :) [13:22:07] PROBLEM - Puppet freshness on neon is CRITICAL: Puppet has not run in the last 10 hours [13:33:37] Change abandoned: Demon; "Not really as necessary as it was before the database move--not going to bother dealing with this." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13356 [13:47:10] PROBLEM - Puppet freshness on ocg3 is CRITICAL: Puppet has not run in the last 10 hours [13:51:43] New patchset: Hashar; "basic README introducing our files" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/16035 [13:53:49] New patchset: Hashar; "basic README introducing our files" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/16035 [13:54:03] Change merged: Hashar; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/16035 [14:24:08] ^demon: woot, danke [14:58:00] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [14:58:38] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/13484 [15:06:50] I keep turning puppet off on virt1002 (service puppet stop) but whenever I return to it after leaving it unattended I find puppet running again. [15:07:05] Is there some external service that goes through a list of servers and ensures that puppet is always up? [15:07:31] andrewbogott: cron [15:07:41] Oh, of course. [15:07:49] * andrewbogott breaks cron [15:07:59] andrewbogott: try START=no @ /etc/default/puppet [15:08:01] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [15:08:39] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/13484 [15:09:55] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [15:10:32] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/13484 [15:10:36] whoo [15:43:21] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [15:43:55] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." 
[operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/13484 [15:44:26] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [15:44:44] hehe [15:45:04] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/13484 [16:05:10] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [16:05:44] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/13484 [16:06:21] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [16:06:59] New review: gerrit2; "Change did not pass lint check. You will need to send an amended patchset for this (see: https://lab..." [operations/puppet] (production); V: -1 - https://gerrit.wikimedia.org/r/13484 [16:08:05] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [16:08:42] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/13484 [16:21:04] PROBLEM - Puppet freshness on spence is CRITICAL: Puppet has not run in the last 10 hours [16:28:08] PROBLEM - Puppet freshness on nickel is CRITICAL: Puppet has not run in the last 10 hours [16:28:49] New patchset: Reedy; "Bug 38905 - ShortUrl does not work on non wikipedia projects" [operations/apache-config] (master) - https://gerrit.wikimedia.org/r/17191 [16:30:04] PROBLEM - Puppet freshness on hooper is CRITICAL: Puppet has not run in the last 10 hours [16:30:12] New patchset: Reedy; "Bug 38905 - ShortUrl does not work on non wikipedia projects" [operations/apache-config] (master) - https://gerrit.wikimedia.org/r/17191 [16:30:35] maplebed: ^ [16:31:07] PROBLEM - Puppet freshness on ssl1 is CRITICAL: Puppet has not run in the last 10 hours [16:31:07] PROBLEM - Puppet freshness on virt4 is CRITICAL: Puppet has not run in the last 10 hours [16:32:10] PROBLEM - Puppet freshness on nfs2 is CRITICAL: Puppet has not run in the last 10 hours [16:32:10] PROBLEM - Puppet freshness on ssl1003 is CRITICAL: Puppet has not run in the last 10 hours [16:32:10] PROBLEM - Puppet freshness on ssl1001 is CRITICAL: Puppet has not run in the last 10 hours [16:32:10] PROBLEM - Puppet freshness on ssl3001 is CRITICAL: Puppet has not run in the last 10 hours [16:32:10] PROBLEM - Puppet freshness on williams is CRITICAL: Puppet has not run in the last 10 hours [16:32:27] Reedy: do you have merge and push rights on that repo? I +1ed it. 
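The NTP alert storm at the top of this log was the leap-second aftershock: a month after the July leap second, linne/dobson reported an apparent 1s step in time to some clients until the clocks were stepped (see the 00:33 !log entry and Gerrit change 17176, "Restart the NTP client if hit by the leap second bug", merged at 11:52). The shell below is only a rough sketch of that kind of remediation, not the contents of change 17176 (which this log never quotes); the 900 ms threshold and the `ntp` service name are assumptions.

```sh
#!/bin/sh
# Rough sketch only -- not Gerrit change 17176, which this log does not quote.
# Restart ntpd when the selected peer's offset is roughly a whole second,
# which is the symptom logged above. Threshold and service name are assumed.
offset_ms=$(ntpq -pn | awk '/^\*/ {print $9}')  # offset column of the selected (*) peer, in ms
abs_ms=${offset_ms#-}                           # drop a leading minus sign
if [ -n "$abs_ms" ] && [ "$(echo "$abs_ms > 900" | bc)" = "1" ]; then
    service ntp restart                         # lets ntpd step the clock back into sync on start
fi
```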
[16:33:04] PROBLEM - Puppet freshness on formey is CRITICAL: Puppet has not run in the last 10 hours [16:33:04] PROBLEM - Puppet freshness on grosley is CRITICAL: Puppet has not run in the last 10 hours [16:33:04] PROBLEM - Puppet freshness on kaulen is CRITICAL: Puppet has not run in the last 10 hours [16:33:04] PROBLEM - Puppet freshness on sodium is CRITICAL: Puppet has not run in the last 10 hours [16:33:04] PROBLEM - Puppet freshness on virt3 is CRITICAL: Puppet has not run in the last 10 hours [16:33:05] PROBLEM - Puppet freshness on virt2 is CRITICAL: Puppet has not run in the last 10 hours [16:33:05] PROBLEM - Puppet freshness on virt1 is CRITICAL: Puppet has not run in the last 10 hours [16:33:06] PROBLEM - Puppet freshness on ssl4 is CRITICAL: Puppet has not run in the last 10 hours [16:34:07] PROBLEM - Puppet freshness on aluminium is CRITICAL: Puppet has not run in the last 10 hours [16:34:07] PROBLEM - Puppet freshness on gallium is CRITICAL: Puppet has not run in the last 10 hours [16:34:07] PROBLEM - Puppet freshness on manganese is CRITICAL: Puppet has not run in the last 10 hours [16:34:07] PROBLEM - Puppet freshness on virt7 is CRITICAL: Puppet has not run in the last 10 hours [16:34:07] PROBLEM - Puppet freshness on sanger is CRITICAL: Puppet has not run in the last 10 hours [16:35:10] PROBLEM - Puppet freshness on marmontel is CRITICAL: Puppet has not run in the last 10 hours [16:35:10] PROBLEM - Puppet freshness on ssl1002 is CRITICAL: Puppet has not run in the last 10 hours [16:35:10] PROBLEM - Puppet freshness on ssl1004 is CRITICAL: Puppet has not run in the last 10 hours [16:35:41] New patchset: Demon; "Overhauling gerrit manifest to be a role class" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/13484 [16:36:04] PROBLEM - Puppet freshness on ekrem is CRITICAL: Puppet has not run in the last 10 hours [16:36:04] PROBLEM - Puppet freshness on fenari is CRITICAL: Puppet has not run in the last 10 hours [16:36:04] PROBLEM - Puppet freshness on ssl3 is CRITICAL: Puppet has not run in the last 10 hours [16:36:04] PROBLEM - Puppet freshness on ssl3002 is CRITICAL: Puppet has not run in the last 10 hours [16:36:04] PROBLEM - Puppet freshness on nfs1 is CRITICAL: Puppet has not run in the last 10 hours [16:36:20] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/13484 [16:37:07] PROBLEM - Puppet freshness on ssl2 is CRITICAL: Puppet has not run in the last 10 hours [16:37:33] RobHalsell: could you update racktables for ms-be1008 and ms-be1012? [16:37:40] yep, sorry about that [16:37:54] New patchset: Mark Bergsma; "Revert "certs: use c_rehash instead of manually symlinking"" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17193 [16:37:57] np, I just don't want to confuse future swift ring manipulators. 
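Earlier in the log (15:06-15:08) andrewbogott found puppet restarting itself after `service puppet stop`, because cron and the init defaults bring the agent back. A minimal sketch of the `START=no @ /etc/default/puppet` suggestion, assuming the stock Debian/Ubuntu packaging of the puppet agent:

```sh
# Minimal sketch of the "START=no @ /etc/default/puppet" suggestion above,
# assuming the stock Debian/Ubuntu puppet agent packaging.
sudo service puppet stop
sudo sed -i 's/^START=.*/START=no/' /etc/default/puppet  # keep the init script from bringing it back
# A cron-driven "puppet agent" run (the other thing restarting it, per the log)
# still has to be commented out of the relevant crontab separately.
```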
[16:38:37] New patchset: Mark Bergsma; "Revert "Follow up to change 17065, adding the rapidssl ca source back in"" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17194 [16:39:04] PROBLEM - Puppet freshness on ms-be1005 is CRITICAL: Puppet has not run in the last 10 hours [16:39:04] PROBLEM - Puppet freshness on ms-be1006 is CRITICAL: Puppet has not run in the last 10 hours [16:39:04] PROBLEM - Puppet freshness on ms-be1009 is CRITICAL: Puppet has not run in the last 10 hours [16:39:10] maplebed: fixed [16:39:11] mark: it was broken but I was told that someone immediately fixed it [16:39:19] after merging it [16:39:20] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17193 [16:39:20] New review: Mark Bergsma; "Puppet is currently broken, feel free to remerge after fixing" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/17193 [16:39:20] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17194 [16:39:23] jeremyb: wasn't that the case? [16:39:39] jeremyb: I think you gave it the -1 initially and you told me so [16:39:43] New review: Mark Bergsma; "Puppet is currently broken, feel free to remerge after fixing" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/17194 [16:39:44] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17194 [16:39:54] New patchset: Mark Bergsma; "Revert "certs: use c_rehash instead of manually symlinking"" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17193 [16:40:07] PROBLEM - Puppet freshness on argon is CRITICAL: Puppet has not run in the last 10 hours [16:40:07] PROBLEM - Puppet freshness on virt6 is CRITICAL: Puppet has not run in the last 10 hours [16:40:07] PROBLEM - Puppet freshness on virt1000 is CRITICAL: Puppet has not run in the last 10 hours [16:40:20] paravoid: no idea [16:40:21] mark: or is it broken for a different reason? [16:40:23] don't want to figure it out now [16:40:27] what was broken exactly? [16:40:37] i'm just reverting the cert related changes until someone can figure it out [16:40:38] New patchset: Pyoungmeister; "page triage cleanup: use correct syntax for mwscript" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17195 [16:40:42] err: Could not apply complete catalog: Found 1 dependency cycle: [16:40:42] (File[/etc/ssl/certs/star.wikibooks.org.pem] => Exec[c_rehash] => Class[Certificates::Base] => Install_certificate[star.wikibooks.org] => File[/etc/ssl/certs/star.wikibooks.org.pem]) [16:40:42] Try the '--graph' option and opening the resulting '.dot' file in OmniGraffle or GraphViz [16:40:50] cooool! [16:41:10] PROBLEM - Puppet freshness on virt8 is CRITICAL: Puppet has not run in the last 10 hours [16:41:20] hey Reedy, could you take a look at https://gerrit.wikimedia.org/r/17195 and make sure that my syntax for invoking mwscript is correct? [16:41:20] New review: Mark Bergsma; "Puppet is currently broken, feel free to remerge after fixing" [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/17193 [16:41:20] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17193 [16:41:20] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17195 [16:41:41] New review: Mark Bergsma; "Puppet is currently broken, feel free to remerge after fixing" [operations/puppet] (production); V: 1 C: 2; - https://gerrit.wikimedia.org/r/17193 [16:41:44] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17193 [16:42:40] RobHalsell: ping [16:42:58] paravoid: ma rk reverted both yours and also the immediate followup ;) [16:43:07] RECOVERY - Puppet freshness on ssl1 is OK: puppet ran at Wed Aug 1 16:42:50 UTC 2012 [16:43:07] PROBLEM - Puppet freshness on ssl3003 is CRITICAL: Puppet has not run in the last 10 hours [16:43:07] PROBLEM - Puppet freshness on virt5 is CRITICAL: Puppet has not run in the last 10 hours [16:43:07] RECOVERY - Puppet freshness on virt8 is OK: puppet ran at Wed Aug 1 16:43:01 UTC 2012 [16:43:25] RECOVERY - Puppet freshness on virt1000 is OK: puppet ran at Wed Aug 1 16:43:09 UTC 2012 [16:43:58] and now puppet freshness is recovering [16:45:04] RECOVERY - Puppet freshness on ssl1004 is OK: puppet ran at Wed Aug 1 16:45:00 UTC 2012 [16:47:10] PROBLEM - Puppet freshness on singer is CRITICAL: Puppet has not run in the last 10 hours [16:47:10] PROBLEM - Puppet freshness on virt0 is CRITICAL: Puppet has not run in the last 10 hours [16:48:28] Reedy: thanks! [16:48:46] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17195 [16:49:54] aude: heyas [16:50:10] RECOVERY - Puppet freshness on nickel is OK: puppet ran at Wed Aug 1 16:49:55 UTC 2012 [16:50:12] sorry about that, had tunnel vision on something else [16:50:49] RobHalsell: just wondering when we get wikimania videos in (on disk i assume), any suggestions on how we can get them uploaded to a holding bin / dropbox type place? [16:51:05] so we can review and put together the metadata before putting on commons [16:51:08] honestly the easiest thing to do is mail me a hard disk [16:51:13] RobHalsell: exactly [16:51:21] * aude hoping you'd say that ;) [16:51:30] RobHalsell: is giving you a disk easier than getting a place to rsync to? [16:51:35] it will probably be a couple more weeks [16:51:37] we have a sata to usb disk toaster thing already [16:51:46] so it doesnt even need to be an external disk [16:51:52] jeremyb: it would be faster i think unless someone has a great internet connection [16:52:01] RobHalsell: ok [16:52:07] RECOVERY - Puppet freshness on ssl4 is OK: puppet ran at Wed Aug 1 16:51:41 UTC 2012 [16:52:08] we tend to allocate a host for a month or two to transcode videos and the like [16:52:16] RobHalsell: ok :) [16:52:23] Roan handled this past years [16:52:32] i just allocated the hardware for him and plugged in the disks [16:52:52] use labs? [16:52:54] RobHalsell: so we could theoretically have someone just rsync straight to said misc host? [16:53:05] mark: for this amount of storage? [16:53:10] RECOVERY - Puppet freshness on ssl2 is OK: puppet ran at Wed Aug 1 16:52:45 UTC 2012 [16:53:10] RECOVERY - Puppet freshness on ssl3002 is OK: puppet ran at Wed Aug 1 16:52:54 UTC 2012 [16:53:12] what amount of storage? [16:53:24] if labs cant handle that it's pretty useless isn' it [16:53:25] mark: i'm guess more than a TB if it's multiple disks [16:53:27] jeremyb: thats more annoying. 
[16:53:35] then we have to setup some access and such [16:53:46] use labs [16:53:48] that's what it's for [16:53:59] we could if you think it's reliable enough [16:54:04] RECOVERY - Puppet freshness on sodium is OK: puppet ran at Wed Aug 1 16:54:03 UTC 2012 [16:54:10] mark: checked your schedule? [16:54:28] * aude would just like to get the disk to someone like rob who can get the raw files online somewhere [16:54:51] well, when we do the plug in the disk, its only available to ops and like... roan. [16:54:52] e.g. online but private such as dropbox type thing [16:55:02] add a few s1.xlarge instances and email labs-l to make sure that nobody migrates it ;) [16:55:07] but if you want others to work and access them, then what we have done in the past is not ideal [16:55:18] then you want labs, indeed [16:55:21] RobHalsell: then you can tranfer them to somewhere we can access? [16:55:31] RobHalsell: would it not be possible to just stick it in the labs vlan? i guess too much work [16:55:33] not any differently than you do [16:55:33] it might be me and jeremyb , but not really sure [16:55:34] RECOVERY - Puppet freshness on singer is OK: puppet ran at Wed Aug 1 16:55:18 UTC 2012 [16:55:34] RECOVERY - Puppet freshness on kaulen is OK: puppet ran at Wed Aug 1 16:55:33 UTC 2012 [16:55:37] labs isnt bare metal [16:55:47] * paravoid coughs. [16:55:48] you would have to ask ryan how easy it would be to access osmething like that from them [16:56:10] RECOVERY - Puppet freshness on virt2 is OK: puppet ran at Wed Aug 1 16:55:42 UTC 2012 [16:56:12] ie: i have no idea if it is feasible to expect to plug in an external disk into some box and have a labs instance be able to reach it [16:56:35] copying the files in from a physical server is likely gonna be easier [16:56:37] if its terabytes of video, i can understand wanting to have some kind of local disk transfer [16:56:40] RobHalsell: that's more leslie's dept. and the answer is no ;) [16:56:52] that would work fine [16:56:53] I know some vm hosts can't [16:56:57] jeremyb: how is that leslie? [16:57:04] i meant into the physical virtual host server [16:57:09] and then have the vm moun tit [16:57:09] ah [16:57:12] mark, could you take a look at https://gerrit.wikimedia.org/r/#/c/16990/ ? [16:57:12] seems like ryans area [16:57:15] hrm, interesting [16:57:19] RobHalsell: what can mount from one vlan to another? 
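Back on the 16:40 catalog failure: puppet's own hint is to re-run with `--graph` and open the generated `.dot` file, which makes the reported cycle (File[...star.wikibooks.org.pem] -> Exec[c_rehash] -> Class[Certificates::Base] -> back to the File) visible. A sketch of that workflow, assuming Graphviz is installed; the graph filenames are puppet's defaults:

```sh
# Sketch of the "--graph" hint from the error above; assumes Graphviz is installed.
puppet agent --test --graph
graphdir=$(puppet agent --configprint graphdir)           # where puppet wrote the .dot files
dot -Tpng "$graphdir/expanded_relationships.dot" -o /tmp/relationships.png
# The dependency cycle shows up as a loop in the rendered graph; breaking it
# means dropping one of the require/before/notify edges between those resources.
```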
[16:57:22] MaxSem: yes [16:57:39] jeremyb: not vlan, what can a virtual host access on the actual hardware hosts they reside on [16:57:54] RobHalsell: ohhhhhhhhhh [16:58:07] RECOVERY - Puppet freshness on virt5 is OK: puppet ran at Wed Aug 1 16:57:52 UTC 2012 [16:58:34] RECOVERY - Puppet freshness on fenari is OK: puppet ran at Wed Aug 1 16:58:23 UTC 2012 [16:58:36] heh, i automatically go to 'what solution has the least hardware involved' [16:59:01] RECOVERY - Puppet freshness on marmontel is OK: puppet ran at Wed Aug 1 16:58:39 UTC 2012 [16:59:02] s3 [16:59:04] ;-) [16:59:10] PROBLEM - Puppet freshness on ms-be10 is CRITICAL: Puppet has not run in the last 10 hours [16:59:15] plugging a sata disk thats in our sata to usb toaster thing into the actual server that houses the virtual machine that they would use for transcoding [16:59:44] in my brain i want to say that it wouldn't, because it can't access the local storage on the machine [16:59:48] but i have no idea how well our systems can handle that with openstack and such [16:59:52] i'm going ot start googling [17:00:08] well, you can in esx, esxi, and regular vmware virtual server [17:00:09] LeslieCarr: well you could virsh it... [17:00:11] or you used to be able to [17:00:28] all of those solutions have drawbacks so we dont use them though ;] [17:00:28] and you can in qemu [17:00:39] i don't remember if we use xen or kvm [17:00:39] also can in parallels [17:00:46] i want to say you can in kvm [17:00:49] but its been years. [17:00:52] RobH: the point was kvm == qemu [17:01:07] RECOVERY - Puppet freshness on williams is OK: puppet ran at Wed Aug 1 17:01:00 UTC 2012 [17:01:12] so we should poke ryan when he comes online [17:01:20] yeah, he's not here physically yet [17:01:34] RECOVERY - Puppet freshness on virt0 is OK: puppet ran at Wed Aug 1 17:01:25 UTC 2012 [17:01:34] RECOVERY - Puppet freshness on nfs2 is OK: puppet ran at Wed Aug 1 17:01:30 UTC 2012 [17:01:40] then aude can have the disks shipped to whatever datacenter we are going to locate the instance on [17:01:47] (i imagine tampa for now) [17:02:01] RECOVERY - Puppet freshness on virt1 is OK: puppet ran at Wed Aug 1 17:01:44 UTC 2012 [17:02:05] RobH: that works [17:02:07] New review: Mark Bergsma; "Almost there. 
:)" [operations/puppet] (production); V: 0 C: -1; - https://gerrit.wikimedia.org/r/16990 [17:02:13] but only if the software supports it, so we'll find out later from ryan [17:02:19] ok [17:02:31] RobH: it might be easier (though not as elegant) to attach it to a machine, i open up a port, and we rsync the files to labs instances [17:02:36] yeah [17:02:37] RECOVERY - Puppet freshness on aluminium is OK: puppet ran at Wed Aug 1 17:02:11 UTC 2012 [17:02:38] far easier [17:02:40] * aude originally envisioned getting disks right after wikimania and delivering them to ashburn [17:02:43] no need to open ports even [17:02:47] labs can just reach public servers [17:02:49] didn't work exactly like that ;) [17:02:57] oh, right of course [17:02:59] then may as well send it to ashburn [17:03:04] cuz i have more hsots there to plug this crap into [17:03:06] heh [17:03:22] RECOVERY - Puppet freshness on gallium is OK: puppet ran at Wed Aug 1 17:03:07 UTC 2012 [17:03:24] whatever you think is best [17:03:25] attach disk, rsync files to labs instance with a lot of storage, unplug disk, be happy [17:03:36] can even be rob's laptop [17:03:40] RECOVERY - Puppet freshness on grosley is OK: puppet ran at Wed Aug 1 17:03:23 UTC 2012 [17:03:46] no need to attach to any servers [17:04:01] my laptop is an air. [17:04:07] usb to usb to usb to usb.... [17:04:07] RECOVERY - Puppet freshness on hooper is OK: puppet ran at Wed Aug 1 17:04:01 UTC 2012 [17:04:42] poor you [17:04:55] just means its non ideal for this.... [17:05:03] so we use a misc host for a bit ;] [17:05:10] RECOVERY - Puppet freshness on sanger is OK: puppet ran at Wed Aug 1 17:04:58 UTC 2012 [17:05:38] RECOVERY - Puppet freshness on formey is OK: puppet ran at Wed Aug 1 17:05:21 UTC 2012 [17:06:04] RECOVERY - Puppet freshness on ssl3 is OK: puppet ran at Wed Aug 1 17:05:45 UTC 2012 [17:06:17] hehe [17:06:40] RECOVERY - Puppet freshness on virt7 is OK: puppet ran at Wed Aug 1 17:06:11 UTC 2012 [17:06:40] RECOVERY - Puppet freshness on virt6 is OK: puppet ran at Wed Aug 1 17:06:18 UTC 2012 [17:07:34] yea... im coming to the realization my next laptop is returning to the thinkbooks. [17:07:34] RECOVERY - Puppet freshness on ssl3003 is OK: puppet ran at Wed Aug 1 17:07:20 UTC 2012 [17:07:34] RECOVERY - Puppet freshness on nfs1 is OK: puppet ran at Wed Aug 1 17:07:31 UTC 2012 [17:07:51] and thus also embracing linux as my desktop =P [17:07:53] i'm pretty sure it's gonna be a retina mbp [17:08:05] i'll just miss the NIC port [17:08:07] i dislike the non swapping battery/ram/parts [17:08:10] RECOVERY - Puppet freshness on virt3 is OK: puppet ran at Wed Aug 1 17:07:39 UTC 2012 [17:08:10] RECOVERY - Puppet freshness on ssl1003 is OK: puppet ran at Wed Aug 1 17:07:49 UTC 2012 [17:08:17] well,t he thunderbolt gige isn't bad... [17:08:19] that just means it'll have to be well specced from the start ;p [17:08:27] i find it easier to accept that when its ultralight [17:08:37] RECOVERY - Puppet freshness on ekrem is OK: puppet ran at Wed Aug 1 17:08:25 UTC 2012 [17:08:43] though i admit, i used brions mac for a few minutes [17:08:48] on the full native res of the new display [17:08:51] i could work on that. 
[17:09:00] the only thing i've ever swapped in this one is the hd for an ssd [17:10:07] RECOVERY - Puppet freshness on ssl3001 is OK: puppet ran at Wed Aug 1 17:09:59 UTC 2012 [17:11:01] RECOVERY - Puppet freshness on argon is OK: puppet ran at Wed Aug 1 17:10:36 UTC 2012 [17:11:01] RECOVERY - Puppet freshness on ssl1001 is OK: puppet ran at Wed Aug 1 17:10:46 UTC 2012 [17:11:27] LeslieCarr: i'm upgrading the new EX4200s and EX4500 in esams to 11.4R2.14 [17:11:39] to be able to form a VC between them [17:12:04] RECOVERY - Puppet freshness on ssl1002 is OK: puppet ran at Wed Aug 1 17:11:46 UTC 2012 [17:12:39] we'll have to upgrade the existing stack too [17:12:46] cool [17:12:54] yeah [17:13:07] RECOVERY - Puppet freshness on manganese is OK: puppet ran at Wed Aug 1 17:12:34 UTC 2012 [17:13:07] RECOVERY - Puppet freshness on virt4 is OK: puppet ran at Wed Aug 1 17:12:35 UTC 2012 [17:13:09] also, new optics will arrive on friday [17:13:13] so will be able to do the router migration after that [17:13:22] hrm [17:13:24] New patchset: MaxSem; "Commit live hack that enables GeoData on testwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17197 [17:13:25] just thinking of it [17:13:30] I think i'm one 3m VC cable short :( [17:17:59] oh noes [17:20:10] PROBLEM - Puppet freshness on calcium is CRITICAL: Puppet has not run in the last 10 hours [17:37:34] RECOVERY - Puppet freshness on spence is OK: puppet ran at Wed Aug 1 17:37:13 UTC 2012 [17:37:42] !log pushing dns typo correction [17:37:50] Logged the message, Master [17:39:52] Change merged: preilly; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17197 [17:44:57] PROBLEM - mysqld processes on db63 is CRITICAL: PROCS CRITICAL: 0 processes with command name mysqld [17:48:04] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17146 [18:09:51] RECOVERY - swift-object-server on ms-be1005 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-object-server [18:09:51] RECOVERY - swift-container-server on ms-be1005 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-container-server [18:10:09] RECOVERY - swift-container-auditor on ms-be1005 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-auditor [18:10:09] RECOVERY - swift-account-auditor on ms-be1005 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-auditor [18:10:18] RECOVERY - swift-container-updater on ms-be1005 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-updater [18:10:18] RECOVERY - NTP on ms-be1005 is OK: NTP OK: Offset 0.004585623741 secs [18:10:27] RECOVERY - swift-account-server on ms-be1005 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-account-server [18:10:36] RECOVERY - swift-object-auditor on ms-be1005 is OK: PROCS OK: 2 processes with regex args ^/usr/bin/python /usr/bin/swift-object-auditor [18:10:36] RECOVERY - swift-object-updater on ms-be1005 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-updater [18:10:54] RECOVERY - swift-account-reaper on ms-be1005 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-reaper [18:10:54] RECOVERY - swift-container-replicator on ms-be1005 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-replicator [18:10:54] RECOVERY - swift-object-replicator on ms-be1005 is OK: PROCS 
OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-replicator [18:21:29] New patchset: Alex Monk; "(bug 38926) Create Project/Project Talk namespace aliases on arwikinews." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17209 [18:29:04] New patchset: MaxSem; "Wiki Loves Monuments API server, RT#3221" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/16990 [18:29:44] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/16990 [18:30:37] New patchset: MaxSem; "Wiki Loves Monuments API server, RT#3221" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/16990 [18:30:42] RECOVERY - NTP on ms-be1009 is OK: NTP OK: Offset 0.008412241936 secs [18:30:51] RECOVERY - swift-container-server on ms-be1009 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-container-server [18:31:00] RECOVERY - swift-object-replicator on ms-be1009 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-replicator [18:31:09] RECOVERY - swift-container-auditor on ms-be1009 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-auditor [18:31:17] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/16990 [18:31:18] RECOVERY - swift-object-server on ms-be1009 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-object-server [18:31:18] RECOVERY - swift-account-server on ms-be1009 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-account-server [18:31:52] RECOVERY - swift-container-updater on ms-be1009 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-updater [18:31:52] RECOVERY - swift-object-updater on ms-be1009 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-updater [18:32:10] RECOVERY - swift-account-auditor on ms-be1009 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-auditor [18:32:10] RECOVERY - swift-object-auditor on ms-be1009 is OK: PROCS OK: 2 processes with regex args ^/usr/bin/python /usr/bin/swift-object-auditor [18:32:37] RECOVERY - swift-account-reaper on ms-be1009 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-reaper [18:32:37] RECOVERY - swift-container-replicator on ms-be1009 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-replicator [18:34:43] PROBLEM - swift-account-reaper on ms-be1007 is CRITICAL: Connection refused by host [18:34:43] PROBLEM - swift-object-replicator on ms-be1007 is CRITICAL: Connection refused by host [18:34:43] PROBLEM - swift-container-replicator on ms-be1007 is CRITICAL: Connection refused by host [18:35:01] PROBLEM - swift-account-replicator on ms-be1007 is CRITICAL: Connection refused by host [18:35:01] PROBLEM - swift-object-server on ms-be1007 is CRITICAL: Connection refused by host [18:35:01] PROBLEM - swift-container-server on ms-be1007 is CRITICAL: Connection refused by host [18:35:03] New patchset: Aaron Schulz; "Added thumb_handler.php entry point to git."
[operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17212 [18:35:19] PROBLEM - swift-account-server on ms-be1007 is CRITICAL: Connection refused by host [18:35:19] PROBLEM - swift-container-updater on ms-be1007 is CRITICAL: Connection refused by host [18:35:28] PROBLEM - swift-object-updater on ms-be1007 is CRITICAL: Connection refused by host [18:35:46] PROBLEM - swift-container-auditor on ms-be1007 is CRITICAL: Connection refused by host [18:35:46] PROBLEM - swift-object-auditor on ms-be1007 is CRITICAL: Connection refused by host [18:35:46] PROBLEM - swift-account-auditor on ms-be1007 is CRITICAL: Connection refused by host [18:36:50] MaxSem: /usr/local/sbin, not /usr/bin [18:37:04] mark, not root anymore [18:37:24] still can't be in /usr/bin [18:37:35] programs not installed by packages should put stuff under /usr/local [18:38:02] oh, and forgot to remove root [18:38:07] yeah [18:38:17] you need to put in something else though [18:38:21] which user will it run as? [18:38:25] wlm [18:39:39] and while you're at it [18:39:43] quote your resource titles [18:39:56] i think this one would work, but puppet often barfs on weird characters [18:40:05] your cron resource I mean [18:40:21] mark: and systemuser? [18:40:29] yeah [18:42:37] New patchset: MaxSem; "Wiki Loves Monuments API server, RT#3221" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/16990 [18:43:16] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/16990 [18:43:21] MaxSem: if it's NOT root, it needs to be in /usr/local/bin [18:43:41] grrr [18:43:53] Change merged: Aaron Schulz; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17212 [18:44:01] * MaxSem needs to read FHS [18:44:02] also [18:44:11] I would like the update stuff to live in a separate subclass [18:44:14] would be much cleaner [18:44:18] I think i commented that earlier too [18:44:32] or at least not in one big long file resource list [18:44:35] it's a bit unclear now [18:44:45] if you keep it in one class, at least separate it out visually a bit, with comments [18:45:14] mark, I replied asking for guideline on this stuff [18:45:31] yeah but we don't really have anything written out yet [18:46:13] it's mostly a matter of good coding practices though [18:46:21] in php you're not putting everything in one function either right [18:47:06] hmm, have you seen our parser? XD [18:48:03] good argument right there [18:54:23] j^: about webm mime.type, the only thing I know is that it is in /etc/mime.type on Ubuntu Precise [18:54:55] j^: and I have no idea which system / software actually serves the files behind the nginx proxy :/ [18:55:41] I did MIME type stuff for Apache before [18:55:47] But ms7 runs some other web server [18:55:58] is upload.wm.org ultimately served by Apache ? [18:56:06] No [18:56:09] But bits is [18:56:22] I had to do MIME poking for WebFonts, but that was bits, which is Apache [18:56:24] upload is something else [18:56:26] cause that would be just about adding the AddType video/webm .webm [18:56:30] And it's not puppetized *stab* [18:56:51] ben explained me the thumb system [18:56:56] I eventually completely forgot about it [18:57:13] I got lost in the too many layers involved [18:57:45] paravoid: are you around and could you help me with reprepro? 
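On the WebM MIME-type question just above: for the Apache-served parts (bits), it really is the one `AddType video/webm .webm` line mentioned at 18:56. A sketch follows, with the conf.d filename as an assumption; it says nothing about the nginx/ms7 layers that upload.wikimedia.org itself sits behind.

```sh
# Sketch for the Apache-served case only; the conf.d path is an assumption,
# not where the (unpuppetized) production config actually lives.
echo 'AddType video/webm .webm' | sudo tee /etc/apache2/conf.d/webm-mime.conf
sudo apache2ctl configtest && sudo service apache2 reload
```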
[18:58:08] I know how the system works but I don't know much about ms7 and ms5 themselves [18:58:18] Like which one runs which OS and which web server is used [18:58:30] ms5 runs linux; ms7 solaris. [18:58:54] OK [18:58:58] And which web servers are used? [18:59:13] j^: I will later, I thought mark was going through some review ping-pong with you [18:59:24] Ah, ms5 runs nginx apparently [18:59:25] maplebed: about to leave but shoot if it's something quick [18:59:29] This is actually puppetized [18:59:36] maplebed: I was talking about Jan mail to ops and adding a mime type to .webm files served by upload.wm.org [18:59:46] paravoid: a package I need is only in the lucid-wikimedia repo; I need it on precise. [19:00:04] the package is python and I don't think has anything version-specific. [19:00:11] reprepro copy :) [19:00:19] I found https://wikitech.wikimedia.org/view/Reprepro#Copying_between_distributions but it's not working. [19:00:27] "Will not copy as not found: ganglia-logtailer." [19:00:27] one would assume that ms5 would run iis ;-) [19:00:51] what did you type exactly? [19:01:00] reprepro --ignore=undefinedtarget copy lucid-wikimedia precise-wikimedia ganglia-logtailer [19:01:06] oh damn. [19:01:11] thanks, carboard debugger. [19:01:16] I got the arguments backwards. [19:01:19] it's destination source [19:01:20] right [19:01:37] and I don't think you need the --ignore [19:01:53] without it I get [19:01:54] Error: packages database contains unused 'karmic-wikimedia|main|amd64' database. [19:01:54] This either means you removed a distribution, component or architecture from [19:01:54] the distributions config file without calling clearvanished, or your config [19:01:54] does not belong to this database. [19:02:03] To ignore use --ignore=undefinedtarget. [19:02:04] [19:02:10] ah [19:02:14] let me fix that [19:02:36] paravoid: thanks; reversing the arguments worked and it looks happy. [19:03:05] that's because mark (rightfully) removed karmic, the database needs cleanup [19:03:23] !log installing a couple lib upgrades on fenari [19:03:31] Logged the message, Master [19:03:42] ok, ran clearvanished [19:04:08] !log Ran reprepro --delete clearvanished on brewster to cleanup removed repositories karmic-wikimedia and oneiric-wikimedia [19:04:16] Logged the message, Master [19:04:41] !log started hotbackup of db1017 to db63 [19:04:49] Logged the message, Master [19:06:01] New patchset: MaxSem; "Wiki Loves Monuments API server, RT#3221" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/16990 [19:06:40] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/16990 [19:07:07] RECOVERY - swift-account-replicator on ms-be1009 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-replicator [19:07:34] RECOVERY - swift-container-replicator on ms-be1007 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-replicator [19:07:43] RECOVERY - swift-account-reaper on ms-be1007 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-reaper [19:07:43] RECOVERY - swift-object-auditor on ms-be1007 is OK: PROCS OK: 2 processes with regex args ^/usr/bin/python /usr/bin/swift-object-auditor [19:07:43] RECOVERY - swift-container-server on ms-be1007 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-container-server [19:07:43] RECOVERY - swift-account-replicator on ms-be1007 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-replicator [19:07:43] RECOVERY - swift-object-server on ms-be1007 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-object-server [19:07:44] RECOVERY - swift-object-replicator on ms-be1007 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-replicator [19:08:01] RECOVERY - swift-account-replicator on ms-be1005 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-replicator [19:08:01] RECOVERY - swift-container-updater on ms-be1007 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-updater [19:08:01] RECOVERY - swift-object-updater on ms-be1007 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-updater [19:08:10] RECOVERY - swift-account-auditor on ms-be1007 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-auditor [19:08:19] RECOVERY - swift-account-server on ms-be1007 is OK: PROCS OK: 25 processes with regex args ^/usr/bin/python /usr/bin/swift-account-server [19:08:28] RECOVERY - swift-container-auditor on ms-be1007 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-auditor [19:26:17] PROBLEM - swift-account-replicator on ms-be1003 is CRITICAL: PROCS CRITICAL: 0 processes with regex args ^/usr/bin/python /usr/bin/swift-account-replicator [19:35:12] RobH: are you in eqiad today? [19:35:41] maplebed: nope, was yesterday [19:35:46] can be, whats up? [19:35:46] k. [19:36:06] ms-be1005 isn't behaving. I should poke at the bios more before giving you a ticket though. [19:36:21] from parted: Error: Error opening /dev/sdc: No such device or address [19:38:51] anyway. lunch now. [19:42:23] mark, I think I've done everything as you said [19:45:37] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/16990 [19:46:22] merged [19:48:50] woo-hoo, thanks! [19:50:59] can someone run puppet on yttrium to make that change apply? ^^ [19:52:29] that doesn't let me in [19:53:41] i'm going off now [19:54:34] MaxSem: lemme check [19:57:07] MaxSem: Ok, its never been run before [19:57:13] so i am going ahead and getting it done now [19:57:39] move wiki test, move wiki test! [19:57:57] this is two seconds of work [19:58:01] moving test wiki sounds like more [19:58:24] i have too many other things that only i can work on right now ;] (im being nice to do this one!!) 
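Restating the reprepro exchange from 19:00-19:04, since the argument order was the whole problem: `reprepro copy` takes the destination distribution first, then the source, then the package names, and `clearvanished` removes the stale databases left behind when a distribution (karmic/oneiric here) is deleted from the config, after which `--ignore=undefinedtarget` is no longer needed.

```sh
# Destination first, then source -- the reversed order is what produced
# "Will not copy as not found: ganglia-logtailer" above.
reprepro copy precise-wikimedia lucid-wikimedia ganglia-logtailer

# Drop databases for distributions removed from the config (as was done on
# brewster at 19:04), so the --ignore=undefinedtarget workaround isn't needed.
reprepro --delete clearvanished
```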
[19:59:57] ok, puppet is updating, lessee what it does [20:05:31] New patchset: Jeremyb; "bug 38610 - ukwiktionary logo -> commons" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17263 [20:06:58] MaxSem: yea.... its a bunch of catalog runs behind, this will take a bit [20:07:06] RobH, thanks [20:09:47] package dependency issues [20:09:54] MaxSem: you guys have this setup working in labs? [20:10:01] with all of the stuff thats now included? [20:10:13] yes, on mobile-wlm [20:10:37] hrmm [20:10:43] i wonder whats causing this then [20:11:40] RobH, could this be because new labs instances aren't completely "naked"? [20:11:50] seems to be having an issue with libapache2-mod-php5 [20:12:10] well, new labs instance is idential to a new server instance except for its networking [20:12:15] was my understanding [20:12:28] so other than its not bare metal and its in a specific vlan (natted) is it [20:14:42] issue is where thats called, and a package conflict [20:15:51] * MaxSem tried removing that package and running puppet - it got reinstalled properly [20:16:58] heh, will do [20:17:09] just was reading to see exactly what it had issue with on old install of it [20:17:15] but i suppose if it doesnt do it again, who cares =] [20:17:22] I mean, that's how I verified this bit on Labs [20:18:25] damn [20:18:31] still throws issue [20:18:40] what is the error? [20:20:00] http://pastebin.com/y5DZKhZZ [20:20:03] output of run [20:22:54] New patchset: Jeremyb; "uawikimedia logo: update to match local override" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17330 [20:23:26] huh - was puppet able to install any other packages, or did it fail like that for all of them? [20:23:47] that was the only fail [20:23:53] but when it hits that, it halts [20:24:35] it looks like it failed on php5-mysql and libapache2-mod-php5 [20:24:54] hrm [20:25:13] is at apt-get update getting run before all of that? [20:26:16] it should, but i can trigger a manual one and try [20:28:09] no dice [20:28:11] still fails [20:28:16] :( [20:31:17] hrmm, im poking around to see if i can figure it out [20:31:31] but i also need to break and eat something, i realized a few minutes ago i have not eaten today [20:31:36] and its 430pm already =P [20:31:47] don't try to kill yourself! [20:31:57] go eat [20:32:21] surprised you haven't gotten a little twingy [20:32:22] MaxSem: systemuser's title is still not quoted [20:32:31] this is me trying to guilt some other op to foolishly volunteer to figure this out ;] [20:34:24] cmjohnson1: https://bugs.launchpad.net/ubuntu/+source/grub-installer/+bug/976027 [20:34:34] cmjohnson1: http://askubuntu.com/questions/143678/i-receive-the-error-grub-install-dev-sda-failed-while-attempting-to-install-u [20:35:55] jeremyb, there are lots of manifests written like that. one of them I used as an example. puppet parser validate doesn't seem to mind [20:36:39] MaxSem: i'm just saying if you're fixing some, why not fix them all. 
not that it's necessarily worth another gerrit change [20:37:10] I mean, yes it could use more polishing but we need that thing up and running by last week [20:38:57] jeremyb, you're overestimating my gerrit-fu :) I just take pieces of other manifests as examples and fix what I'm told to:) [20:41:37] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17263 [20:41:42] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17330 [20:42:22] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17158 [20:43:33] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17209 [20:44:12] cmjohnson1: i'm going to double check the partition, just in case.... [20:44:27] ok [20:45:06] it is using the db cfg [20:46:04] hehe it won't let me change the partitioning around with this [20:46:18] i'm guessing it's picking up the partman file every time it starts to go to the menu [20:50:27] PROBLEM - Host ms-be1009 is DOWN: PING CRITICAL - Packet loss = 100% [20:50:27] PROBLEM - Host ms-be1007 is DOWN: PING CRITICAL - Packet loss = 100% [20:50:27] PROBLEM - Host ms-be1011 is DOWN: PING CRITICAL - Packet loss = 100% [20:50:27] PROBLEM - Host ms-be1010 is DOWN: PING CRITICAL - Packet loss = 100% [20:50:27] PROBLEM - Host ms-be1003 is DOWN: PING CRITICAL - Packet loss = 100% [20:50:28] PROBLEM - Host ms-be1006 is DOWN: PING CRITICAL - Packet loss = 100% [20:50:28] PROBLEM - Host ms-be1001 is DOWN: PING CRITICAL - Packet loss = 100% [20:50:29] PROBLEM - Host ms-be1002 is DOWN: PING CRITICAL - Packet loss = 100% [20:50:36] PROBLEM - Host ms-fe1001 is DOWN: PING CRITICAL - Packet loss = 100% [20:51:30] RECOVERY - swift-object-updater on ms-be1002 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-updater [20:51:30] RECOVERY - swift-container-updater on ms-be1002 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-updater [20:51:30] RECOVERY - Host ms-be1006 is UP: PING OK - Packet loss = 0%, RTA = 35.47 ms [20:51:30] RECOVERY - Host ms-be1009 is UP: PING OK - Packet loss = 0%, RTA = 35.68 ms [20:51:30] RECOVERY - Host ms-be1011 is UP: PING OK - Packet loss = 0%, RTA = 35.42 ms [20:51:39] RECOVERY - Host ms-be1001 is UP: PING OK - Packet loss = 0%, RTA = 35.42 ms [20:51:39] RECOVERY - Host ms-be1002 is UP: PING OK - Packet loss = 0%, RTA = 35.39 ms [20:51:39] RECOVERY - Host ms-be1007 is UP: PING OK - Packet loss = 0%, RTA = 35.54 ms [20:51:57] RECOVERY - Host ms-be1010 is UP: PING OK - Packet loss = 0%, RTA = 35.43 ms [20:51:58] Are we going to have like 5 new extensions somewhat deployed this week then? :/ [20:51:59] ^^^ those are all in eqiad and it's ok. 
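For the yttrium package failures discussed above, the quickest way to see the real error is usually to take Puppet out of the picture and drive apt by hand. A rough debugging sequence on the affected host; the package names are the ones from the paste, and the apt and puppet commands are standard rather than anything specific to this manifest:

    # Refresh the package index first; a stale index is a common cause of
    # install failures that only show up through Puppet.
    apt-get update

    # Install the failing packages by hand to get apt's own dependency or
    # conflict message instead of Puppet's wrapped version of it.
    apt-get install libapache2-mod-php5 php5-mysql

    # Check which repository and version apt would pick, in case pinning or a
    # wikimedia-repo version is winning over the expected one.
    apt-cache policy libapache2-mod-php5 php5-mysql

    # Once the manual install succeeds (or the conflict is understood), re-run
    # the agent to confirm the catalog applies cleanly.
    puppet agent --test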
[20:52:03] rargh, wrong channel [20:52:15] PROBLEM - Host ms-fe1002 is DOWN: PING CRITICAL - Packet loss = 100% [20:52:24] RECOVERY - swift-object-updater on ms-be1001 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-updater [20:52:29] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/16875 [20:52:33] RECOVERY - swift-container-updater on ms-be1001 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-updater [20:53:09] RECOVERY - Memcached on ms-fe1001 is OK: TCP OK - 9.034 second response time on port 11211 [20:53:18] RECOVERY - Host ms-fe1001 is UP: PING OK - Packet loss = 0%, RTA = 35.58 ms [20:53:36] RECOVERY - Host ms-fe1002 is UP: PING OK - Packet loss = 0%, RTA = 35.40 ms [20:53:54] PROBLEM - Host ms-fe1003 is DOWN: PING CRITICAL - Packet loss = 100% [20:53:58] New review: Jeremyb; "of course this was actually ukwikiquote not ukwiktionary. patch was fine, only commit msg was wrong" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17263 [20:54:21] PROBLEM - Host ms-fe1004 is DOWN: PING CRITICAL - Packet loss = 100% [20:54:48] RECOVERY - Memcached on ms-fe1003 is OK: TCP OK - 0.035 second response time on port 11211 [20:54:57] RECOVERY - Host ms-fe1003 is UP: PING OK - Packet loss = 0%, RTA = 35.51 ms [20:55:06] RECOVERY - Memcached on ms-fe1004 is OK: TCP OK - 0.035 second response time on port 11211 [20:55:15] RECOVERY - Host ms-fe1004 is UP: PING OK - Packet loss = 0%, RTA = 35.39 ms [20:55:42] RECOVERY - Host ms-be1003 is UP: PING OK - Packet loss = 0%, RTA = 35.44 ms [20:55:58] New patchset: Bhartshorne; "removing swiftcleaner's check against ms5 since it's now out of the lop." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17334 [20:56:08] MaxSem, RobH i can't replicate the problem in a labs instance, puppet seems to run just fine and properly install the packages [20:56:31] my food delivery just got here, taking a break to eat =] [20:56:39] New review: gerrit2; "Lint check passed." 
[operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17334 [20:56:40] no worries [20:56:48] Change merged: Bhartshorne; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17334 [20:56:51] enjoy your meal RobH [20:58:42] PROBLEM - SSH on ms-be1003 is CRITICAL: Connection refused [20:58:51] PROBLEM - swift-container-updater on ms-be1003 is CRITICAL: Connection refused by host [20:59:00] PROBLEM - swift-object-updater on ms-be1003 is CRITICAL: Connection refused by host [20:59:00] PROBLEM - swift-account-auditor on ms-be1003 is CRITICAL: Connection refused by host [20:59:00] PROBLEM - swift-object-auditor on ms-be1003 is CRITICAL: Connection refused by host [20:59:18] PROBLEM - swift-account-server on ms-be1003 is CRITICAL: Connection refused by host [20:59:18] PROBLEM - swift-container-auditor on ms-be1003 is CRITICAL: Connection refused by host [20:59:18] PROBLEM - swift-container-replicator on ms-be1003 is CRITICAL: Connection refused by host [20:59:36] PROBLEM - swift-object-replicator on ms-be1003 is CRITICAL: Connection refused by host [20:59:54] PROBLEM - swift-object-server on ms-be1003 is CRITICAL: Connection refused by host [20:59:54] PROBLEM - swift-container-server on ms-be1003 is CRITICAL: Connection refused by host [20:59:54] PROBLEM - swift-account-reaper on ms-be1003 is CRITICAL: Connection refused by host [21:06:55] binasher: so, there's turning up the mc ports, however we need to figure out a good way to ensure that the secondary interfaces have the right ip's and default routes [21:07:31] binasher: some way that's automated, don't want to have to do everything by hand by default [21:07:51] but, still has puppet know which machine is which ip, and nagios freaks out if 1 machine has two ip's... [21:09:20] what nagios check freaks out? [21:10:22] New patchset: Aaron Schulz; "Configured $wgTimedTextForeignNamespaces for commons." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17337 [21:10:32] Change merged: Aaron Schulz; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17337 [21:10:42] nagios itself, the hosts file [21:10:52] RobH - do you know which 10g cards we have ? [21:11:00] is it in a PO perhaps ? [21:11:09] supposedly a lot have pxe support - http://www.dell.com/ed/business/p/cna-network-interconnects/product-compare [21:13:07] lesliecarr: rt 2882 [21:13:30] thanks chris :) [21:15:11] ok, supposedy has pxe support [21:15:35] now to see if there's any docs on how to make it pxe boot... [21:18:37] grrr dell's website sorta sucks [21:18:57] RobH/cmjohnson1 is there a phone line with tech support that's not for your windows desktop ? [21:19:12] yep... [21:19:38] 800-456-3355 [21:19:42] dell enterprise support 1800-945-3355 [21:19:47] then they ask the service tag of the system [21:19:52] or that to ^ [21:19:52] which you can pull from racktables [21:20:07] cmjohnson1: huh, i didnt have that one [21:20:08] noted [21:20:45] cool [21:21:07] cmjohnson1: so does a person pickup on that one? on the one i gave you have to say hardware, then support [21:21:09] i wonder what 3355 spells [21:21:10] blah blah [21:21:11] ;) [21:21:41] not directly...but you only have to say or press 2 to get to a person [21:21:48] I really hate ringing dell, which I might have to do tomorrow :( Damn CMC software issues [21:21:53] i'm going to try calling them [21:22:00] thank god the whiskey is only 10 feet away [21:23:10] is the SN the same as the service tag ? [21:23:18] yep [21:23:23] cool [21:25:25] lesliecarr: santa? 
really!?! [21:25:41] cmjohnson1: :p i couldn't think of anything [21:26:29] i wasn't in the military [21:26:55] hehe looked it up [21:28:53] New patchset: Pyoungmeister; "mediawiki and application server modules." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17342 [21:29:34] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17342 [21:57:13] PROBLEM - Puppet freshness on analytics1006 is CRITICAL: Puppet has not run in the last 10 hours [21:59:18] PROBLEM - Puppet freshness on cp1032 is CRITICAL: Puppet has not run in the last 10 hours [22:01:11] New patchset: Aaron Schulz; "Switched testwikis to multiwrite backend." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17348 [22:02:47] Change merged: Aaron Schulz; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17348 [22:05:13] RobH did you have a chance to poke that weird puppet stuff yet? [22:30:09] New patchset: Aaron Schulz; "Made mw.org use the multiwrite backend." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17353 [22:30:32] Change merged: Aaron Schulz; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17353 [22:33:19] heh [22:33:28] might be good to ask someone else [22:33:41] I'm focusing completely on development for the next week or so [22:39:20] PROBLEM - Puppet freshness on srv281 is CRITICAL: Puppet has not run in the last 10 hours [22:42:09] j^: if you're struggling to find an ops person to help, if you ask woosters nicely he may be able to find you someone [22:43:05] I'm here but I have no idea what needs to be done [22:43:10] and I have no access to ms7 afaik [22:43:22] ms7 is about to be killed and does not run puppet [22:43:36] so I never complained for not having access [22:43:43] eep, not have keys either i bet [22:43:48] (hating solaris might have something to do with that too) [22:43:59] oh i have access [22:44:01] do we know who does have access? [22:44:03] ewww 2005 [22:44:08] ok i have no idea what to do [22:44:14] it's sunos 5.10 [22:44:18] Ariel would possibly be the best bet [22:44:25] uptime 919 days, that's impressive [22:44:28] yes, apergos would definitely know [22:44:28] But is seemingly MIA... [22:44:37] we need to add a mime type [22:44:38] omfg and someone has been logged in via the console for years [22:44:54] tell me that's a figure of speech :) [22:44:56] can we confrim what web server is running there? [22:44:59] no it's not [22:45:10] gah [22:45:15] i don't know how to ps aux [22:45:18] I'm sure it's got known uptime related bugs [22:45:19] i hate you solaris [22:45:25] Reedy: it's solaris [22:45:28] LeslieCarr: ps -ef [22:45:32] thanks [22:45:35] so, I'm in ms5 [22:45:46] looking. [22:45:49] webservd [22:45:53] hahahahaha [22:46:02] Server: Sun-Java-System-Web-Server/7.0 [22:46:06] maplebed: what's the timeline with ms5/ms7? [22:46:12] next two weeks. [22:46:24] for both of them? [22:46:40] Party time [22:46:43] ms5 isn't relevant for the mime type thing though, only ms7. [22:46:54] (it's not a thumbnail getting served) [22:47:07] ah, and I was about to fix that [22:47:08] dammit :P [22:47:09] is there a obj.conf file? [22:47:11] it's runnign under the apache user ? [22:47:41] using find..... [22:48:25] no obj.conf in /etc [22:49:02] #--Sun Microsystems Inc. MIME Information [22:49:02] # Do not delete the above line. It is used to identify the file type [22:49:04] lol. 
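For anyone retracing the ms7 detective work above, identifying an unfamiliar web server comes down to the Server response header from outside plus the process list and SMF on the box itself. A short sketch of those checks; the URL is illustrative, while the process name and service instance are the ones that actually turned up here:

    # From anywhere: the Server header names the software.
    curl -sI http://upload.wikimedia.org/ | grep -i '^Server:'
    #   Server: Sun-Java-System-Web-Server/7.0

    # On Solaris it's ps -ef rather than ps aux; the Sun web server shows up
    # as webservd.
    ps -ef | grep webservd

    # The server runs under SMF, so the service instance (and when it last
    # changed state) is visible with svcs.
    svcs | grep http
    #   online  2010  svc:/network/http:https-ms7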
[22:49:34] you can test that it "worked" with: [22:49:34] curl -I "http://upload.wikimedia.org/wikipedia/test2/7/7c/1_b0q4jyja.webm" [22:49:38] I got in [22:49:48] root@ms7 # cat /etc/mime.types [22:49:48] should have content type video/webm instead of Content-Type: text/plain [22:49:48] #--Netscape Communications Corporation MIME Information [22:49:48] #Do not delete the above line. It is used to identify the file type. [22:49:49] New patchset: Pyoungmeister; "moving db role classes to role/db.pp." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17355 [22:49:51] #mime types added by Netscape Helper [22:49:51] that's about it [22:49:53] type=application/x-java-jnlp-file desc="Java Web Start" exts="jnlp" [22:49:55] hrm [22:49:57] I doubt that's in use :) [22:49:58] yeah [22:50:01] same thing here [22:50:21] ah, found it [22:50:22] paravoid: http://docs.oracle.com/cd/E19528-01/819-2630/gaieg/index.html ? [22:50:27] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17355 [22:50:36] it's in /opt/webserver7/https-ms7/config [22:50:47] you wouldn't find https-ms7 in any docs :) [22:51:03] oh yay [22:51:03] i wonder why https not http [22:51:30] oh come on, https-ms7 is so the international standard [22:51:44] jeremyb: it's the http service. [22:52:05] (abbreviated, of course, https) [22:52:20] seriously?!!! [22:52:22] so [22:52:25] Awesomes [22:52:32] let's see how to reload that now [22:52:35] haha [22:52:46] reboot? [22:52:47] you might reload it… or you might take ms7 down and it will never work [22:52:48] ;) [22:52:48] good luck [22:52:49] haha [22:52:56] New patchset: Asher; "run query analysis if a cluster is defined, vs. if in tampa" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17356 [22:53:00] nah, it's probably going to be svcadm [22:53:00] now i'm giggling damnit [22:53:21] online 2010 svc:/network/http:https-ms7 [22:53:34] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17356 [22:53:55] for you following at home [22:54:00] 2010 is when this state was last changed [22:54:07] i.e. when the webserver was last reloaded [22:55:43] does it have any fail over? [22:56:02] nope [22:56:11] as we said before, it's to be replaced within the two weeks [22:57:54] maplebed: ready to switch it over tonight? ;) [22:58:02] nope. [22:58:11] content will be lost if we're forced to switch now. [22:58:43] paravoid: LeslieCarr: at least you can say you did some solaris admin stuff for the WMF now ;) [22:59:19] to be clear though, to move content to swift requires nfs, not http. [22:59:33] hehe [22:59:49] maplebed: are the kitten pictures safe ? [22:59:52] that's the important part [23:00:12] heh... I've got at least 5 backed up on my laptop. [23:00:44] You need over 9000. srsly. [23:02:10] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17356 [23:02:22] PROBLEM - swift-account-replicator on ms-be1008 is CRITICAL: PROCS CRITICAL: 0 processes with regex args ^/usr/bin/python /usr/bin/swift-account-replicator [23:02:24] yes [23:02:41] I mean, we can troll cuteoverload but their pics may not be cc-by-sa [23:03:49] LeslieCarr: here ya go: http://omgcatsinspace.tumblr.com/ [23:04:02] omg !!!! [23:04:12] ok, i'm done with work for the day [23:04:18] i'm going to have a cat in space seizure [23:04:47] :) [23:04:52] j^: how important is it to happen now? 
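Putting the pieces above together, the fix itself is small: one line in the instance's mime.types and a reload. This is only a sketch; the exts syntax is modelled on the existing jnlp entry, and whether a refresh is enough or a full restart is needed was never tested here, which is part of the reluctance to touch a box with 900-plus days of uptime:

    # Add the webm type to the instance MIME map found under
    # /opt/webserver7/https-ms7/config, using the same type=.../exts=... form
    # as the jnlp entry already in the file.
    echo 'type=video/webm exts=webm' >> /opt/webserver7/https-ms7/config/mime.types

    # Reload through SMF; refresh is the gentler option, restart the heavier one.
    svcadm refresh svc:/network/http:https-ms7
    # svcadm restart svc:/network/http:https-ms7

    # Verify: the header should now report video/webm instead of text/plain.
    curl -I "http://upload.wikimedia.org/wikipedia/test2/7/7c/1_b0q4jyja.webm"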
[23:05:04] j^: I'd prefer to stick on the safe side and wait for apergos in about 7 hours [23:05:28] if it's imperative to happen now I can risk it and run a restart [23:06:03] I just feel a bit uneasy restarting a SPOF webserver that hasn't been restarted for 900 days [23:06:10] oh I didn't say wait for the swift migration [23:06:26] yes [23:06:28] that makes sense [23:06:39] If it's a SPOF webserver it can't be /that/ important :D [23:06:40] Ariel/apergos, a member of our time is really this box's expert [23:06:42] wait for swift .. no sense in touching something that has not been touched for close to 3 years [23:06:55] Damianz: um, have you seen our infrastructure? ;) [23:07:05] Oh, deploy and run? [23:07:08] he'll probably know what to do tomorrow [23:07:15] still with the solaris box [23:07:51] j^: sorry for this, this is at least the second time I'm blocking you :) [23:08:06] New patchset: Pyoungmeister; "addin an es_eqiad nagios group" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17358 [23:08:07] cat is safe: http://tstarling.com/stuff/wall_cat/ [23:08:45] New review: gerrit2; "Lint check passed." [operations/puppet] (production); V: 1 - https://gerrit.wikimedia.org/r/17358 [23:10:43] how did the cat get into the wall ? [23:10:46] it looks pretty fat [23:11:31] PROBLEM - Apache HTTP on srv234 is CRITICAL: HTTP CRITICAL: HTTP/1.0 500 Internal Server Error [23:11:31] poor cat [23:11:49] PROBLEM - Apache HTTP on srv236 is CRITICAL: HTTP CRITICAL: HTTP/1.0 500 Internal Server Error [23:11:57] j^: I replied to that mail. Ariel usually comes early in our morning, which is in about 7 hours [23:12:20] the house is split level, with two stories on one side and one in the middle on the other [23:12:29] that's the floor level of the upper storey [23:12:54] she got in under the floor via the attic, where there are gaps in the floor [23:14:03] New review: Pyoungmeister; "asher gave thumbs up." [operations/puppet] (production); V: 0 C: 2; - https://gerrit.wikimedia.org/r/17355 [23:14:03] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17355 [23:14:25] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17358 [23:15:27] TimStarling, unbelievable :O [23:16:49] New patchset: Asher; "include misc servers in ishmael" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17359 [23:17:27] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/17359 [23:23:13] PROBLEM - Puppet freshness on neon is CRITICAL: Puppet has not run in the last 10 hours [23:26:23] cmjohnson1 / RobH https://rt.wikimedia.org/Ticket/Display.html?id=3362 [23:27:16] !log rebuilding wikitech-l archives [23:27:24] Logged the message, Master [23:39:54] New patchset: Catrope; "(bug 38903) Enable SubPageList3 on cswiktionary" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17362 [23:40:30] we <3 RoanKattouw [23:41:21] I thought Danny_B|backup might be involved there ;) [23:41:45] obviously since i'm reporter öf it ;-) [23:42:04] I was meaning cs in the dbname :p [23:42:45] i see [23:42:52] Reedy: can you cherrypick that? 
[23:43:03] I don't need to [23:43:06] just review and push it [23:43:24] whatever needed to make it live ;-) [23:43:28] RECOVERY - mysqld processes on db63 is OK: PROCS OK: 1 process with command name mysqld [23:46:51] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/17362 [23:47:04] PROBLEM - MySQL Replication Heartbeat on db63 is CRITICAL: CRIT replication delay 7290 seconds [23:47:22] PROBLEM - MySQL Slave Delay on db63 is CRITICAL: CRIT replication delay 7274 seconds [23:47:50] PROBLEM - Puppet freshness on ocg3 is CRITICAL: Puppet has not run in the last 10 hours [23:55:01] PROBLEM - Misc_Db_Lag on db10 is CRITICAL: CHECK MySQL REPLICATION - lag - CRITICAL - Seconds_Behind_Master : 605s [23:57:47] hrmmmmmm [23:57:50] replag [23:58:03] @replag [23:58:05] Nemo_bis: No replag currently. See also "replag all". [23:58:10] It's not cluster replag [23:58:11] Nemo_bis: it's off cluster [23:58:12] ah right [23:58:21] well, "cluster" as in "wiki cluster" [23:58:25] mpf so quick at correcting me [23:58:42] Though, I thought most things had been moved from db9/db10 already? [23:58:42] dbbot-wm: you could bemore zealous anyway, don't stare me in that way [23:58:49] just pls ping me when it's live, thank you guys [23:59:07] oops [23:59:10] I forgot to press enter [23:59:26] done [23:59:31] RECOVERY - Apache HTTP on srv234 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.047 second response time [23:59:31] PROBLEM - Misc_Db_Lag on db10 is CRITICAL: CHECK MySQL REPLICATION - lag - CRITICAL - Seconds_Behind_Master : 608s [23:59:56] hah [23:59:57] Reedy: :-* [23:59:58] RECOVERY - Apache HTTP on srv236 is OK: HTTP OK - HTTP/1.1 301 Moved Permanently - 0.035 second response time
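On the cherry-pick question at the end: reviewing and submitting the change in Gerrit was the simpler route, but for completeness, pulling an open change into a local checkout looks roughly like this. The refs/changes layout is standard Gerrit; the patchset number and the exact clone URL are assumptions for illustration, not taken from the log:

    # Gerrit publishes each patchset at refs/changes/<last two digits of the
    # change>/<change number>/<patchset>, so for change 17362, patchset 1:
    git fetch https://gerrit.wikimedia.org/r/operations/mediawiki-config refs/changes/62/17362/1
    git cherry-pick FETCH_HEAD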
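The db10 and db63 lag alerts in the last few minutes all key off a single field on the replica. Checking it by hand is a one-liner; this is generic MySQL, not the exact Nagios plugin invocation used here:

    # Seconds_Behind_Master is what Misc_Db_Lag and MySQL Slave Delay alarm on;
    # NULL means replication is not running at all, so the thread states are
    # worth grabbing in the same pass.
    mysql -e 'SHOW SLAVE STATUS\G' | grep -E 'Seconds_Behind_Master|Slave_(IO|SQL)_Running'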