[00:00:35] PROBLEM - HTTP radosgw on ms-fe1004 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:01:14] PROBLEM - HTTP radosgw on ms-fe1002 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:02:18] Error 400 on SERVER: Duplicate definition: Exec[apt-update-for-${name}] is already defined in file /var/lib/git/operations/puppet/modules/apt/manifests/repository.pp at line 59; cannot redefine at /var/lib/git/operations/puppet/modules/apt/manifests/repository.pp:59 on node virt0.wikimedia.org [00:02:19] * Ryan_Lane sighs [00:02:46] Yeah, that is the single worst thing about puppet. It should accept multiple non-conflicting definitions. [00:03:07] well, that one specifically is because single quotes were used [00:03:15] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 00:03:09 UTC 2013 [00:04:14] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [00:04:30] New patchset: Ryan Lane; "Use double quotes where dereferencing is needed" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74302 [00:05:12] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74302 [00:06:36] PROBLEM - HTTP radosgw on ms-fe1001 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:06:46] PROBLEM - LVS HTTP IPv4 on ms-fe.eqiad.wmnet is CRITICAL: CRITICAL - Socket timeout after 10 seconds [00:22:18] New review: Catrope; "Looks good to me. Scheduling this for deployment in tomorrow's lightning deploy window (23 hours fro..." [operations/mediawiki-config] (master) C: 1; - https://gerrit.wikimedia.org/r/72356 [00:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 00:24:49 UTC 2013 [00:25:26] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [00:28:06] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 00:27:58 UTC 2013 [00:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [00:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 00:28:54 UTC 2013 [00:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [00:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 00:32:45 UTC 2013 [00:33:16] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [00:45:36] PROBLEM - Disk space on ms-be1003 is CRITICAL: DISK CRITICAL - free space: / 5666 MB (3% inode=98%): [00:53:16] New patchset: TTO; "(bug 51328 and bug 51232) add autopatrolled and uploader groups to ckbwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74308 [00:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 00:54:45 UTC 2013 [00:55:26] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [00:55:56] PROBLEM - Puppet freshness on analytics1019 is CRITICAL: No successful Puppet run in the last 10 hours [00:57:36] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 00:57:33 UTC 2013 [00:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [00:58:46] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 00:58:45 UTC 2013 [00:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [00:59:56] PROBLEM - Puppet freshness on analytics1018 is CRITICAL: No successful Puppet run in the last 10 hours [01:00:56] PROBLEM - Puppet freshness on analytics1020 is CRITICAL: No successful Puppet run in the last 10 hours [01:02:36] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 01:02:35 UTC 2013 [01:03:16] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [01:25:00] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 01:24:53 UTC 2013 [01:25:30] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [01:27:50] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 01:27:45 UTC 2013 [01:28:20] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [01:29:00] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 01:28:55 UTC 2013 [01:29:10] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [01:32:41] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [01:33:30] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 01:33:28 UTC 2013 [01:33:30] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.127 second response time [01:34:10] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [01:35:30] PROBLEM - Disk space on ms-be1005 is CRITICAL: DISK CRITICAL - free space: / 5690 MB (3% inode=97%): [01:36:00] PROBLEM - Puppet freshness on erzurumi is CRITICAL: No successful Puppet run in the last 10 hours [01:36:00] PROBLEM - Puppet freshness on lvs1004 is CRITICAL: No successful Puppet run in the last 10 hours [01:36:00] PROBLEM - Puppet freshness on lvs1005 is CRITICAL: No successful Puppet run in the last 10 hours [01:36:00] PROBLEM - Puppet freshness on lvs1006 is CRITICAL: No successful Puppet run in the last 10 hours [01:36:00] PROBLEM - Puppet freshness on virt1 is CRITICAL: No successful Puppet run in the last 10 hours [01:36:00] PROBLEM - Puppet freshness on virt3 is CRITICAL: No successful Puppet run in the last 10 hours [01:36:00] PROBLEM - Puppet freshness on virt4 is CRITICAL: No successful Puppet run in the last 10 hours [01:54:50] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 01:54:43 UTC 2013 [01:55:30] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [01:57:40] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 01:57:38 UTC 2013 [01:58:20] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [01:58:50] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 01:58:44 UTC 2013 [01:59:10] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [02:02:50] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 02:02:43 UTC 2013 [02:03:10] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [02:15:04] !log LocalisationUpdate completed (1.22wmf10) at Thu Jul 18 02:15:04 UTC 2013 [02:15:16] Logged the message, Master [02:24:57] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 02:24:51 UTC 2013 [02:25:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [02:28:17] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 02:28:07 UTC 2013 [02:28:27] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [02:28:28] !log LocalisationUpdate completed (1.22wmf9) at Thu Jul 18 02:28:28 UTC 2013 [02:28:39] Logged the message, Master [02:28:57] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 02:28:48 UTC 2013 [02:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [02:33:17] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 02:33:13 UTC 2013 [02:34:17] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [02:43:17] !log LocalisationUpdate ResourceLoader cache refresh completed at Thu Jul 18 02:43:17 UTC 2013 [02:43:29] Logged the message, Master [02:54:57] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 02:54:47 UTC 2013 [02:55:27] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [02:57:47] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 02:57:46 UTC 2013 [02:58:27] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [02:58:27] PROBLEM - HTTP Apache on ms-fe1002 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [02:58:57] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 02:58:47 UTC 2013 [02:59:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [03:02:47] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 03:02:46 UTC 2013 [03:03:17] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [03:04:27] PROBLEM - HTTP Apache on ms-fe1001 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:24:53] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 03:24:49 UTC 2013 [03:25:33] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [03:27:09] New patchset: Ryan Lane; "Explicitly disable php for uploads directory" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74319 [03:27:43] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 03:27:38 UTC 2013 [03:28:03] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74319 [03:28:23] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [03:28:53] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 03:28:49 UTC 2013 [03:29:13] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [03:33:03] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 03:32:54 UTC 2013 [03:33:13] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [03:35:43] PROBLEM - HTTP Apache on ms-fe1003 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:37:43] PROBLEM - HTTP Apache on ms-fe1004 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:40:52] New patchset: Ryan Lane; "Bind memcached to 127.0.0.1 on openstackmanager nodes" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74321 [03:45:00] Change merged: Ryan Lane; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74321 [03:54:13] PROBLEM - Memcached on virt0 is CRITICAL: Connection refused [03:54:53] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 03:54:46 UTC 2013 [03:55:33] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [03:58:13] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 03:58:12 UTC 2013 [03:58:23] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [03:58:53] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 03:58:52 UTC 2013 [03:59:13] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [04:03:23] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 04:03:17 UTC 2013 [04:04:13] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [04:07:24] PROBLEM - LVS HTTP IPv4 on ms-fe.eqiad.wmnet is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:25:04] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 04:24:59 UTC 2013 [04:25:34] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [04:27:44] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 04:27:41 UTC 2013 [04:28:24] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [04:28:54] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 04:28:49 UTC 2013 [04:29:14] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [04:36:01] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 04:35:47 UTC 2013 [04:36:14] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [04:36:15] New patchset: Ori.livneh; "Expand documentation of EventLogging module" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74322 [04:54:54] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 04:54:47 UTC 2013 [04:55:34] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [04:57:44] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 04:57:34 UTC 2013 [04:58:24] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [04:58:54] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 04:58:45 UTC 2013 [04:59:14] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [05:02:54] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 05:02:48 UTC 2013 [05:03:14] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [05:06:19] RECOVERY - HTTP Apache on ms-fe1001 is OK: HTTP OK: HTTP/1.1 200 OK - 229 bytes in 3.065 second response time [05:06:19] RECOVERY - HTTP Apache on ms-fe1003 is OK: HTTP OK: HTTP/1.1 200 OK - 229 bytes in 6.342 second response time [05:06:19] RECOVERY - HTTP Apache on ms-fe1002 is OK: HTTP OK: HTTP/1.1 200 OK - 229 bytes in 0.001 second response time [05:06:19] RECOVERY - HTTP radosgw on ms-fe1003 is OK: HTTP OK: HTTP/1.1 200 OK - 311 bytes in 0.004 second response time [05:06:19] RECOVERY - HTTP radosgw on ms-fe1004 is OK: HTTP OK: HTTP/1.1 200 OK - 311 bytes in 0.003 second response time [05:06:39] RECOVERY - HTTP radosgw on ms-fe1001 is OK: HTTP OK: HTTP/1.1 200 OK - 311 bytes in 1.839 second response time [05:07:09] RECOVERY - HTTP Apache on ms-fe1004 is OK: HTTP OK: HTTP/1.1 200 OK - 229 bytes in 0.001 second response time [05:07:09] RECOVERY - LVS HTTP IPv4 on ms-fe.eqiad.wmnet is OK: HTTP OK: HTTP/1.1 200 OK - 311 bytes in 0.004 second response time [05:07:12] RECOVERY - HTTP radosgw on ms-fe1002 is OK: HTTP OK: HTTP/1.1 200 OK - 311 bytes in 0.004 second response time [05:07:35] ignore that [05:07:37] that would be me [05:07:44] I should really make this non-paging again... [05:10:48] New patchset: Faidon; "Remove paging ms-fe.eqiad.wmnet check" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74324 [05:11:43] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74324 [05:14:39] RECOVERY - Disk space on ms-be1003 is OK: DISK OK [05:17:29] RECOVERY - Disk space on ms-be1001 is OK: DISK OK [05:17:39] RECOVERY - Disk space on ms-be1005 is OK: DISK OK [05:24:59] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 05:24:55 UTC 2013 [05:25:39] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [05:27:39] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 05:27:36 UTC 2013 [05:28:19] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [05:28:49] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 05:28:47 UTC 2013 [05:29:19] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [05:33:09] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 05:33:01 UTC 2013 [05:33:09] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [05:54:59] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 05:54:49 UTC 2013 [05:55:39] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [05:58:09] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 05:58:07 UTC 2013 [05:58:19] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [05:58:49] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 05:58:48 UTC 2013 [05:59:19] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [06:02:59] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 06:02:56 UTC 2013 [06:03:09] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [06:17:19] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74322 [06:20:08] thanks paravoid [06:25:01] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 06:24:52 UTC 2013 [06:25:31] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [06:26:21] PROBLEM - Disk space on ms-be1002 is CRITICAL: DISK CRITICAL - free space: / 5660 MB (3% inode=98%): [06:28:01] PROBLEM - Puppet freshness on manutius is CRITICAL: No successful Puppet run in the last 10 hours [06:28:51] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 06:28:48 UTC 2013 [06:29:01] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 06:28:58 UTC 2013 [06:29:21] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [06:29:21] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [06:30:21] RECOVERY - Disk space on ms-be1002 is OK: DISK OK [06:32:41] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 06:32:39 UTC 2013 [06:33:11] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [06:55:01] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 06:54:51 UTC 2013 [06:55:31] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [06:57:41] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 06:57:39 UTC 2013 [06:58:21] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [06:58:51] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 06:58:50 UTC 2013 [06:59:21] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [07:02:41] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 07:02:39 UTC 2013 [07:03:11] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [07:25:00] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 07:24:50 UTC 2013 [07:25:30] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [07:27:40] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 07:27:38 UTC 2013 [07:28:20] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [07:28:50] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 07:28:49 UTC 2013 [07:29:20] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [07:31:40] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:32:30] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.143 second response time [07:33:40] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 07:33:30 UTC 2013 [07:34:10] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [07:54:50] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 07:54:47 UTC 2013 [07:55:30] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [07:56:40] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:57:30] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.134 second response time [07:58:20] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 07:58:17 UTC 2013 [07:58:20] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [07:59:00] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 07:58:54 UTC 2013 [07:59:20] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [08:03:20] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 08:03:10 UTC 2013 [08:04:10] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [08:24:57] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 08:24:54 UTC 2013 [08:25:37] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [08:29:07] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 08:28:59 UTC 2013 [08:29:07] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 08:28:59 UTC 2013 [08:29:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [08:29:27] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [08:32:47] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 08:32:41 UTC 2013 [08:33:07] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [08:54:57] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 08:54:47 UTC 2013 [08:55:37] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [08:56:37] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [08:57:28] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.124 second response time [08:57:47] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 08:57:40 UTC 2013 [08:58:27] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [08:58:57] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 08:58:47 UTC 2013 [08:59:17] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [09:02:47] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 09:02:41 UTC 2013 [09:03:07] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [09:23:36] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [09:24:27] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.124 second response time [09:25:06] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 09:24:59 UTC 2013 [09:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [09:28:06] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 09:28:02 UTC 2013 [09:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [09:29:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 09:29:48 UTC 2013 [09:30:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [09:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 09:32:42 UTC 2013 [09:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [09:46:49] New patchset: Ori.livneh; "Clean-up: port 'analysis' class to a role class." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74334 [09:46:49] New patchset: Ori.livneh; "Add eventlogging::plugin custom resource type" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74335 [09:49:48] New patchset: Mark Bergsma; "Factor out LVS realserver ip include, add text-varnish IPs" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74336 [11:38:26] PROBLEM - SSH on searchidx1001 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:54:01] probably we can't deploy traffic today [11:54:07] ok [11:54:08] and on friday that usually doesn't make people happy [11:54:13] yeah [11:54:20] so perhaps early next week we'll do that [11:54:28] hopefully they notice stuff gets better and faster :) [11:54:37] ok [11:54:37] hehe while it lasts [11:54:43] while wikidata is the only one on those varnish servers ;) [11:54:49] ah, ok [11:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 11:54:48 UTC 2013 [11:55:19] wikidata will have 6.4 TB of cache space available in eqiad alone [11:55:28] but if it goes well, the other wikis will follow soon ;) [11:55:31] :D [11:55:59] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [11:56:49] I guess, just let the community know now that we're gonna do this [11:56:54] that it will likely happen early next week, we'll post a new notice when we're switching traffic [11:56:54] and to let us know of any problems that may be related? [11:56:54] ok [11:57:36] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 11:57:32 UTC 2013 [11:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [11:59:14] ok, the PURGE hit/miss ratio patch compiles [11:59:18] I guess I can roll it out ;) [11:59:48] EVERYWHERE! [11:59:50] :) [12:01:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 12:01:53 UTC 2013 [12:02:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [12:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 12:02:39 UTC 2013 [12:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [12:04:45] something's fucked hehe [12:04:52] the init script doesn't finish starting varnish, [12:04:59] and both varnish and vhtcpd are consuming 100% cpu [12:07:02] perhaps that was just vhtcpd backlog processing [12:07:59] because it seems to work [12:12:54] deployed it on cp1046 (mobile cache) now [12:15:02] 34% cache ratio? [12:15:07] that can't be right [12:16:59] it was around 20 [12:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 12:24:49 UTC 2013 [12:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [12:27:27] !log Inserted varnish 3.0.3plus~rc1-wm14 packages into the precise-wikimedia APT repository [12:27:36] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 12:27:32 UTC 2013 [12:27:38] Logged the message, Master [12:28:19] New patchset: Mark Bergsma; "varnish (3.0.3plus~rc1-wm14) precise; urgency=low" [operations/debs/varnish] (testing/3.0.3plus-rc1) - https://gerrit.wikimedia.org/r/74354 [12:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [12:28:52] Change merged: Mark Bergsma; [operations/debs/varnish] (testing/3.0.3plus-rc1) - https://gerrit.wikimedia.org/r/74344 [12:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 12:28:53 UTC 2013 [12:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [12:29:51] wouldn't it be interesting to have counters for Vary based hits/misses? [12:32:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [12:32:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 12:32:47 UTC 2013 [12:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [12:33:36] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.125 second response time [12:36:27] /* No Vary: header, no worries */ [12:36:31] excellent comment [12:37:12] lol [12:39:23] New patchset: Hashar; "beta: text cache in $wgSquidServerNoPurge" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74355 [12:39:54] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74355 [12:55:06] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 12:55:00 UTC 2013 [12:55:37] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [12:58:26] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 12:58:19 UTC 2013 [12:58:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [12:59:26] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 12:59:16 UTC 2013 [12:59:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [12:59:36] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.147 second response time [13:00:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [13:02:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 13:02:45 UTC 2013 [13:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [13:07:16] New review: Hashar; "Also need to add DataTypes in the wmf branches." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74291 [13:09:17] New review: Aude; "needs testing. " [operations/mediawiki-config] (master) C: -1; - https://gerrit.wikimedia.org/r/74291 [13:21:42] wow [13:21:46] while reading the code [13:21:54] varnish is doing a lot of unnecessary work just to purge objects [13:22:14] it's going through all variants and expired objects and all that to find the best one [13:22:20] only to purge them all later [13:24:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 13:24:44 UTC 2013 [13:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [13:26:32] New patchset: Ottomata; "Granting Christian access to privatedata on stat1002. RT 5467" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74359 [13:28:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 13:28:08 UTC 2013 [13:28:23] New patchset: Ottomata; "Granting Christian access to privatedata on stat1002. RT 5467" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74359 [13:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [13:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 13:28:54 UTC 2013 [13:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [13:29:18] Change merged: Ottomata; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74359 [13:30:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [13:31:36] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 5.030 second response time [13:33:16] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 13:33:12 UTC 2013 [13:34:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [13:37:05] paravoid: hi! have you done the puppet glue to get gdnsd installed on jenkins servers? :) [13:38:53] not yet, sorry [13:43:12] https://integration.wikimedia.org/ci/job/operations-puppet-pep8/3693/violations/file/modules/authdns/files/authdns-gen-zones.py/? [13:43:15] stupid pep8 [13:43:20] :D [13:44:11] the suggestion is completely stupid [13:44:31] oh that one [13:46:39] hey ^demon - I keep missing you on irc. we should talk about merging CirrusSearch's elasticsearch stuff at some point. [13:47:25] <^demon> Yeah. I was going to merge yesterday but I saw that the submodule for Elastica was causing problems with jenkins. [13:47:51] ah! now it all makes sese. [13:48:14] so you think it'd be better to move that submodule to gerrit? [13:48:33] also, are you still jet lagged? it is super early there [13:48:50] <^demon> No, no, been here a week (and I drove, so no jets). [13:48:59] <^demon> cmjohnson1 and I have some work to get done early today :) [13:49:04] car lagging .. [13:49:20] what is the issue with the submodules in jenkins ? [13:49:28] the job might not be properly configured [13:49:44] <^demon> https://integration.wikimedia.org/ci/job/mwext-CirrusSearch-lint/45/console - submodule tries to clone from on-disk, but it's a github submodule. [13:50:28] driving sounds like it'd be equal parts fun and torture [13:50:32] New patchset: Petr Onderka; "reading dump; command-line parameters" [operations/dumps/incremental] (gsoc) - https://gerrit.wikimedia.org/r/74361 [13:50:33] New patchset: Aude; "Update and cleanup settings for Wikidata" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [13:50:48] <^demon> hashar: I figured the path of least resistance would be to just copy the library from github to gerrit. [13:51:03] <^demon> manybubbles: About 60/40 torture/fun ;-) [13:51:18] New review: Aude; "although I am quite confident this patch does everything correct, it needs careful sanity check and ..." [operations/mediawiki-config] (master) C: -1; - https://gerrit.wikimedia.org/r/74362 [13:51:29] ^demon: hmm that is crazy [13:51:31] ^demon and hashar: that also has the advantage of making sure it doesn't go away - as unlikely as that is. [13:51:38] New patchset: Jeremyb; "rv tab in the middle of a line" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74363 [13:51:42] ^demon: seems Jenkins rewrite the github URL to be a relative path [13:52:14] <^demon> Yeah, which makes sense for most things since they are in gerrit and would be on disk, but not in this case. [13:54:19] <^demon> hashar: It might make sense to kill zuul for the bit while gerrit is down, so it doesn't keep trying to reconnect pointlessly. [13:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 13:54:49 UTC 2013 [13:55:02] New patchset: Aude; "Update and cleanup settings for Wikidata" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [13:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [13:56:13] ^demon: sure [13:56:18] fucking pep8 [13:56:26] so stupid [13:56:32] New patchset: Aude; "Update and cleanup settings for Wikidata" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [13:56:34] ^demon: feel free to shut zuul down whenever you want :) [13:56:36] if not args.force and \ [13:56:36] (os.path.getmtime(filepath) <= os.path.getmtime(zonefilepath)): [13:56:39] continue [13:56:44] files/authdns-gen-zones.py:77:13: E122 continuation line missing indentation or outdented [13:56:51] if not args.force and \ [13:56:51] (os.path.getmtime(filepath) <= os.path.getmtime(zonefilepath)): [13:56:54] continue [13:56:55] files/authdns-gen-zones.py:77:17: E125 continuation line does not distinguish itself from next logical line [13:56:55] New review: Aude; "still needs additional testing and sanity check" [operations/mediawiki-config] (master) C: -1; - https://gerrit.wikimedia.org/r/74362 [13:56:55] <^demon> hashar: Will do, just wanted to let you know :) [13:57:06] what the heck am I supposed to do then? [13:57:35] and if I put a parentheses around the second line gets at 80 chars and I get a warning for line length [13:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 13:57:38 UTC 2013 [13:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [13:58:40] paravoid: https://gist.github.com/hashar/6029529 :-D [13:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 13:58:50 UTC 2013 [13:59:15] and that's supposed to be more readable? [13:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [13:59:18] because it's not really [14:00:40] <^demon> cmjohnson1: Ok, let's do this shiznit [14:01:16] paravoid: but it doesn't have the \ in it which makes the python gods happy [14:01:31] okay [14:02:05] !log removing/swapping sdb2 on manganese [14:02:16] Logged the message, Master [14:02:37] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 14:02:34 UTC 2013 [14:02:45] <^demon> !log stopped gerrit & zuul services on manganese and gallium respectively [14:02:49] paravoid: another possibility is to use a variable for some of the long boolean parts [14:02:56] Logged the message, Master [14:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [14:03:39] stop the presses! [14:03:43] !log shutting down manganese [14:03:53] Logged the message, Master [14:04:36] PROBLEM - zuul_service_running on gallium is CRITICAL: PROCS CRITICAL: 0 processes with regex args ^/usr/bin/python /usr/local/bin/zuul-server [14:06:26] PROBLEM - Host manganese is DOWN: PING CRITICAL - Packet loss = 100% [14:07:25] paravoid: or you can slightly adapt your code https://gist.github.com/hashar/6029608 [14:07:58] > 79 :) [14:08:13] don't worry, I'll rework it [14:08:22] ǃlog Gerrit sleeping [14:08:26] it's just such a waste of time to adapt perfectly readable code [14:08:37] paravoid: or import getmtime with a shorter name [14:08:54] hashar: morebots sleeping? [14:09:24] paravoid: I tend to agree. My editor just reports me all the errors whenever I save and I got used to fix them [14:10:34] I had pep8 from wheezy which didn't show these :) [14:10:44] (1.2) [14:10:49] ahhh [14:11:10] one day I will get tox back ported on Precise and make use of virtual env to do the checks [14:11:20] this way you can pin which version of pep8 to use :) [14:11:34] hmm wait no. That would install the package from pip [14:11:36] RECOVERY - Host manganese is UP: PING OK - Packet loss = 0%, RTA = 0.30 ms [14:14:28] ^demon the disk is replaced and is synchronizing [14:14:39] <^demon> Mmk [14:19:50] <^demon> cmjohnson1: So, I had someone ask me a question that I didn't know the answer to. "Why does a disk replacement require downtime if there's no-interrupt RAIDs?" [14:20:07] <^demon> I know zilch here so was like "Err, cuz ops told me it had to come down :))" [14:21:14] in this case, the disks are located inside the server so I had to take it down to remove it [14:21:50] yay, no hotswap [14:22:11] <^demon> cmjohnson1: And that answer makes absolute perfect sense now. Thanks :) [14:22:52] ^demon the sync process is taking a little long...you can monitor to 'watch cat /proc/mdstat' [14:23:20] 2.7% [14:23:24] How much data is there? [14:23:34] <^demon> All the data! [14:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 14:24:47 UTC 2013 [14:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [14:26:26] 503 Service Temporarily Unavailable? [14:26:52] <^demon> twkozlowski: Gerrit? Planned downtime. [14:27:40] One more reason to subscribe to wikitech-l, I see. [14:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 14:27:39 UTC 2013 [14:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [14:29:36] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 14:29:32 UTC 2013 [14:30:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [14:31:33] <^demon> !log gerrit service back online [14:31:43] Logged the message, Master [14:31:52] <^demon> !log zuul back up too [14:32:02] Logged the message, Master [14:32:32] what is login.wikimedia.org ? [14:32:36] RECOVERY - zuul_service_running on gallium is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/local/bin/zuul-server [14:32:45] matanya: new central login [14:32:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 14:32:48 UTC 2013 [14:33:02] <^demon> Ok, gerrit's back up. Things might be a little slow for a bit while the disks finish sync'ing, but nothing to worry about. [14:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [14:33:27] thanks aude [14:33:48] not sure where it's enabled yet for login, but saw something about it today [14:33:51] <^demon> hashar: zuul came up just fine, you should start seeing events again. [14:34:15] oh, logging in goes through there now [14:34:35] what does that mean? [14:34:41] Yeah, there was a deploy last night for it [14:35:01] any technical doc to read about it? [14:35:07] ^demon: tail -f /var/log/zuul/zuul.log would show sometehing [14:35:09] https://www.mediawiki.org/wiki/Auth_systems/SUL2 [14:35:25] there is a overview on the mailing list [14:36:49] and short info in tech news, i think [14:36:50] !technews [14:37:04] meh, wm-bot doesn't repond to that here? [14:37:15] actually, anywhere. is it dead again? :/ [14:37:45] https://meta.wikimedia.org/wiki/Tech/News/Latest [14:51:36] New patchset: Akosiaris; "Introducing bacula module" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/70840 [14:53:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [14:54:37] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.125 second response time [14:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 14:54:47 UTC 2013 [14:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [14:58:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 14:58:07 UTC 2013 [14:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [14:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 14:58:49 UTC 2013 [14:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [15:00:25] New review: Akosiaris; "So after having a few full days fighting wih xtrabackup and its family I ended up with a new version..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/70840 [15:03:06] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 15:03:02 UTC 2013 [15:04:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [15:07:33] New review: Denny Vrandecic; "(1 comment)" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [15:09:36] heya paravoid, you there? [15:09:58] orrr akosiaris1, need an opinion on something [15:10:08] shoot [15:10:34] https://issues.cloudera.org/browse/HUE-1398 [15:10:44] basically, the hue init.d script is dumb [15:10:47] I just filed a bug report [15:10:59] but, i don't want to have to wait to upgrade to make my puppetization work. [15:11:26] what woudl you think if I provided my own patched init.d script with puppet [15:11:29] for hue [15:11:29] ? [15:11:50] paravoid: hi [15:12:00] paravoid: got q in relation to your comment here https://gerrit.wikimedia.org/r/#/c/73860/6/configure.ac [15:12:18] paravoid: does the version need to be the same in upstream and debian package version ? [15:14:07] ottomata: not terribly happy about it but if its that dump (how dump are we talking ???) it should be fine [15:14:37] all I need to do is add [15:14:40] —chuid $DAEMONUSER [15:14:48] to the start-stop-daemon command [15:15:38] hmmm... damn even if they fix it we are stuck to 4.2.1 for now [15:16:14] so yeah go for that but submit the patch to cloudera [15:16:37] and when we move to some other version of cloudera's packages we should see if they merged it [15:16:59] yeah cool [15:17:05] average: debian version is upstream version plus a revision [15:17:53] New patchset: Aude; "Update and cleanup settings for Wikidata" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [15:18:35] Anyone mind if we do a quick deploy to fix a SUL bug this morning? Looks like sal is pretty quiet, but wanted to make sure no one is in the middle of something. [15:18:37] New review: Aude; "the site link groups need to be specified per wikibase repo" [operations/mediawiki-config] (master) C: -1; - https://gerrit.wikimedia.org/r/74362 [15:20:14] akosiaris1: you mean like 2.0.15-2 [15:20:22] yes [15:20:45] ok, thank you [15:20:57] :-) [15:25:16] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 15:25:10 UTC 2013 [15:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [15:29:06] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 15:28:59 UTC 2013 [15:29:06] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 15:29:04 UTC 2013 [15:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [15:29:29] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [15:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 15:32:43 UTC 2013 [15:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [15:36:48] New patchset: Aude; "Update and cleanup settings for Wikidata" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [15:39:25] New review: Aude; "i think this is ready now but wouldn't mind careful review from others " [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [15:50:09] New patchset: Akosiaris; "Introducing bacula module" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/70840 [15:51:58] !log csteipp synchronized php-1.22wmf10/extensions/CentralAuth 'Fix SUL issue' [15:52:09] Logged the message, Master [15:54:20] !log csteipp synchronized php-1.22wmf9/extensions/CentralAuth 'Fix SUL issue - wmf9' [15:54:31] Logged the message, Master [15:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 15:54:51 UTC 2013 [15:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [15:57:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [15:58:26] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 15:58:19 UTC 2013 [15:58:36] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.128 second response time [15:58:47] What just happened. [15:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 15:58:52 UTC 2013 [15:59:02] create mode 100644 docroot/foundation/presentations/anthere/Frankfurt4.ppt [15:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [15:59:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [15:59:36] We do need to keep PPT presentations inside operations/mediawiki-config, do we. [16:00:01] New review: Daniel Kinzler; "(1 comment)" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [16:01:48] New patchset: Ottomata; "Installing java on stat1002" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74377 [16:02:06] Change merged: Ottomata; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74377 [16:02:20] twkozlowski: They're very important [16:02:47] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 16:02:41 UTC 2013 [16:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [16:03:20] Apparently :-) [16:03:39] I blamed Krinkle|detached [16:04:30] https://gerrit.wikimedia.org/r/#/c/74169/ — note that the link inside commit message /does not/ work [16:05:43] New patchset: Reedy; "Add new symlinks" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74379 [16:07:04] Remove the trailing full stop [16:07:23] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74379 [16:10:51] True dat Reedy, thanks. [16:10:54] Reedy: Can you do mailing list stuff? I lost my pw for mw-distributors. [16:11:14] Nope sorry, I don't have mailman admin [16:16:37] New patchset: Ottomata; "Slight restructure for java module." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74380 [16:17:59] New review: Ottomata; "Faidon, I thought you might like this." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74380 [16:22:10] !log reedy synchronized php-1.22wmf11/ 'Initial code sync' [16:22:21] Logged the message, Master [16:22:23] That took a while [16:22:53] New review: Ori.livneh; "(1 comment)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74380 [16:24:06] Not sure it's relevant, but it's taking me ages to submit a small patch. [16:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 16:24:51 UTC 2013 [16:24:58] Gerrit is likely to still have reduced performance, but I've submitted numerous and the timing seems fine [16:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [16:26:45] New patchset: Odder; "(bug 51608) Configure Babel-related variables for uk.wikisource" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74381 [16:27:09] !log reedy synchronized docroot and w [16:27:20] Logged the message, Master [16:28:54] hexmode: philippe can do mailman [16:28:56] PROBLEM - Puppet freshness on manutius is CRITICAL: No successful Puppet run in the last 10 hours [16:28:56] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 16:28:55 UTC 2013 [16:29:04] or Thehelpfulone [16:29:08] not Thehelpfulone [16:29:17] jeremyb: ty [16:29:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [16:29:54] Reedy: np, tya [16:31:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 16:31:52 UTC 2013 [16:32:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [16:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 16:32:43 UTC 2013 [16:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [16:34:23] !log reedy Started syncing Wikimedia installation... : test2wiki to 1.22wmf11 and build l10n cache [16:34:34] Logged the message, Master [16:35:01] New patchset: Ottomata; "Puppetizing hue." [operations/puppet/cdh4] (master) - https://gerrit.wikimedia.org/r/69805 [16:38:06] Could someone please clear up some stray directories with permission errors? Running the following as root would be great [16:38:08] dsh -F10 -cM -g mediawiki-installation -o -oSetupTimeout=10 'rm -rf /usr/local/apache/common/php-1.22wmf2' [16:48:03] !log reedy Finished syncing Wikimedia installation... : test2wiki to 1.22wmf11 and build l10n cache [16:48:13] Logged the message, Master [16:49:21] New patchset: Reedy; "test2wiki to 1.22wmf11" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74382 [16:50:48] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74382 [16:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 16:54:51 UTC 2013 [16:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [16:56:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [16:57:37] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.137 second response time [16:58:56] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 16:58:54 UTC 2013 [16:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 16:58:54 UTC 2013 [16:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [16:59:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [17:03:06] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 17:02:57 UTC 2013 [17:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [17:13:30] New patchset: Andrew Bogott; "Use the apache module for mediawiki_singlenode" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74385 [17:24:17] New review: Aude; "(1 comment)" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [17:27:06] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 17:26:56 UTC 2013 [17:27:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [17:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 17:27:42 UTC 2013 [17:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [17:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 17:28:54 UTC 2013 [17:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [17:30:44] New patchset: Reedy; "Update and cleanup settings for Wikidata" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [17:31:25] New patchset: Ottomata; "Adding role::analytics::hue." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74388 [17:31:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [17:32:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 17:32:46 UTC 2013 [17:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [17:33:36] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.125 second response time [17:33:40] Change merged: Petr Onderka; [operations/dumps/incremental] (gsoc) - https://gerrit.wikimedia.org/r/74361 [17:33:53] New patchset: Petr Onderka; "used cmake; fixed code for gcc" [operations/dumps/incremental] (gsoc) - https://gerrit.wikimedia.org/r/74389 [17:34:06] <^demon> manybubbles: solr[0-3] and solr-zk[0-2] nuked. [17:34:14] Change merged: Petr Onderka; [operations/dumps/incremental] (gsoc) - https://gerrit.wikimedia.org/r/74389 [17:34:14] Thanks! [17:34:32] New patchset: Ottomata; "Adding role::analytics::hue." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74388 [17:35:06] <^demon> manybubbles: Also made a branch 'solr' on CirrusSearch as of the current master. I'll look at getting that merge sorted out now. [17:35:22] ^demon: you are my hero [17:41:40] <^demon> manybubbles: I'm going to import elastica to mediawiki/extensions/CirrusSearch/Elastica for now, path of least resistance for me :) [17:45:40] New patchset: Ori.livneh; "Clean-up: port 'analysis' class to a role class." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74334 [17:48:24] <^demon> Grrrr... [17:48:29] * ^demon whacks jenkins with a cluebat [17:49:27] ^demon: pix or it didn't happen [17:51:34] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74362 [17:53:16] PROBLEM - Host labstore3 is DOWN: PING CRITICAL - Packet loss = 100% [17:53:19] <^demon> Krinkle|detached: You around? [17:54:09] New patchset: Cmjohnson; "changing argon dhcpd entry" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74398 [17:54:26] RECOVERY - Host labstore3 is UP: PING OK - Packet loss = 0%, RTA = 27.57 ms [17:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 17:54:47 UTC 2013 [17:55:11] Change merged: Cmjohnson; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74398 [17:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [17:57:12] !log reedy synchronized wmf-config/ [17:57:23] Logged the message, Master [17:57:41] !log authdns update [17:57:58] Logged the message, Master [17:58:26] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 17:58:20 UTC 2013 [17:59:16] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 17:59:11 UTC 2013 [17:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [17:59:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [17:59:32] <^demon> jeremyb: I think I might have to ditch the cluebat. Some good old fashioned gasoline + matches is in order. [18:01:15] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All wikipedias to 1.22wmf11 [18:01:25] Logged the message, Master [18:01:57] New patchset: Reedy; "All wikipedias to 1.22wmf10" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74403 [18:02:25] New patchset: Mark Bergsma; "Don't run the default vcl_fetch function" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74404 [18:02:46] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [18:02:49] woah, wikipedias on wmf11 :) [18:02:56] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 18:02:50 UTC 2013 [18:03:05] <^demon> manybubbles: I've gotta step out for a bit to run an errand. I'll try to sort this merge when I'm back. Still fighting jenkins (although I may just tell jenkins to shove it and merge anyway ;-)) [18:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [18:03:36] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.131 second response time [18:03:40] New patchset: Mark Bergsma; "Don't run the default vcl_fetch function" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74404 [18:04:05] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74403 [18:04:55] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74404 [18:05:06] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki, wikidatawiki and loginwiki to 1.22wmf11 [18:05:14] New patchset: Reedy; "testwiki, wikidatawiki and loginwiki to 1.22wmf11" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74405 [18:05:26] Logged the message, Master [18:05:36] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74405 [18:06:33] !log Ran extensions/UploadWizard/maintenance/migrateCampaigns.php against testwiki [18:06:43] Logged the message, Master [18:11:28] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwikidatawiki to 1.22wmf11 [18:11:38] Logged the message, Master [18:12:22] !log Created wb_property_info on testwikidatawiki [18:12:32] Logged the message, Master [18:13:04] !log Added term weight to wb_terms on testwikidatawiki [18:13:14] Logged the message, Master [18:15:49] New patchset: Reedy; "testwikidatawiki to 1.22wmf11" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74409 [18:16:10] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74409 [18:17:46] New patchset: RobH; "setting bastions domains search & descriptions" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74173 [18:19:40] New review: RobH; "im not getting nearly enough credit for how damned witty my self-review comments are, either no one ..." [operations/puppet] (production) C: 2; - https://gerrit.wikimedia.org/r/74173 [18:19:48] Change merged: RobH; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74173 [18:20:43] !log adding new links into cr1/cr2-eqiad asw-c-eqiad bundle [18:20:54] Logged the message, Mistress of the network gear. [18:21:30] New patchset: Mark Bergsma; "Set a default 30d cache ttl on text" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74410 [18:22:13] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74410 [18:23:57] Could someone please clear up some stray directories with permission errors? Running the following as root would be great [18:23:57] dsh -F10 -cM -g mediawiki-installation -o -oSetupTimeout=10 'rm -rf /usr/local/apache/common/php-1.22wmf2' [18:24:58] New patchset: Mark Bergsma; "Don't run the default vcl_fetch function on text" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74411 [18:25:06] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 18:24:57 UTC 2013 [18:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [18:26:20] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74411 [18:27:56] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 18:27:46 UTC 2013 [18:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [18:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 18:28:53 UTC 2013 [18:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [18:33:16] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 18:33:14 UTC 2013 [18:34:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [18:45:06] New patchset: Mark Bergsma; "Set upload backend default cache TTL to 30d" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74417 [18:46:08] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74417 [18:53:12] hi springle [18:54:44] aude, hi [18:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 18:54:45 UTC 2013 [18:55:20] springle: i'm katie from the wikidata team... [18:55:29] thought it'd be easier to talk here [18:55:31] aude, ah cool :) [18:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [18:55:47] so, i'll have daniel reply tomorrow probably about property info [18:55:49] springle: we were thinking if we can bribe you into doing or scheduling the big schema change :) [18:55:55] yeah :) [18:56:08] Denny_WMDE, both tables? [18:56:12] as long as the terms table one is good (seems so), we'd like to get it done sooner than later [18:56:23] springle: reedy can handle the property info one once approved [18:56:34] the terms_weight on the wb_terms table [18:56:35] so just wb_terms alter [18:56:37] since it's only adding it and running a script [18:56:37] ok [18:56:41] springle: exactly [18:56:53] and it is expected to take a while [18:57:04] yes, but the table has a primary key [18:57:08] thus the OSC tool can be used [18:57:18] which minimized disruption to the users [18:57:23] minimizes [18:57:26] it will take a while, yeah. will batch it with osc [18:57:28] yep [18:57:31] thanks to asher for putting that pk there :) [18:57:33] a while is fine [18:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 18:57:38 UTC 2013 [18:57:51] yeah, we always do pk now for tables [18:57:57] Denny_WMDE: if only you'd used MongoDB... [18:57:59] aude, Denny_WMDE, just review my last email (3mins ago). if you're still happy, i'll organize it [18:58:00] learned that the hard way :) [18:58:05] springle: ok [18:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [18:58:42] i would say float is okay, but up to Denny_WMDE [18:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 18:58:49 UTC 2013 [18:59:15] bblack around ? [18:59:16] it's used to give, say Athens (greece) a higher weight than say Athens, Louisiana [18:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [18:59:18] !seen bblack [18:59:19] you probably wanted to use @seen [18:59:23] @seen bblack [18:59:29] when people search for wikidata entities and items [18:59:33] wm-bot: have you seen bblack ? [18:59:47] wm-bot: mr bot, do you want a glass of water ? [18:59:57] aude, right. so no calculations and relatively small values [19:00:00] wm-bot: @seen bblack [19:00:12] Denny_WMDE should answer / confirm [19:00:17] it's his code [19:00:24] aude, Denny_WMDE, brb 10min [19:00:28] ok [19:00:51] springle: aude: confirmed [19:00:53] New patchset: Mark Bergsma; "Cap upload object cache TTLs to 1h instead of an unconditional set" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74418 [19:00:56] New patchset: Ori.livneh; "Clean-up: port 'analysis' class to a role class." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74334 [19:01:26] New patchset: Pyoungmeister; "moving tfinc from roots to mortals per rt 5485" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74419 [19:01:42] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74418 [19:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 19:02:39 UTC 2013 [19:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [19:03:26] Denny_WMDE: in that case, i think we want to make a patch to wikibase to change the column type [19:03:30] and can backport [19:03:52] huh? why? it does say double [19:04:19] oh, ok [19:04:24] double is good then [19:05:33] cmjohnson1: yay moving links [19:06:00] cmjohnson1: feel free to start moving now [19:06:02] woot [19:06:12] okay..going to move cr1 first [19:06:46] cool [19:07:37] Change merged: Pyoungmeister; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74419 [19:08:09] is anyone available to review https://gerrit.wikimedia.org/r/#/c/74334/ and https://gerrit.wikimedia.org/r/#/c/74335/ ? paravoid reviewed them once & i took his suggestions. [19:08:42] lesliecarr: added to asw-c1..see it? [19:09:37] moving links? [19:10:42] mark yes [19:11:17] what's that? the extra uplinks? [19:11:19] the link for row c is in c2 cuz we had to rma the switch awhile ago...moving back to c1 [19:11:32] and then the 2nd links going to asw-c2 [19:12:01] ahh [19:12:05] mark: we put in the uplinks to 7 [19:12:12] good [19:12:13] well i turned them up, chris put htem in a bit ago [19:12:18] to make sure we still had at least 20 [19:12:24] we can also remove that extra switch in row A again [19:12:46] cmjohnson1: link is up and happy [19:13:08] ok moving cr2 [19:14:08] !log adding elasticsearch v. 0.90.2 package to brewster [19:14:12] aude, Denny_WMDE. ok double it is. will start wb_terms OSC today. don't know yet how long it will take [19:14:13] manybubbles: ^^ [19:14:18] Logged the message, notpeter [19:14:20] springle: great! [19:14:27] however long it takes is fine [19:14:31] notpeter: thanks! [19:15:26] notpeter: spinning up a local puppetmaster to build the little elasticsearch puppet module [19:16:05] cool! [19:16:15] aude, Denny_WMDE, actually, one more question: is it likely that term_weight will ever be indexed? [19:16:29] Denny_WMDE: ^ [19:16:51] notpeter: my fist one got stuck doing its first puppet run and never finished. so I shot it. the second one is working fine [19:17:08] springle: tough one [19:17:12] why you ask? [19:17:35] springle: I hope not. [19:18:05] i think we can do a search on term name [19:18:14] and then the results should be smallish and can be sorted [19:18:25] aude: that's what we do [19:18:28] yes [19:18:44] Denny_WMDE, index size. if float can be trivially backported, future indexes covering it would be smaller [19:18:46] indexing only by weight will never be done [19:18:48] and i think that's probably okay, although i don't know enough to rule out an index [19:19:01] Denny_WMDE: we can use float [19:19:22] it's sufficient precision [19:19:42] New patchset: Mark Bergsma; "Don't run the default vcl_fetch function on upload" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74422 [19:20:32] Change merged: Mark Bergsma; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74422 [19:23:58] Denny_WMDE, ok, float it is. thanks [19:24:09] aude, ^, if you need to backport anything [19:24:36] springle: ok [19:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 19:24:50 UTC 2013 [19:25:29] New patchset: Andrew Bogott; "Use the apache module for mediawiki_singlenode" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74385 [19:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [19:26:01] Denny_WMDE: do you want to make the patch? [19:26:11] sure [19:26:16] will do [19:26:17] ok and i can backport [19:26:41] New review: Andrew Bogott; "I'm not confident about how I'm using the apache module here. The vhost class seems to expect to us..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74385 [19:27:56] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 19:27:48 UTC 2013 [19:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [19:28:43] Ryan_Lane, I'm interested in whether you think that ^^ is a valid pattern for migrating to the apache module. [19:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 19:28:54 UTC 2013 [19:28:56] RECOVERY - Puppet freshness on lanthanum is OK: puppet ran at Thu Jul 18 19:28:54 UTC 2013 [19:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [19:29:23] andrewbogott: so, we have a problem with apache in general. everyone hates every single way we're handling apache right now [19:29:28] New review: Andrew Bogott; "Don't merge this yet -- the vhost it adds has a different filename so I need to figure out about cle..." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74385 [19:29:52] Ryan_Lane, and the apache module isn't the presumed long-term winner? [19:29:57] nope :( [19:30:02] hm [19:30:06] we have no winner [19:30:24] though I think it's better than the horrible webserver::php5 class [19:31:08] Since my patch is +7, -20 I'm tempted to agree. [19:31:27] New review: Ryan Lane; "Yep. This is a sane way of handling it. You can pass in your own template, then use the variables li..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74385 [19:32:02] But, sounds like I should postpone refactoring other code since it'll provoke a holy war. [19:32:22] yeah. probably [19:32:38] for certain things I'd say go for it (like openstack, that mediawiki module, etc) [19:32:44] Is there an email thread or something about this that I can catch up on? Or just general IRC grumblings? [19:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 19:32:38 UTC 2013 [19:32:47] because I don't see a viable alternative [19:32:51] IRC grumblings [19:32:59] 'k [19:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [19:33:06] <^demon> andrewbogott: We can probably move forward with https://gerrit.wikimedia.org/r/#/c/70429/. Someone other than me should review it though. [19:33:11] I'll stay clear until I can't stand it anymore. [19:33:11] I'd say start an ops thread asking about how we want to handle apache, since we have like 20 ways now [19:34:01] ^demon: I think I'm going to leave that patch rot for now, since the whole idea of the 'wmrole' class was shockingly unpopular. [19:34:28] <^demon> In that case should we just abandon? [19:34:38] ^demon: Yeah, probably. I'll do it. [19:34:43] <^demon> mmk [19:34:51] aude: done [19:35:07] Denny_WMDE: ok [19:37:50] springle: https://gerrit.wikimedia.org/r/#/c/74426/ [19:38:19] denny is amending it so it's changed for fresh installs, but otherwise that's the patch [19:39:23] New patchset: Demon; "Fix change-abandoned hook" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74429 [19:40:59] New review: Demon; "These hooks f'ing suck. They break each time we upgrade, optparse is deprecated, and there's way too..." [operations/puppet] (production) C: 1; - https://gerrit.wikimedia.org/r/74429 [19:41:22] <^demon> Ryan_Lane: One line change ^ [19:41:51] yes. we should use streams :) [19:41:58] they didn't exist when I wrote the hook stuff [19:42:19] <^demon> stream-events has existed since...almost forever? [19:42:19] and if we use streams, there's no need for it to run on manganese [19:42:32] ^demon: in the very first version of gerrit I installed? I'm pretty sure it didn't [19:42:44] aude: springle: amended [19:42:47] maybe it did. it's used for the jenkins connector, isn't it? [19:42:51] i'm backporting [19:43:11] Coren: I'm going ti dist-upgrade all compute nodes [19:43:13] <^demon> Ryan_Lane: zuul? yep it does. [19:43:22] ^demon: not zuul. the old jenkins connector [19:43:43] <^demon> I think that used stream-events too, can't remember. [19:43:59] <^demon> Neither zuul or the plugin use the REST api, but that's another battle. [19:44:32] Ryan_Lane: Does that cause outage of the guests? [19:44:37] nope [19:44:56] <^demon> YuviPanda: You're such a great person. [19:45:03] <^demon> Have I mentioned how awesome you are recently? [19:45:09] ^demon: did I spam you again? [19:45:09] * ^demon continues to suck up [19:45:30] want me to rewrite gerrit to bugzilla bot? :D [19:45:38] <^demon> How did you ever guess? ;-) [19:45:51] <^demon> No, actually I want you to replace the gerrit -> IRC crap. [19:45:56] gerrit-wm? [19:45:59] <^demon> Yep [19:46:08] that's easier, I think. [19:46:15] I already have almost all the infrastructure in place right now [19:46:34] <^demon> Right now when gerrit's hooks are called, we output to text files on disk. Ircecho monitors those and vomits it to IRC. [19:46:51] <^demon> Since you've done stuff with stream-events, I'd rather replace the whole mess with a bot that just monitors that. [19:46:53] <^demon> And then vomits :) [19:48:47] ^demon: eeeek [19:49:04] ^demon: yeah, that does sound much simpler to do it on toollabs. But toollabs is now a bit unstable, is that okay? [19:49:07] * YuviPanda looks at Ryan_Lane and Coren [19:49:30] YuviPanda: it's been unstable due to storage [19:49:36] indeed [19:49:41] but everything comes back up when it's fixed [19:49:42] <^demon> The bot shouldn't need really any storage. [19:49:43] I can happily move this over when NFS becomes stabler [19:49:43] YuviPanda: It's not all that unstabled; there are problems with the filesystem but they normally only lead to stalls, not breakage. [19:49:48] and it looks like it may be fixed [19:49:50] whee [19:50:16] and indeed, if it runs in memory, it should continue working, even during storage failures [19:50:28] Ryan_Lane: Also true. [19:50:32] Ryan_Lane: true, this is just Redis + network, so shouldn't be a problem [19:50:36] yep [19:50:42] fucking storage [19:50:44] I hate storage [19:50:44] hmm, okay. let me get to that then. later today. [19:50:50] the bane of labs since the beginning [19:50:53] Ryan_Lane: let's put Redis on NFS :D [19:50:59] * Ryan_Lane stabs YuviPanda [19:51:06] <^demon> I heard Ryan_Lane likes gluster too ;-) [19:51:21] ^demon: I think we're moving to SMB soon. Rock solid history that one has [19:51:25] hahaha [19:51:31] ^demon: all storage sucks [19:51:33] all of it [19:51:43] <^demon> Maybe we should just hook up some external USB drives to labs. [19:51:46] <^demon> That might work better [19:52:14] !log delaying slave db45 for wikidatawiki wb_terms OSC duration [19:52:24] Logged the message, Master [19:52:25] ^demon: USB3 is very fast, I heard. [19:52:36] In all fairness, NFS has been - despite the hickups - less actively troublesome than gluster has ever been. Worst that happens is that disk IO stalls for a while. [19:52:50] <^demon> YuviPanda: Pfft, we'll get some macs and do it via thunderbolt [19:52:59] ^demon: but apple sucks [19:53:25] Coren: Aren't there other options available like XFS? [19:53:32] * ^demon whispers to his computer [19:53:41] springle: waiting for jenkins to merge stuff [19:53:41] <^demon> Ssshh, don't listen to YuviPanda, he didn't mean it. [19:53:48] Elsie: The problem isn't the underlying filesystem, but the network filesystem. [19:53:49] <^demon> Daddy loves you. [19:54:03] * YuviPanda installs Debian on ^demon's computer when he forgets to lock his screen [19:54:05] What did/does the Toolserver use? [19:54:13] Computers [19:54:17] ty Reedy [19:54:49] <^demon> YuviPanda: But I've already got a Debian VM. And like 3 ubuntu ones. [19:54:49] ^demon: if yuvi hasnt dealt with ufi versus bios install launcher you may have a few minutes if he ever actually does that ;] [19:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 19:54:46 UTC 2013 [19:55:20] /home on ha-nfs.esi:/global/home [19:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [19:55:37] ^demon: well, not the real deal. How else can you configure X to be exactly how you like? :) [19:55:45] RobH: sadly that's why I still run OS X :( [19:55:55] RobH: no more once a Haswell Carbon X1 / XPS 13 comes out tho [19:55:56] There's nothing wrong with OS X. [19:56:11] Elsie: nothing wrong with SMB either :) [19:56:13] springle: does osc run off of the code that is deployed? [19:56:13] It has a sensible GUI and a sensible backend. [19:56:21] well, i think yuvi is sad as in a free as in free solution isnt his default os. [19:56:22] <^demon> Elsie: Well I still can't compile hhvm on it :( [19:56:25] as he should be [19:56:28] i have a little shame i run os x. [19:56:42] RobH: I miss awm, mostly. And apt-get [19:56:46] e.g. the backport has to be in and deployed for osc [19:56:54] I think Wikimedia should convince Apple to open source the rest of the OS. [19:56:58] free as in speech (free as in free, wtf does that mean) [19:57:00] heh [19:57:02] They're not even making any money off of it any longer. [19:57:13] aude, no i pull the alter into a separate osc script [19:57:15] yes, because wikimedia can totally affect apple.... [19:57:18] ^demon: I'll look into it later this day [19:57:18] aude: i don't think so.. [19:57:20] ok [19:57:22] and their shareholders ;] [19:57:38] we can update wikibase, just to keep things in sync but not critical [19:57:39] <^demon> YuviPanda: You're awesome. I will totally owe you one. [19:57:51] aude: seems sensible [19:58:13] jenkins seems extra extra slow [19:58:14] <^demon> YuviPanda: brew is nice, but it's no apt. [19:58:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 19:58:06 UTC 2013 [19:58:23] !log starting wikidatawiki wb_terms.term_weight OSC [19:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [19:58:28] ^demon: brew sucks. System / brew python is a messy situation [19:58:30] apt-get install apt [19:58:34] Logged the message, Master [19:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 19:58:47 UTC 2013 [19:59:04] aude, no hurry, this will take a while :) [19:59:05] ^demon: is it just slow or not going to start gate and submit? [19:59:09] https://gerrit.wikimedia.org/r/#/c/74476/ [19:59:11] springle: ok [19:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [19:59:22] now it's going [20:00:21] ^demon: can you point me to docs of what exists right now, so I can make sure I don't miss out functionality? [20:01:26] <^demon> YuviPanda: No docs really. Existing hooks are in puppet: files/gerrit/hooks/* [20:01:39] ^demon: hmm, so things like 'put these repos in this channel' are just there? [20:01:40] * YuviPanda looks [20:01:56] New patchset: Dr0ptp4kt; "Adding Wikipedia Zero automation testing server to XFF whitelist." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74509 [20:02:24] Elsie, emo. [20:02:26] <^demon> YuviPanda: Ah, at some configuration at templates/gerrit/hookconfig.py.erb [20:02:28] <^demon> *and [20:02:31] <^demon> Sorry 'bout that [20:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 20:02:38 UTC 2013 [20:03:00] MaxSem: Which part? [20:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [20:03:18] the emo part. [20:03:38] New patchset: Dr0ptp4kt; "Adding Wikipedia Zero automation testing server to XFF whitelist." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74509 [20:03:40] /wrists [20:16:43] New patchset: Reedy; "Point php at php-1.22wmf10" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74513 [20:17:52] New patchset: Brian Wolff; "Proposed settings for VIPS." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74514 [20:19:18] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74513 [20:19:54] he [20:19:55] hey [20:21:44] New review: Yurik; "What machine is that?" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74509 [20:21:55] springle: any way to get informed when OSC is done? so we can then do next steps. i guess it might take a few … days? [20:22:25] Denny_WMDE, not days. hours [20:22:31] Denny_WMDE, I'll email [20:23:05] hours? [20:23:06] awesome! [20:23:07] thanks [20:23:59] New review: Dr0ptp4kt; "zero-test.pmtpa.wmflabs" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74509 [20:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 20:24:51 UTC 2013 [20:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [20:26:40] !log reedy synchronized php-1.22wmf11/extensions/Wikibase [20:26:51] Logged the message, Master [20:27:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 20:27:40 UTC 2013 [20:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [20:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 20:28:46 UTC 2013 [20:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [20:30:20] New review: Yurik; "but labs IPs might change at any moment?" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74509 [20:31:56] PROBLEM - Puppet freshness on neon is CRITICAL: No successful Puppet run in the last 10 hours [20:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 20:32:45 UTC 2013 [20:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [20:35:05] Reedy: Who's in charge of the deployment schedule in greg-g's absence? We'd like to do the regular LD for VisualEditor today if possible, but don't want to just declare it… [20:35:51] ^demon: https://www.mediawiki.org/wiki/User:Yuvipanda/Gerrot-Bot writing up thoguhts before I start writing code. [20:36:54] Ryan_Lane: Do you think it worthwhile to raise the issue with Dell? They don't officially do Ubuntu, but we're Important and they may yet help. [20:37:20] I expect LSI would just point us back to Dell. [20:38:01] James_F: Not sure. But MaxSem has some time in todays.. [20:38:20] Reedy: Yeah; we won't need the full window. Think it's OK to go after him? [20:38:40] me? [20:38:49] I've never had windows on tuesdays [20:38:54] It's a thursday [20:38:56] MaxSem: Apparently you have an LD this afternoon. [20:39:02] ... and yes, it's not a Tuesday. [20:39:06] it was last week [20:39:10] <^demon> YuviPanda: Edited. [20:39:19] thursdays! [20:39:30] MaxSem: OK, will pull you from the LD list. [20:39:42] MaxSem: Also, are you sure? [20:39:46] Coren: yes [20:39:55] Coren: we've had a number of controller issues with them [20:39:58] MaxSem: 'Cos you're not in the list for last Thursday, and we did a VE deploy… [20:40:11] Coren: there's likely some firmware upgrades available, right? [20:40:18] Ryan_Lane: Do we have a designated contact person with them (or with us?) [20:40:36] Ryan_Lane: Lemme check the version; I think it was upgraded recently. [20:40:38] <^demon> Elsie: I tried again with latest master, still can't get hhvm on OSX. I filed a bug :) https://github.com/facebook/hiphop-php/issues/864 [20:41:05] MaxSem: No need for "GeoData fix"? [20:41:19] it was a week ago [20:41:21] ^demon: Facebook is evil. [20:41:36] sorry for confusion [20:41:43] <^demon> Elsie: So are clowns ;-) [20:41:50] MaxSem: OK, moved. [20:41:52] FW Package Build: 12.10.2-0004 [20:42:18] ^demon: Thanks for the link, watching the thread now. [20:42:36] <^demon> yw [20:42:36] MaxSem: No worries. :-) [20:44:04] Ryan_Lane: ... I don't think I understand their version number scheme. Ours is reported as 12.10.2-0004, their latest is 12.10.2-0004,A08 ? [20:44:15] Coren: we have A05 [20:44:39] That A is some sort of point release, then? [20:45:40] labstore1 and labstor3 have the same version afaict [20:48:17] seems A is a point release [20:48:29] Coren: labstore1-4 were bought at the same time [20:48:34] I think the eqiad ones were bought later [20:49:55] New review: Matmarex; "For the record, I asked him and James said that he doesn't have a plan to respond to this patch." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/73565 [20:51:30] Does this mean I can just abandon it? [20:51:45] i'd say that should mean that we should just merge it [20:53:53] New patchset: Dr0ptp4kt; "Adding Wikipedia Zero automation testing server to XFF whitelist." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74509 [20:54:40] New patchset: MaxSem; "Add __version__ magic field" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74521 [20:54:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 20:54:47 UTC 2013 [20:55:00] New patchset: MaxSem; "Add __version__ magic field to GeoData schema" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74521 [20:55:03] New review: Dr0ptp4kt; "Good point. Just allocated a fixed "floating" (elastically load balanced) IP address of 208.80.153.1..." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74509 [20:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [20:56:38] ^demon: do you know how long on average a mw/core checkout is to take these days? [20:57:16] <^demon> Pretty quick, lemme find out [20:57:36] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 20:57:35 UTC 2013 [20:57:56] PROBLEM - Puppet freshness on analytics1019 is CRITICAL: No successful Puppet run in the last 10 hours [20:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [20:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 20:58:47 UTC 2013 [20:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [21:00:00] <^demon> YuviPanda: http://p.defau.lt/?_pz3iBmoAU7gANkurJT4zg, most time was spent in receiving objects and resolving deltas. [21:00:23] <^demon> When cloning within the wmf cluster, the time spent receiving is almost negligible. [21:00:32] <^demon> (cluster, labs, anything nearby :)) [21:00:55] ^demon: okay, I'm going to chalk this up to labs NFS then :) [21:01:04] * YuviPanda redos it [21:01:17] <^demon> Oh yeah, it'll prolly take awhile to resolve deltas on labs NFS [21:01:40] ^demon: interestingly it got everything, but I did a git status and a lot of the files showed as 'deleted' [21:01:44] git reset --hard got them back [21:01:54] still. re-clining [21:01:56] PROBLEM - Puppet freshness on analytics1018 is CRITICAL: No successful Puppet run in the last 10 hours [21:02:00] <^demon> Something must've borked between the clone and checkout. [21:02:06] <^demon> Easily fixed, as you saw :) [21:02:09] indeed [21:02:14] it's still running [21:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 21:02:42 UTC 2013 [21:02:56] PROBLEM - Puppet freshness on analytics1020 is CRITICAL: No successful Puppet run in the last 10 hours [21:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [21:17:27] ^demon: https://github.com/yuvipanda/SuchABot/issues/8 and https://github.com/yuvipanda/SuchABot/issues/7 [21:17:34] ^demon: is there a list of recognized Tags somewhere? [21:18:31] <^demon> tags of what? [21:18:37] ^demon: Bug:, RT:? [21:18:57] <^demon> Oh, um, templates/gerrit/gerrit.config.erb [21:19:01] right [21:19:05] <^demon> Looking for the trackingid stuff. [21:20:29] ^demon: okay, so that is just Bug and RT [21:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 21:24:50 UTC 2013 [21:25:23] anyone available to unblock me by merging https://gerrit.wikimedia.org/r/#/c/74334/ & https://gerrit.wikimedia.org/r/#/c/74335/ ? they were reviewed by faidon and i took his suggestion re: the paths. [21:25:37] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [21:25:46] New patchset: Vogone; "Enabling the 'property-create' right for all users on testwikidatawiki Bug: 51637" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74528 [21:28:10] did we remove ULS today? it just now disappeared from test2wiki and I can't find any particular reason for that [21:28:16] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 21:28:09 UTC 2013 [21:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [21:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 21:28:49 UTC 2013 [21:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [21:32:04] https://test2.wikipedia.org/wiki/Special:Version [21:32:15] chrismcmahon: Looks to be there to me. [21:32:41] I see it logged in and logged out on test2. [21:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 21:32:43 UTC 2013 [21:33:02] New patchset: Ottomata; "Puppetizing hue." [operations/puppet/cdh4] (master) - https://gerrit.wikimedia.org/r/69805 [21:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [21:34:32] Elsie: odd, I do not see the cog icon in the left sidebar as anon or logged it [21:34:40] logged in [21:35:03] New review: Reedy; "(1 comment)" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74528 [21:35:17] chrismcmahon: Do you see "English" at the top of the screen? [21:35:53] I don't see the cog either, but that may be something else entirely. [21:36:25] It looks like there are discrepancies between test2 and en. [21:36:32] Elsie: ah, thanks, different UI. I seem to be behind. Was not aware we were abandoning that cog icon. (might still be a bug though) [21:36:34] Namely a lack of Wikidata integration for the Main Page. [21:36:53] And "In other languages" vs. "Languages" in the sidebar. [21:37:00] No problem. [21:37:56] PROBLEM - Puppet freshness on erzurumi is CRITICAL: No successful Puppet run in the last 10 hours [21:37:56] PROBLEM - Puppet freshness on lvs1004 is CRITICAL: No successful Puppet run in the last 10 hours [21:37:56] PROBLEM - Puppet freshness on lvs1005 is CRITICAL: No successful Puppet run in the last 10 hours [21:37:56] PROBLEM - Puppet freshness on lvs1006 is CRITICAL: No successful Puppet run in the last 10 hours [21:37:56] PROBLEM - Puppet freshness on virt1 is CRITICAL: No successful Puppet run in the last 10 hours [21:37:56] PROBLEM - Puppet freshness on virt3 is CRITICAL: No successful Puppet run in the last 10 hours [21:37:56] PROBLEM - Puppet freshness on virt4 is CRITICAL: No successful Puppet run in the last 10 hours [21:41:08] ori-l: hey [21:41:32] hey paravoid [21:41:58] I have another follow up question and I'll merge [21:42:01] https://gerrit.wikimedia.org/r/#/c/74334/3/manifests/role/ipython_notebook.pp [21:42:18] ah, much obliged. looking. [21:42:22] why are pandas & sympy and the role class? i.e. why aren't they appropriate for ipython::notebook [21:43:25] i wasn't certain where to put them myself. they go could in either. i guess it depends on whether you think of them as being nice complements to ipython notebook in general or as complementing the specific setup on vanadium [21:43:43] the former is probably more true, so maybe they do belong in the module [21:44:34] New patchset: Manybubbles; "Add elasticsearch module and role." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74534 [21:44:36] I'll go with your decision either way, I just want to understand it better :) [21:47:01] well, i wanted the module to be sufficiently generic to be useful for third-parties, because there isn't another good puppet module for ipython that i'm aware of. and someone might credibly grumble at having these additional (and non-essential) packages bundled together just because someone thought they go well together. it's a throwback to the old problem you flagged with debian recommends:. [21:47:23] okay [21:47:29] maybe i should put them in the module but in a separate class? [21:47:39] 'extras' or whatever? [21:47:46] could work [21:47:50] in the role class isn't too bad either [21:48:04] let's just do that, then [21:48:50] New patchset: Faidon; "Clean-up: port 'analysis' class to a role class." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74334 [21:49:37] ^demon: the newline issue fixed :) https://gerrit.wikimedia.org/r/#/c/74523/ [21:49:57] <^demon> looks great :D [21:51:07] c'mon jenkins [21:51:14] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74334 [21:51:33] New patchset: Faidon; "Add eventlogging::plugin custom resource type" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74335 [21:52:46] !log Finally running "mwscriptwikiset maintenance/purgeDeletedFiles.php all.dblist --starttime 20130529000000" [21:52:52] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74335 [21:52:56] Logged the message, Master [21:53:12] paravoid: thanks; i really appreciate the careful reviews. [21:53:21] no worries [21:53:23] * spagewmf compliments ori-l for "complements" correctness :) [21:53:25] sorry that you had to wait for me :) [21:53:56] * ori-l is affected by spagewmf's effective compliment [21:54:10] impacted, even <--- NOOOOO [21:55:08] Aaron|home: What's wrong with foreachwiki? ;) [21:55:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 21:55:41 UTC 2013 [21:56:28] !log Scap failed due to Wikibase issue [21:56:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [21:56:38] Logged the message, Master [21:56:52] I got "Bad initialization order: When running the Wikibase repository extension and the WikibaseClient extension on the same wiki, WikibaseClient has to be included AFTER the repository." [21:56:55] Unrelated to our changes. [21:57:54] Looking at wmf-config now [21:57:56] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 21:57:48 UTC 2013 [21:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [21:59:08] greg-g, looks right in CommonSettings. [21:59:13] In git that is. [21:59:58] superm401: can you pastebin the full log? [22:00:30] https://dpaste.de/CpCYQ/ [22:01:12] superm401 Maybe "85f4cd1 Update and cleanup settings for Wikidata" ? Reedy deployed after that, but maybe without a scap [22:01:20] Looks right on tin, too. [22:01:40] scap includes CommonSettings.php, right? [22:01:40] aude: ^ [22:01:44] Yes [22:02:37] Well, I did a scap, so that probably means either the code throwing the exception is wrong, or the config (CommonSettings, etc.) is wrong in a non-obvious way [22:02:42] i think it's https://gerrit.wikimedia.org/r/#/c/72921/ [22:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 22:02:37 UTC 2013 [22:02:53] self-merged [22:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [22:06:07] hrm, maybe not [22:06:09] yay working exit status [22:06:12] @notify ^demon [22:06:12] I'll let you know when I see ^demon around here [22:06:31] nice to have scap fail this time instead of just breaking the site for half an hour [22:06:57] Krinkle: Try ^angry [22:07:55] wtf! [22:07:58] I think it's extension-list. [22:08:05] That controls the include order for i18n, right? [22:08:34] it's used by /usr/local/bin/mw-update-l10n , yes [22:08:56] oh, it tries to include everything [22:09:07] even though those extensions are not intended to be together [22:09:12] * aude knows that [22:09:37] Fix coming [22:09:41] New patchset: Mattflaschen; "Move WikibaseClient after repo to try to fix scap error." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74535 [22:09:57] ^angry: ping [22:10:00] aude: probably makes sense for you to review that ^ [22:10:03] <^angry> Krinkle: Sup? [22:10:05] k [22:10:08] ^angry: you first [22:10:09] thank you [22:10:16] in the extensions list, it's fine to put in whatever order [22:10:34] > ^demon: [2013-07-18 19:52:56 +02:00] Krinkle|detached: You around? [22:10:36] New review: Aude; "this is fine" [operations/mediawiki-config] (master) C: 1; - https://gerrit.wikimedia.org/r/74535 [22:10:39] aude, then what's causing the error? [22:10:41] Except for sensibility it was alphar sorted [22:10:51] superm401: i suppose inclusion order? [22:10:52] New patchset: Yuvipanda; "Install Lua on toollabs" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74536 [22:10:52] It can include them all at the same time [22:11:01] <^angry> Krinkle: Oh yeah, so I was trying to merge a change to CirrusSearch (it now has an external), but it wants to fetch it from on-disk rather than github. [22:11:05] right, per reedy [22:11:15] Err, can't include [22:11:23] Reedy: we're working to make it possible [22:11:37] <^angry> Krinkle: I tried copying the github dependency to gerrit, but it didn't replicate to the right place on-disk methinks :\ [22:11:38] e.g. commons can have items and use them on the same wiki [22:12:09] ideally any setup codes that depends on other extensions being loaded should run in $wgExtensionFunctions[] (or defer execution using some other callback strategy) rather than rely on the order of includes [22:12:14] *code [22:12:17] ^angry: mediawiki/extensions/CirrusSearch.git ? [22:12:53] ori-l, there's already a comment that it needs to be fixed (where it throws). [22:13:11] In the meantime, our window ended at 3. [22:13:13] <^angry> Krinkle: Yep, https://gerrit.wikimedia.org/r/#/c/74192/ and https://integration.wikimedia.org/ci/job/mwext-CirrusSearch-lint/46/console [22:13:26] Does someone want to review the extension-list change (aude +1'ed)? [22:13:29] superm401: hardly our fault, but good to ping greg-g. greg-g, around? [22:13:30] Or should I self-merge. [22:13:37] ori-l: good idea [22:13:41] New patchset: Yuvipanda; "Install Lua on toollabs" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74536 [22:13:53] superm401: i'd merge if i could :) [22:14:00] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74535 [22:14:07] superm401: merged [22:14:18] Thanks, I'll give it a go. [22:14:56] I was doing 1.22wmf11 when I got off onto this, so I need to finish that first. [22:15:23] New patchset: Reedy; "Remove bigdelete from sysops" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74538 [22:15:59] lol Reedy [22:16:26] New review: coren; "Does it, really? :-)" [operations/puppet] (production) C: 2; - https://gerrit.wikimedia.org/r/74536 [22:16:26] Change merged: coren; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74536 [22:17:00] superm401: looks like the only thing scheduled after us is: Roan: VisualEditor push of master to wmf10 and wmf11. [22:17:06] in 43 minutes [22:17:11] Reedy, is there a way to test the mergeMessageFileList.php that failed outside of scap? [22:17:12] i imagine you'd be done before then, right? [22:17:22] Run it manually [22:17:33] mwscript mergeMessageFileList.php... [22:17:39] Look what the script runs [22:18:34] James_F: Roan is away, so I'll ask you -- do you think Roan would mind being delayed by a few minutes, if we run over? [22:18:57] ori-l: Yeah, that's no problem at all. [22:19:21] cool, thanks. [22:19:46] ^angry: hm. checking it out in a minute, I'll get back to you. [22:19:53] I think something went wrong, it may not be what you think it.s [22:22:03] <^angry> Krinkle: Ok, thanks. [22:22:06] PROBLEM - Host mw1085 is DOWN: PING CRITICAL - Packet loss = 100% [22:22:24] fun [22:22:46] RECOVERY - Host mw1085 is UP: PING OK - Packet loss = 0%, RTA = 0.22 ms [22:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 22:24:48 UTC 2013 [22:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [22:27:56] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 22:27:47 UTC 2013 [22:28:14] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74538 [22:28:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [22:29:16] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 22:29:08 UTC 2013 [22:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [22:29:33] !log dist-upgrading all virt nodes [22:29:44] Logged the message, Master [22:29:46] !log reedy synchronized wmf-config/InitialiseSettings.php 'Remove bigdelete from sysyop' [22:29:57] Logged the message, Master [22:30:24] !log mflaschen Started syncing Wikimedia installation... : Second scap attempt for E3 deployment of GettingStarted and GuidedTour [22:30:34] Logged the message, Master [22:31:41] Reedy: What did that do, exactly? [22:31:56] sysops never had bigdelete? [22:32:13] New review: MZMcBride; "What is the point of this?" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/74538 [22:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 22:32:43 UTC 2013 [22:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [22:34:04] the ganglia-monitor package is so fucked [22:34:15] if you try to upgrade and its running, it breaks [22:34:16] RECOVERY - DPKG on virt6 is OK: All packages OK [22:34:27] Starting Ganglia Monitor Daemon: invoke-rc.d: initscript ganglia-monitor, action "start" failed. [22:34:27] dpkg: error processing ganglia-monitor (--configure): [22:34:27] subprocess installed post-installation script returned error exit status 1 [22:34:27] Errors were encountered while processing: [22:34:27] ganglia-monitor [22:34:28] E: Sub-process /usr/bin/dpkg returned an error code (1) [22:34:43] it needs to be stopped for the package to upgrade [22:38:55] New patchset: Andrew Bogott; "Use the apache module for mediawiki_singlenode" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74385 [22:39:54] New patchset: Lcarr; "virt1007 is new aggregator" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74546 [22:47:03] !log mflaschen Finished syncing Wikimedia installation... : Second scap attempt for E3 deployment of GettingStarted and GuidedTour [22:47:14] Logged the message, Master [22:47:22] Change merged: Lcarr; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74546 [22:54:46] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 22:54:39 UTC 2013 [22:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [22:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 22:57:37 UTC 2013 [22:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [22:58:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 22:58:48 UTC 2013 [22:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [23:02:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 23:02:40 UTC 2013 [23:02:51] James_F: we're done [23:03:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [23:03:11] ori-l: Awesome. Thanks! [23:03:52] <^angry> Krinkle: Any clues? [23:04:19] ^d: in a meeting atm, what I did gather is that I don't see .gitmodules in either repo. [23:04:37] <^d> I see it in CirrusSearch as of the merge :\ [23:05:40] <^d> Hmm, i'm gonna try something different. [23:08:55] orenwolf, we've got a template that we use for staff pages - not sure if you've been told about that? [23:09:09] Nope! [23:09:26] I'm happy to use it, however :) [23:09:53] while at it, please do mine too :) [23:10:08] Haha ;) [23:10:09] it was never done and I didn't have access to do it when I noticed [23:10:45] :) [23:11:05] great :) You can use Gayle's or Philippe's as an example https://wikimediafoundation.org/wiki/User:Gyoung and https://wikimediafoundation.org/wiki/User:Philippe_(WMF) but https://wikimediafoundation.org/wiki/Template:User_info#Usage explains what you need to do [23:11:42] The template is optional and not only for staff user pages. [23:11:43] paravoid, you can request a WMF wiki account if you'd like, or I can link to Meta/MW.org/enwiki? [23:12:18] Thehelpfulone: Thanks :) [23:12:22] no problem :) [23:13:06] New patchset: Ori.livneh; "Fix typo in variable name ('ipython_dir' => 'ipythondir')" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74552 [23:13:30] paravoid: ^ oops. [23:15:38] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/74552 [23:16:34] paravoid: thanks, sorry. [23:17:25] !log completed wikidatawiki wb_terms.term_weight OSC [23:17:36] Logged the message, Master [23:21:12] <^d> Krinkle: Got it. So, it was doing `git submodule update`. This was the first time the submodule was there--needed an init. [23:21:39] <^d> I did it manually and it'll be fine from now on, but we might wanna change that to `git submodule update --init` for all such jobs. [23:21:48] I'm out of the meeting now, looking at the repos now [23:22:03] ^d: right [23:22:47] definiteley, it should've been that all along [23:22:58] ^d: So, solved? [23:23:03] <^d> Yeah solved. [23:23:03] (well, after we fix that) [23:23:11] Alrighty [23:23:26] <^d> manybubbles: cirrus branch merged to master \o/ [23:24:56] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 23:24:48 UTC 2013 [23:25:09] ^d: We might have to fix this upstream (short of adding "git submodule update --init" as a separate build step after "git submodule update"), in the jenkins jobs we just set a property scm > git > submodule: true/false [23:25:22] there's no literal cli command [23:25:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [23:25:48] ether in jenkins-job-builder or in Jenkins-Git-Plugin [23:25:53] not sure which controls it [23:25:57] <^d> Krinkle: Ah, that would explain why I couldn't find it ;-) [23:26:19] https://github.com/wikimedia/integration-jenkins-job-builder-config/blob/master/macro-scm.yaml#L42-L47 [23:26:37] (and the other macros there) [23:27:00] * Krinkle files a bug [23:28:56] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 23:28:47 UTC 2013 [23:29:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours [23:31:15] ^d: https://bugzilla.wikimedia.org/show_bug.cgi?id=51646 [23:31:51] <^d> ty [23:32:46] RECOVERY - Puppet freshness on cp1043 is OK: puppet ran at Thu Jul 18 23:32:42 UTC 2013 [23:33:06] PROBLEM - Puppet freshness on cp1043 is CRITICAL: No successful Puppet run in the last 10 hours [23:38:24] !log catrope Started syncing Wikimedia installation... : Updating VisualEditor and CentralAuth to master [23:38:35] Logged the message, Master [23:39:32] What's up with the bigdelete user right Reedy? [23:39:54] It actually makes no functional difference [23:40:33] !log restarting db45 slave threads [23:40:44] Logged the message, Master [23:40:48] springle: How long did it take? [23:41:31] about 90 mins [23:41:31] Reedy, ^ [23:42:34] That's not too bad then [23:42:50] git I love you <3 [23:42:57] New review: Faidon; "A few inline comments for starters. Plus, use 4 tabs instead of tabs." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/74534 [23:51:50] !log deploying new php5-fss, to fix segfault bug [23:52:01] Logged the message, Master [23:52:48] StevenW: https://www.mediawiki.org/wiki/Manual:$wgCookieExpiration [23:53:35] is the last paragraph true for all MediaWikis, or just the Wikimedia ones? Gerrit #5405 only changes the one /we/ use. [23:53:44] MediaWiki wikis* [23:54:14] !log catrope Finished syncing Wikimedia installation... : Updating VisualEditor and CentralAuth to master [23:54:25] Logged the message, Master [23:54:50] RECOVERY - Puppet freshness on cp1042 is OK: puppet ran at Thu Jul 18 23:54:45 UTC 2013 [23:55:36] PROBLEM - Puppet freshness on cp1042 is CRITICAL: No successful Puppet run in the last 10 hours [23:57:46] RECOVERY - Puppet freshness on cp1044 is OK: puppet ran at Thu Jul 18 23:57:38 UTC 2013 [23:58:26] PROBLEM - Puppet freshness on cp1044 is CRITICAL: No successful Puppet run in the last 10 hours [23:59:06] RECOVERY - Puppet freshness on cp1041 is OK: puppet ran at Thu Jul 18 23:59:04 UTC 2013 [23:59:16] PROBLEM - Puppet freshness on cp1041 is CRITICAL: No successful Puppet run in the last 10 hours