[00:02:45] wikipedia-commons-local-temp: 5350932.5477619 MiB
[00:02:49] paravoid: it's mocking me :)
[00:04:22] (PS1) Aude: Move Wikibase settings to own file [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94069
[00:05:12] (PS2) Aude: Move Wikibase settings to own file [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94069
[00:08:01] paravoid: so getFileStat() is giving them an mtime of false if I use the lastmod from the listings...
[00:08:57] wfTimestamp( TS_UNIX, false ) of course gives the current time so it thinks almost nothing is expired
[00:11:49] work fine locally with ceph
[00:20:05] (PS1) Dzahn: bugzilla module - WIP [operations/puppet] - https://gerrit.wikimedia.org/r/94075
[00:21:26] (CR) jenkins-bot: [V: -1] bugzilla module - WIP [operations/puppet] - https://gerrit.wikimedia.org/r/94075 (owner: Dzahn)
[00:24:20] (PS2) Dzahn: bugzilla module - WIP [operations/puppet] - https://gerrit.wikimedia.org/r/94075
[00:26:01] paravoid: ahh, swift changed the date format
[00:26:17] that also explains why the copy script was wasting so much time I bet
[00:47:40] ugh, that's almost but not quite TS_ISO_8601
[00:49:24] apergos (when you are around again): ganglia works great again, thanks!
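The mtime problem discussed above (a timestamp that fails to parse becomes false, and false is then treated as "now", so nothing looks expired) can be sketched in Python. This is only an analogue of the pattern, not the MediaWiki code: `parse_mtime`, `is_expired_lenient`, `is_expired_strict` and the `STRICT_FORMAT` string are hypothetical names chosen for illustration, and the exact format Swift started returning is not shown in the log.

```python
from datetime import datetime, timezone

# Assumed strict timestamp layout, for illustration only; the log does not
# show the format Swift actually returned.
STRICT_FORMAT = "%Y-%m-%dT%H:%M:%SZ"


def parse_mtime(raw):
    """Return an aware UTC datetime, or None if raw does not match STRICT_FORMAT."""
    try:
        return datetime.strptime(raw, STRICT_FORMAT).replace(tzinfo=timezone.utc)
    except (TypeError, ValueError):
        return None


def is_expired_lenient(mtime, cutoff, now):
    """Buggy pattern: a failed parse silently becomes 'now', so the file looks fresh."""
    effective = now if mtime is None else mtime
    return effective < cutoff


def is_expired_strict(mtime, cutoff):
    """Safer pattern: refuse to decide when the timestamp could not be parsed."""
    if mtime is None:
        raise ValueError("mtime could not be parsed")
    return mtime < cutoff
```

With the lenient variant, every file whose timestamp is "almost but not quite" in the expected format is reported as not expired, which matches the symptom described; the strict variant surfaces the format change instead of hiding it.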
[01:06:55] PROBLEM - Puppet freshness on analytics1010 is CRITICAL: No successful Puppet run in the last 10 hours
[01:08:20] ergh i have been getting intermittent timeouts from gerrit for the last hour or so
[01:10:04] MaxSem: fyi unit tests fail for me in https://gerrit.wikimedia.org/r/#/c/88903/3
[01:10:32] 1) CoordTest::testEquals with data set #7 (Coord, NULL, false, 'Comparison with null')
[01:10:32] urgh
[01:10:32] Argument 2 passed to CoordTest::testEquals() must be an instance of Coord, null given
[01:11:55] PROBLEM - Puppet freshness on analytics1014 is CRITICAL: No successful Puppet run in the last 10 hours
[01:16:55] PROBLEM - Puppet freshness on analytics1009 is CRITICAL: No successful Puppet run in the last 10 hours
[01:19:55] PROBLEM - Puppet freshness on analytics1019 is CRITICAL: No successful Puppet run in the last 10 hours
[01:20:55] PROBLEM - Puppet freshness on testsearch1001 is CRITICAL: No successful Puppet run in the last 10 hours
[01:22:55] PROBLEM - Puppet freshness on analytics1017 is CRITICAL: No successful Puppet run in the last 10 hours
[01:23:55] PROBLEM - Puppet freshness on analytics1020 is CRITICAL: No successful Puppet run in the last 10 hours
[01:27:55] PROBLEM - Puppet freshness on analytics1016 is CRITICAL: No successful Puppet run in the last 10 hours
[01:28:55] PROBLEM - Puppet freshness on analytics1013 is CRITICAL: No successful Puppet run in the last 10 hours
[01:28:55] PROBLEM - Puppet freshness on analytics1026 is CRITICAL: No successful Puppet run in the last 10 hours
[01:33:08] (PS1) Ori.livneh: Enable ResourceLoader module storage in beta [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94084
[01:33:55] PROBLEM - Puppet freshness on analytics1027 is CRITICAL: No successful Puppet run in the last 10 hours
[01:33:57] (CR) Ori.livneh: [C: 2] Enable ResourceLoader module storage in beta [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94084 (owner: Ori.livneh)
[01:36:55] PROBLEM - Puppet freshness on analytics1012 is CRITICAL: No successful Puppet run in the last 10 hours
[01:38:28] !log ori synchronized wmf-config/InitialiseSettings.php 'I064c1b202: enable ResourceLoader module storage on mediawikiwiki'
[01:38:49] Logged the message, Master
[01:40:55] PROBLEM - Puppet freshness on analytics1015 is CRITICAL: No successful Puppet run in the last 10 hours
[01:42:25] !log ori synchronized php-1.23wmf2/resources/startup.js 'touch'
[01:42:41] Logged the message, Master
[01:42:55] PROBLEM - Puppet freshness on testsearch1003 is CRITICAL: No successful Puppet run in the last 10 hours
[01:49:55] PROBLEM - Puppet freshness on stat1002 is CRITICAL: No successful Puppet run in the last 10 hours
[01:51:55] PROBLEM - Puppet freshness on analytics1011 is CRITICAL: No successful Puppet run in the last 10 hours
[01:51:55] PROBLEM - Puppet freshness on testsearch1002 is CRITICAL: No successful Puppet run in the last 10 hours
[01:54:55] PROBLEM - Puppet freshness on analytics1018 is CRITICAL: No successful Puppet run in the last 10 hours
[02:15:28] !log LocalisationUpdate completed (1.23wmf2) at Thu Nov 7 02:15:27 UTC 2013
[02:15:43] Logged the message, Master
[02:17:49] (CR) Krinkle: "(1 comment)" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94084 (owner: Ori.livneh)
[02:23:35] !log LocalisationUpdate completed (1.23wmf1) at Thu Nov 7 02:23:35 UTC 2013
[02:23:51] Logged the message, Master
[02:57:02] The authenticity of host 'tin (10.64.0.196)' can't be established.
[02:57:04] * Aaron|home hmms
[03:00:38] !log LocalisationUpdate ResourceLoader cache refresh completed at Thu Nov 7 03:00:37 UTC 2013
[03:00:59] Logged the message, Master
[03:07:00] !log aaron synchronized php-1.23wmf1/includes 'eaa917adad163c50eaa90378db99ed1d0cb13048'
[03:07:15] Logged the message, Master
[03:12:45] !log aaron synchronized php-1.23wmf2/includes 'eaa917adad163c50eaa90378db99ed1d0cb13048'
[03:13:02] Logged the message, Master
[05:11:50] (CR) Ori.livneh: "(1 comment)" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94084 (owner: Ori.livneh)
[05:55:08] (PS1) ArielGlenn: hadoop install needs java [operations/puppet] - https://gerrit.wikimedia.org/r/94104
[05:57:07] (CR) ArielGlenn: [C: 2] hadoop install needs java [operations/puppet] - https://gerrit.wikimedia.org/r/94104 (owner: ArielGlenn)
[05:57:12] hey apergos
[05:57:18] morning
[05:57:37] something might be broken with ULSFO bits
[05:57:56] oh?
[05:58:04] some requests are taking an unreasonably long time
[05:58:16] that might be very much a mark or paravoid thing
[05:58:18] time to first byte is always quick, but some requests are bursty
[05:58:30] with gaps of no data
[05:58:34] taking a very long time to complete
[05:58:37] ugh
[05:58:48] RECOVERY - Puppet freshness on analytics1009 is OK: puppet ran at Thu Nov 7 05:58:38 UTC 2013
[05:59:14] a number of users are reporting it on VPT on enwiki, so far all are hitting ULSFO
[05:59:40] scrollback in #wikimedia-tech probably useful
[06:03:33] i think we might want to revert mark's changes
[06:03:57] ori-l: you mean config-geo changes?
[06:05:15] PROBLEM - RAID on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[06:05:24] 2f8b05af658f4cc858ece3a6b32f7a924f47ea58 this is the last thing to add to ulsfo
[06:05:31] (dns repo)
[06:06:05] RECOVERY - Puppet freshness on analytics1010 is OK: puppet ran at Thu Nov 7 06:05:57 UTC 2013
[06:06:10] canada, might not be so much, immediately before is california, oregon and washington with
[06:06:12] 177113bf7bf2174c592c7d25e1df12168a7d686d
[06:06:15] RECOVERY - RAID on snapshot3 is OK: OK: no RAID installed
[06:06:28] (ignore snapshot3 whines please)
[06:07:15] PROBLEM - DPKG on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[06:07:34] one of the reports on #-tech was oregon
[06:07:42] what about the others?
[06:08:03] i'm in australia
[06:08:15] RECOVERY - DPKG on snapshot3 is OK: All packages OK
[06:09:44] how does your traffic even go there
[06:10:04] it should be going via esams according to the file
[06:10:48] beats me!
[06:10:57] i must be special
[06:11:04] apergos: OC?
[06:11:05] RECOVERY - Puppet freshness on analytics1014 is OK: puppet ran at Thu Nov 7 06:10:55 UTC 2013
[06:11:10] let me look again
[06:11:20] yeah oc, ok
[06:11:55] i can reproduce it pretty reliably now with this URL: 'https://bits.wikimedia.org/en.wikipedia.org/load.php?debug=false&lang=en&modules=jquery%2Cmediawiki%2CSpinner%7Cjquery.triggerQueueCallback%2CloadingSpinner%2CmwEmbedUtil%7Cmw.MwEmbedSupport&only=scripts&skin=vector&version=20131106T025802Z'
[06:12:43] ori-l: here's one localization dude :)
[06:12:59] hey
[06:13:00] morning
[06:13:02] what's up
[06:13:36] bits is always fast to first byte but some response bodies come staggered, with very long pauses in between
[06:14:09] reports from oregon, australia, california (me)
[06:14:30] some trace routes and resp headers in #-tech, all point to ULSFO
[06:14:31] looking
[06:15:20] just bits?
[06:15:26] yes
[06:15:39] ori-l: uls was enwiki (api)
[06:15:44] not bits
[06:16:03] but who knows if that's a red herring
[06:16:16] yes, true
[06:16:33] also, good morning :)
[06:16:59] yes, good morning!
[06:19:05] RECOVERY - Puppet freshness on analytics1019 is OK: puppet ran at Thu Nov 7 06:18:58 UTC 2013
[06:22:25] RECOVERY - Puppet freshness on analytics1017 is OK: puppet ran at Thu Nov 7 06:22:21 UTC 2013
[06:23:05] RECOVERY - Puppet freshness on analytics1020 is OK: puppet ran at Thu Nov 7 06:23:03 UTC 2013
[06:25:16] both http and https
[06:26:28] (PS1) ArielGlenn: fix up java package for statistics [operations/puppet] - https://gerrit.wikimedia.org/r/94105
[06:27:00] (CR) jenkins-bot: [V: -1] fix up java package for statistics [operations/puppet] - https://gerrit.wikimedia.org/r/94105 (owner: ArielGlenn)
[06:27:35] RECOVERY - Puppet freshness on analytics1016 is OK: puppet ran at Thu Nov 7 06:27:25 UTC 2013
[06:28:05] RECOVERY - Puppet freshness on analytics1026 is OK: puppet ran at Thu Nov 7 06:27:56 UTC 2013
[06:28:15] RECOVERY - Puppet freshness on analytics1013 is OK: puppet ran at Thu Nov 7 06:28:06 UTC 2013
[06:28:21] (PS2) ArielGlenn: fix up java package for statistics [operations/puppet] - https://gerrit.wikimedia.org/r/94105
[06:29:04] ori-l: v6 and v4?
[06:29:40] (CR) ArielGlenn: [C: 2] fix up java package for statistics [operations/puppet] - https://gerrit.wikimedia.org/r/94105 (owner: ArielGlenn)
[06:31:35] RECOVERY - Puppet freshness on stat1002 is OK: puppet ran at Thu Nov 7 06:31:25 UTC 2013
[06:31:44] ffs, I see nothing wrong
[06:32:30] (PS1) Spage: Enable Flow discussions on a few test wiki pages [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94106
[06:33:05] RECOVERY - Puppet freshness on analytics1027 is OK: puppet ran at Thu Nov 7 06:32:57 UTC 2013
[06:35:49] (CR) Spage: [C: -1] "Do not +2 until we have an approved deployment window. We hope to deploy Flow in November." [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94106 (owner: Spage)
[06:35:55] RECOVERY - Puppet freshness on analytics1012 is OK: puppet ran at Thu Nov 7 06:35:53 UTC 2013
[06:36:01] ori-l: 07 06:29:04 < jeremyb> ori-l: v6 and v4?
[06:38:10] (PS1) ArielGlenn: fix up java package for elastic search [operations/puppet] - https://gerrit.wikimedia.org/r/94107
[06:39:11] (CR) ArielGlenn: [C: 2] fix up java package for elastic search [operations/puppet] - https://gerrit.wikimedia.org/r/94107 (owner: ArielGlenn)
[06:39:30] jeremyb: erm ,sec
[06:40:05] RECOVERY - Puppet freshness on analytics1015 is OK: puppet ran at Thu Nov 7 06:39:59 UTC 2013
[06:42:15] (PS1) ArielGlenn: and fix up the java dependency for elastic search too, woops [operations/puppet] - https://gerrit.wikimedia.org/r/94108
[06:43:17] (CR) ArielGlenn: [C: 2] and fix up the java dependency for elastic search too, woops [operations/puppet] - https://gerrit.wikimedia.org/r/94108 (owner: ArielGlenn)
[06:44:01] (PS2) Spage: Enable Flow discussions on a few test wiki pages [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94106
[06:45:05] RECOVERY - Puppet freshness on testsearch1001 is OK: puppet ran at Thu Nov 7 06:44:56 UTC 2013
[06:46:03] (CR) Spage: [C: -1] "PS2 I think has the right $wg - $wmg logic." [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94106 (owner: Spage)
[06:46:39] (PS1) Faidon Liambotis: Switch North America back to eqiad [operations/dns] - https://gerrit.wikimedia.org/r/94109
[06:47:00] (CR) Faidon Liambotis: [C: 2] Switch North America back to eqiad [operations/dns] - https://gerrit.wikimedia.org/r/94109 (owner: Faidon Liambotis)
[06:51:05] RECOVERY - Puppet freshness on analytics1011 is OK: puppet ran at Thu Nov 7 06:50:57 UTC 2013
[06:51:55] RECOVERY - Puppet freshness on testsearch1002 is OK: puppet ran at Thu Nov 7 06:51:52 UTC 2013
[06:54:15] RECOVERY - Puppet freshness on analytics1018 is OK: puppet ran at Thu Nov 7 06:54:13 UTC 2013
[07:01:18] back in a little while (errand)
[07:05:37] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[07:32:47] RECOVERY - Puppet freshness on testsearch1003 is OK: puppet ran at Thu Nov 7 07:32:43 UTC 2013
[07:38:58] hrmmm
[07:39:12] mchenry is still in pmtpa i guess
[07:39:30] RT 1804
[07:41:26] yes, it is
[07:47:09] i don't suppose someone has a clue how ipv6 mapped addresses work?
[07:47:42] 1) are any in rdns? should they be?
[07:47:49] no, no
[07:48:04] 2) i read modules/interface/manifests/add_ip6_mapped.pp but can't seem to replicate the pairing i see in the wild
[07:48:43] paravoid: any special reason? context is 3645. i now see iodine has the same issue williams once had
[07:48:48] RT 3645 that is
[07:50:53] apparently 2620:0:861:2:92b1:1cff:fe00:a9d5 is iodine
[08:05:44] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[08:12:55] PROBLEM - RAID on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:13:34] PROBLEM - Disk space on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:14:04] jeremyb: IPv6-mapped IPv4's?
[08:14:13] that address is certainly not of that kind
[08:15:24] PROBLEM - DPKG on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:16:24] RECOVERY - DPKG on snapshot3 is OK: All packages OK
[08:17:54] RECOVERY - RAID on snapshot3 is OK: OK: no RAID installed
[08:18:34] RECOVERY - Disk space on snapshot3 is OK: DISK OK
[08:19:03] Jasper_Deng: this one fits the pattern:
[08:19:04] mchenry.wikimedia.org has address 208.80.152.186
[08:19:04] mchenry.wikimedia.org has IPv6 address 2620:0:860:2:208:80:152:186
[08:19:24] PROBLEM - DPKG on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:19:27] that's not formally an IPv6-mapped address
[08:19:41] Jasper_Deng: did you read the puppet manifest i mentioned/
[08:19:43] ?
[08:19:56] * Jasper_Deng thinks jeremyb fogot that those digits are not equal to the IPv4 address in numerical value
[08:19:57] so maybe for some reason that manifest isn't applied on iodine. or williams
[08:20:06] Jasper_Deng: huh?
[08:20:13] 07 08:19:41 < jeremyb> Jasper_Deng: did you read the puppet manifest i mentioned/
[08:20:26] * Jasper_Deng doesn't have OTRS access
[08:20:42] 208:80:152:86!=208.80.152.186
[08:20:48] if you expand their binary digits
[08:20:56] this is more Wikimedia's thing, I think
[08:21:21] Jasper_Deng: well sure... but if you have all the parts of the infra follow the same convention...
[08:21:35] PROBLEM - Disk space on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:21:40] bits does not
[08:21:52] ugh
[08:22:34] RECOVERY - Disk space on snapshot3 is OK: DISK OK
[08:23:49] bonjour
[08:23:54] PROBLEM - RAID on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:24:04] PROBLEM - SSH on snapshot3 is CRITICAL: CRITICAL - Socket timeout after 10 seconds
[08:25:34] PROBLEM - Disk space on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
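The addressing convention being argued about above (writing the decimal IPv4 octets into the last four groups of an IPv6 address, as in the mchenry example) can be compared with a formal IPv4-mapped address using Python's standard `ipaddress` module. This is a minimal sketch: `decimal_embedded_v6` and `low_32_bits` are hypothetical helper names, and the real logic in `add_ip6_mapped.pp` is not shown in the log and may differ.

```python
import ipaddress


def decimal_embedded_v6(prefix, ipv4):
    """Append the IPv4 octets, written as decimal digits, as the last four groups.

    prefix is the first four groups of the IPv6 address, e.g. "2620:0:860:2".
    """
    octets = str(ipaddress.IPv4Address(ipv4)).split(".")
    return ipaddress.IPv6Address(prefix + ":" + ":".join(octets))


def low_32_bits(addr):
    """Numeric value of the last 32 bits of an IPv6 address."""
    return int(addr) & 0xFFFFFFFF


v4 = ipaddress.IPv4Address("208.80.152.186")
embedded = decimal_embedded_v6("2620:0:860:2", "208.80.152.186")
formal = ipaddress.IPv6Address("::ffff:208.80.152.186")

print(embedded)                         # human-readable convention
print(low_32_bits(embedded) == int(v4)) # digits look alike, values differ
print(formal.ipv4_mapped == v4)         # formal IPv4-mapped form does round-trip
```

The groups `208:80:152:186` are read as hexadecimal, so the address is recognisable to a human but carries a different numeric value from the IPv4 address, which is the distinction Jasper_Deng is drawing.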
[08:25:54] RECOVERY - RAID on snapshot3 is OK: OK: no RAID installed
[08:27:05] RECOVERY - SSH on snapshot3 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0)
[08:27:24] RECOVERY - DPKG on snapshot3 is OK: All packages OK
[08:27:24] RECOVERY - Disk space on snapshot3 is OK: DISK OK
[08:28:54] PROBLEM - RAID on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:29:54] RECOVERY - RAID on snapshot3 is OK: OK: no RAID installed
[08:30:24] PROBLEM - DPKG on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:31:24] RECOVERY - DPKG on snapshot3 is OK: All packages OK
[08:34:34] PROBLEM - Disk space on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:36:24] RECOVERY - Disk space on snapshot3 is OK: DISK OK
[08:37:53] (PS1) Jeremyb: RT 3645 - give iodine a predictable IPv6 address [operations/puppet] - https://gerrit.wikimedia.org/r/94111
[08:38:49] (CR) Jeremyb: "I have no idea if this is right, just copied from sodium's site.pp block." [operations/puppet] - https://gerrit.wikimedia.org/r/94111 (owner: Jeremyb)
[08:40:44] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[08:41:13] gah, dns mixes tabs and spaces!!
[08:43:24] PROBLEM - DPKG on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:44:50] RD: shall we kill otrs-test as a domain name?
[08:45:15] RECOVERY - DPKG on snapshot3 is OK: All packages OK
[08:45:54] i wonder if rd has a labs acct :P
[08:54:54] PROBLEM - RAID on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[08:55:55] RECOVERY - RAID on snapshot3 is OK: OK: no RAID installed
[08:56:37] huh, so the subnet for an ip6 mapped IP is based on subnet
[08:56:38] err
[08:56:42] how recursive
[08:56:51] (PS1) Nikerabbit: Use correct bits for uls fonts on labs [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94112
[08:56:53] huh, so the subnet for an ip6 mapped IP is based on row
[08:57:11] just guessing from reading templates/1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa
[09:01:24] PROBLEM - DPKG on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[09:03:15] RECOVERY - DPKG on snapshot3 is OK: All packages OK
[09:04:31] (CR) Petrb: [C: 1] tool labs exec_environ: add python-twitter package [operations/puppet] - https://gerrit.wikimedia.org/r/93426 (owner: Jeremyb)
[09:04:34] PROBLEM - Disk space on snapshot3 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[09:05:24] RECOVERY - Disk space on snapshot3 is OK: DISK OK
[09:06:37] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[09:12:44] (PS1) Jeremyb: rm otrs-test.wm.o (testing is over) [operations/dns] - https://gerrit.wikimedia.org/r/94114
[09:14:58] (PS1) Jeremyb: rm tesla (old virt box) [operations/dns] - https://gerrit.wikimedia.org/r/94115
[09:16:17] (CR) Jeremyb: "is tesla dead yet?" [operations/dns] - https://gerrit.wikimedia.org/r/94115 (owner: Jeremyb)
[09:25:32] (CR) Hashar: "(1 comment)" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94112 (owner: Nikerabbit)
[09:26:15] (PS2) Nikerabbit: Use correct bits for uls fonts on labs [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94112
[09:32:36] (PS2) Jeremyb: rm tesla (old virt box) [operations/dns] - https://gerrit.wikimedia.org/r/94115
[09:37:02] (CR) Hashar: [C: -1] "(1 comment)" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94112 (owner: Nikerabbit)
[09:37:17] !log uploaded snappy version 1.0.4-1build1wmf1 to apt.wikimedia.org
[09:37:32] Logged the message, Master
[09:38:13] (CR) Hashar: "vary static using $wmfVersionNumber . There is a few example in the config file, for example:" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94112 (owner: Nikerabbit)
[09:39:52] is it really called snappy?
[09:40:07] huh
[09:40:10] packages.ubuntu.com says saucy
[09:40:20] oh, it's a package anme
[09:40:21] name
[09:40:35] never heard of that before :)
[09:40:37] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[09:40:45] a compression library
[09:40:53] really fast one too
[09:41:11] akosiaris: same as http://packages.ubuntu.com/precise/snappy ?
[09:41:16] i guess not
[09:41:25] nope
[09:41:43] http://packages.ubuntu.com/precise/libsnappy1
[09:41:58] kind of confusing huh ?
[09:42:14] source package called snappy, binary package called libsnappy1 ... meh
[09:42:52] (CR) Matthias Mullie: "I assume Parsoid is working for these wikis? If so, we should save content in HTML (to not have to re-parse wikitext to html for every req" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94106 (owner: Spage)
[09:42:58] it was missing JNI bindings and recompiled it with them added and enabled so that kafka could happily compress/uncompress stuff
[09:43:25] oh, really? you can have a binary package with the same name as a source package if they're different sources packages?
[09:43:33] seems like a recipe for disaster
[09:44:03] (CR) Matthias Mullie: "And not sure how this one ($wgFlowDefaultWikiDb) works exactly, but the comments in Flow.php suggest that we'll want all wiki's data in 1 " [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94106 (owner: Spage)
[09:44:14] source package namespace is different the binary package namespace
[09:44:32] each package needs to be unique in its namespace
[09:44:50] so no ... not a recipe for disaster
[09:45:28] akosiaris: source package being named differently than binaries is very common
[09:45:36] in libraries, it's the rule
[09:45:39] exactly
[09:45:57] still... it always confuses me for a second
[09:46:19] i made the error of going to snappy player's page at least 2-3 times
[09:46:21] paravoid: and how often do you then have the reverse too? another source package with a binary that's the same name as the first source?
[09:46:38] akosiaris: right, *that* confusion is why it's such a recipe :)
[09:46:55] i got a source package named 'voluptuous' providing the binary package 'python-voluptuous'
[09:46:56] :/
[09:47:51] meh... not really. You cant really err that bad there. So no destruction. Only going huh??? for a second
[09:48:29] you are going to recompile the package and not test for example ?
[09:49:22] Reedy: https://bugzilla.wikimedia.org/show_bug.cgi?id=54985
[09:51:38] reedy is sleeping :D
[09:52:15] he will see it when he get up :)
[09:52:20] *s
[09:52:36] when i'll be sleeping most likely
[09:53:05] it's only noon now. why would you be asleep?
[09:53:14] :P
[09:55:15] (PS3) Nikerabbit: Use correct bits for uls fonts on labs [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94112
[09:58:59] (CR) Hashar: [C: 2] Use correct bits for uls fonts on labs [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94112 (owner: Nikerabbit)
[09:59:22] (Merged) jenkins-bot: Use correct bits for uls fonts on labs [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94112 (owner: Nikerabbit)
[10:00:37] !log hashar synchronized wmf-config/CommonSettings.php 'Use correct bits for uls fonts on labs {{gerrit|94112}}'
[10:00:52] Logged the message, Master
[10:01:01] !log hashar synchronized wmf-config/InitialiseSettings.php 'touch'
[10:01:16] Logged the message, Master
[10:03:37] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[10:12:06] PROBLEM - Host mw27 is DOWN: PING CRITICAL - Packet loss = 100%
[10:12:36] RECOVERY - Host mw27 is UP: PING OK - Packet loss = 0%, RTA = 35.49 ms
[10:14:39] (PS1) Jeremyb: rdns for bits-lb, mobile-lb @ ulsfo [operations/dns] - https://gerrit.wikimedia.org/r/94120
[10:16:26] (CR) Mark Bergsma: [C: -2] "IPs are being changed at the moment. reverse DNS already reflects the new IPs." [operations/dns] - https://gerrit.wikimedia.org/r/94120 (owner: Jeremyb)
[10:16:50] ok :)
[10:17:18] (Abandoned) Jeremyb: rdns for bits-lb, mobile-lb @ ulsfo [operations/dns] - https://gerrit.wikimedia.org/r/94120 (owner: Jeremyb)
[10:19:38] mark: any comment on https://gerrit.wikimedia.org/r/94111 ?
[10:20:41] * aude wonders what timezone jeremyb is in now :)
[10:20:55] aude: uga :)
[10:21:02] heh
[10:34:08] bunnyland TZ?
[10:40:46] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[10:42:14] !log gallium / jenkins : running git gc --agressive on Zuul git repositories under /srv/ssd/zuul/git
[10:42:25] Logged the message, Master
[10:42:46] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[10:43:01] (PS1) Mark Bergsma: Send non-wiki[pm]edia esams text traffic to Varnish [operations/dns] - https://gerrit.wikimedia.org/r/94122
[10:47:46] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[10:50:18] (CR) Mark Bergsma: [C: 2] Send non-wiki[pm]edia esams text traffic to Varnish [operations/dns] - https://gerrit.wikimedia.org/r/94122 (owner: Mark Bergsma)
[10:54:24] (CR) Akosiaris: [C: 2] Check for administratively disabled puppet [operations/puppet] - https://gerrit.wikimedia.org/r/93965 (owner: Akosiaris)
[11:03:46] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[11:07:57] PROBLEM - puppet disabled on db9 is CRITICAL: NRPE: Command check_puppet_disabled not defined
[11:08:33] :-D
[11:08:58] grrr why not defined ?
[11:10:58] RECOVERY - puppet disabled on db9 is OK: OK
[11:11:05] (PS1) Mark Bergsma: Send esams wikimedia-lb traffic to Varnish [operations/dns] - https://gerrit.wikimedia.org/r/94123
[11:12:03] (CR) Mark Bergsma: [C: 2] Send esams wikimedia-lb traffic to Varnish [operations/dns] - https://gerrit.wikimedia.org/r/94123 (owner: Mark Bergsma)
[11:18:13] !log gallium on swap death thanks to git eating all memory :/
[11:18:28] Logged the message, Master
[11:24:19] git pack-objects is the hog
[11:25:46] yup
[11:25:47] killed it
[11:25:52] waiting for the job to complete
[11:26:07] the max git window size is up to 8GB but gallium only has 4GB of spare mem :/
[11:26:24] that's gonna be an issue
[11:26:36] killed it again
[11:26:50] yeah would have to look at giving out a smaller window size
[11:27:19] !log gallium : killed git process
[11:27:38] Logged the message, Master
[11:28:08] core.packedGitLimit
[11:28:08] :(
[11:30:27] logged https://bugzilla.wikimedia.org/show_bug.cgi?id=56717
[11:30:30] will fix that later on
[11:30:36] ok
[11:30:47] it is out of swap now.
[11:31:23] yep see that
[11:31:58] I am out again, focusing on the dojo
[11:40:47] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 66 copy to table, 0 statistics
[12:03:47] PROBLEM - MySQL Processlist on db1047 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 0 statistics
[12:44:44] PROBLEM - Host palladium is DOWN: PING CRITICAL - Packet loss = 100%
[12:45:21] (PS1) Mark Bergsma: Send esams wikipedia traffic to Varnish [operations/dns] - https://gerrit.wikimedia.org/r/94126
[12:45:35] palladium being down is me
[12:45:37] no worries
[12:47:15] ok
[12:47:24] RECOVERY - Host palladium is UP: PING OK - Packet loss = 0%, RTA = 0.20 ms
[12:47:46] do you see anyone worrying?
[12:47:59] nope
[12:48:05] hint: it's not just for palladium :)
[12:48:17] top 5 site down
[12:48:18] ts ts ts...
[12:48:19] who cares :)
[12:48:53] we ?
[12:48:59] (CR) Mark Bergsma: [C: 2] Send esams wikipedia traffic to Varnish [operations/dns] - https://gerrit.wikimedia.org/r/94126 (owner: Mark Bergsma)
[12:50:04] PROBLEM - Host palladium is DOWN: PING CRITICAL - Packet loss = 100%
[12:51:40] * apergos whistles
[12:56:14] RECOVERY - Host palladium is UP: PING OK - Packet loss = 0%, RTA = 0.26 ms
[12:57:23] so... palladium.
[12:57:27] OK: no RAID installed
[12:57:48] but it seems to have two SEAGATE ST91000640SS in RAID1 configuration
[12:57:55] md?
[12:58:26] SCSI storage controller: LSI Logic / Symbios Logic SAS1068E PCI-Express Fusion-MPT SAS (rev 08)
[12:58:47] is it hardware raid or software raid?
[12:58:54] just rebooted an went into bios
[12:59:28] that is were i saw the two disk and the LSI Corp Config Utility For Dell SAS 6 and the Dell VIRTUAL DISK (lol)
[12:59:33] so hardware
[12:59:44] lsictl ?
[13:00:43] mptsas, fun
[13:00:45] didn't know we had these
[13:00:55] lots
[13:02:03] root@palladium:~# mpt-status
[13:02:03] ioc0 vol_id 0 type IM, 2 phy, 930 GB, state OPTIMAL, flags ENABLED
[13:02:08] mpt-status
[13:02:08] ioc0 vol_id 0 type IM, 2 phy, 930 GB, state OPTIMAL, flags ENABLED
[13:02:08] ioc0 phy 0 scsi_id 9 SEAGATE ST91000640SS AS02, 931 GB, state ONLINE, flags NONE
[13:02:08] ioc0 phy 1 scsi_id 1 SEAGATE ST91000640SS AS03, 931 GB, state ONLINE, flags NONE
[13:02:16] a yes
[13:02:21] dude, how do you think mpt-status got there just know? :)
[13:02:24] now
[13:02:28] ahahaha
[13:02:35] I also modprobe'd mptctl fwiw
[13:02:43] and i said... hey puppet had it already installed yey!!!
[13:02:48] so, are you gonna fix check-raid.py?
[13:02:50] :-(
[13:02:54] seems like it
[13:02:55] I can, sounds trivial
[13:03:21] it has a weird check at the beginning to find which utility to call....
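For the check-raid.py fix being discussed, a monitoring check could parse the `mpt-status` output pasted above roughly as follows. This is a hedged sketch, not the change that was later submitted: `check_mpt_status` and `STATE_RE` are hypothetical names, the parser takes the command output as a string so it can be tested without the hardware, and treating only `OPTIMAL` (volumes) and `ONLINE` (disks) as healthy is an assumption based on the two states visible in the log.

```python
import re

# Matches lines such as:
#   ioc0 vol_id 0 type IM, 2 phy, 930 GB, state OPTIMAL, flags ENABLED
#   ioc0 phy 0 scsi_id 9 SEAGATE ST91000640SS AS02, 931 GB, state ONLINE, flags NONE
STATE_RE = re.compile(
    r"^(?P<ioc>\S+) (?P<kind>vol_id|phy) (?P<num>\d+) .*\bstate (?P<state>\w+)"
)

# Assumed healthy states, taken from the sample output in the log.
HEALTHY = {"vol_id": "OPTIMAL", "phy": "ONLINE"}


def check_mpt_status(output):
    """Return (exit_code, message) in the usual Nagios style: 0 OK, 2 CRITICAL, 3 UNKNOWN."""
    seen = 0
    bad = []
    for line in output.splitlines():
        match = STATE_RE.match(line.strip())
        if match is None:
            continue  # skip prompts and anything else that is not a status line
        seen += 1
        kind, state = match.group("kind"), match.group("state")
        if state != HEALTHY[kind]:
            bad.append("%s %s %s is %s" % (match.group("ioc"), kind, match.group("num"), state))
    if seen == 0:
        return 3, "UNKNOWN: no mpt-status lines parsed"
    if bad:
        return 2, "CRITICAL: " + ", ".join(bad)
    return 0, "OK: %d volume/disk entries healthy" % seen
```

In a real check the string would come from running the utility and the returned code would be passed to `sys.exit`.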
[13:03:44] plus the Package resource, plus find a way to put mptctl to /etc/modules, I don't think we have a definition for that
[13:04:15] seems like a plan
[13:05:15] hah
[13:05:16] mpt-status --autoload -s
[13:05:22] autoload loads the kernel module
[13:05:23] even easier
[13:05:42] make sure to add a Service def to disable the stupid service too
[13:06:49] that is the one that sends spam all the time the moment a drive breaks right ?
[13:07:00] man i hated that thing
[13:42:48] PROBLEM - MySQL Processlist on db1045 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 70 copy to table, 2 statistics
[13:57:48] RECOVERY - MySQL Processlist on db1045 is OK: OK 0 unauthenticated, 0 locked, 1 copy to table, 19 statistics
[14:10:33] PROBLEM - MySQL Slave Running on db74 is CRITICAL: CRIT replication Slave_IO_Running: Yes Slave_SQL_Running: Yes Last_Error: Error Deadlock found when trying to get lock: try restarting transac
[14:15:33] RECOVERY - MySQL Slave Running on db74 is OK: OK replication Slave_IO_Running: Yes Slave_SQL_Running: Yes Last_Error:
[14:16:16] (CR) Daniel Kinzler: "@ariel: i replied to your comment. In short:" [operations/apache-config] - https://gerrit.wikimedia.org/r/92925 (owner: Dzahn)
[14:16:26] (CR) Daniel Kinzler: [C: 1] make Special:EntityData redirects for wikidata 303 ..instead of 302 [operations/apache-config] - https://gerrit.wikimedia.org/r/92925 (owner: Dzahn)
[14:18:02] (CR) Thehelpfulone: [C: 1] (bug 56398) Update logo for wikimania2014wiki [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/92882 (owner: Odder)
[14:18:20] Reedy, ^ can you +2 that please?
[14:18:23] PROBLEM - MySQL Processlist on db1045 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 72 copy to table, 3 statistics
[14:29:23] PROBLEM - MySQL Processlist on db1045 is CRITICAL: CRIT 1 unauthenticated, 0 locked, 66 copy to table, 1 statistics
[14:36:23] RECOVERY - MySQL Processlist on db1045 is OK: OK 1 unauthenticated, 0 locked, 1 copy to table, 2 statistics
[14:44:40] (PS2) Hashar: zuul: dependencies for Gearman based version [operations/puppet] - https://gerrit.wikimedia.org/r/93454
[14:45:48] (Abandoned) Hashar: sartorize contint scripts for jenkins slaves [operations/puppet] - https://gerrit.wikimedia.org/r/91493 (owner: Hashar)
[14:46:23] (CR) Hashar: ""Whatever the name of our deployment system is" does not let us sync minions in both prod and labs." [operations/puppet] - https://gerrit.wikimedia.org/r/91493 (owner: Hashar)
[14:48:26] (Abandoned) Hashar: gbp: point upstream to testing/3.0.3plus-rc1 branch [operations/debs/varnish] - https://gerrit.wikimedia.org/r/90718 (owner: Hashar)
[15:15:58] (PS1) Akosiaris: Adding mptsas support in check-raids [operations/puppet] - https://gerrit.wikimedia.org/r/94134
[15:17:22] (CR) jenkins-bot: [V: -1] Adding mptsas support in check-raids [operations/puppet] - https://gerrit.wikimedia.org/r/94134 (owner: Akosiaris)
[15:17:24] (PS3) Cmjohnson: decommision db3[29] db4[2-6] db5[1235689] [operations/puppet] - https://gerrit.wikimedia.org/r/93052 (owner: Springle)
[15:17:31] (CR) Cmjohnson: [C: 2] decommision db3[29] db4[2-6] db5[1235689] [operations/puppet] - https://gerrit.wikimedia.org/r/93052 (owner: Springle)
[15:18:38] (PS2) coren: tool labs exec_environ: add python-twitter package [operations/puppet] - https://gerrit.wikimedia.org/r/93426 (owner: Jeremyb)
[15:19:47] (CR) coren: [C: 2] tool labs exec_environ: add python-twitter package [operations/puppet] - https://gerrit.wikimedia.org/r/93426 (owner: Jeremyb)
[15:23:49] (PS1) Hashar: contint: deny Zuul gearman port (4370) beside localhost [operations/puppet] - https://gerrit.wikimedia.org/r/94136
[15:24:31] (PS3) Hashar: zuul: configuration for gearman [operations/puppet] - https://gerrit.wikimedia.org/r/93457
[15:24:46] (CR) Hashar: "rebased on top of:" [operations/puppet] - https://gerrit.wikimedia.org/r/93457 (owner: Hashar)
[15:27:01] (PS3) Hashar: zuul: dependencies for Gearman based version [operations/puppet] - https://gerrit.wikimedia.org/r/93454
[15:27:02] (PS4) Hashar: zuul: configuration for gearman [operations/puppet] - https://gerrit.wikimedia.org/r/93457
[15:27:25] (PS6) Hashar: role::zuul::labs::gearman to test out in labs [operations/puppet] - https://gerrit.wikimedia.org/r/93458
[15:28:41] (PS1) ArielGlenn: remove tmh1/2 from everything, decommed (rt #6222) [operations/puppet] - https://gerrit.wikimedia.org/r/94138
[15:33:20] (PS1) Cmjohnson: Removing db 33,39 5[1-5][8-9] from dsh group and ganglia.pp [operations/puppet] - https://gerrit.wikimedia.org/r/94140
[15:34:29] (CR) Cmjohnson: [C: 1] remove tmh1/2 from everything, decommed (rt #6222) [operations/puppet] - https://gerrit.wikimedia.org/r/94138 (owner: ArielGlenn)
[15:35:31] PROBLEM - MySQL Recent Restart on db1047 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[15:35:31] PROBLEM - MySQL Slave Running on db1047 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds.
[15:37:24] RECOVERY - MySQL Recent Restart on db1047 is OK: OK 2506424 seconds since restart [15:37:24] RECOVERY - MySQL Slave Running on db1047 is OK: OK replication Slave_IO_Running: Yes Slave_SQL_Running: Yes Last_Error: [15:37:40] (03CR) 10Cmjohnson: [C: 032] Removing db 33,39 5[1-5][8-9] from dsh group and ganglia.pp [operations/puppet] - 10https://gerrit.wikimedia.org/r/94140 (owner: 10Cmjohnson) [15:41:07] (03PS2) 10Akosiaris: Adding mptsas support in check-raids [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 [15:42:10] hi paravoid! [15:43:10] (03CR) 10jenkins-bot: [V: 04-1] Adding mptsas support in check-raids [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 (owner: 10Akosiaris) [15:43:21] (03PS1) 10Cmjohnson: Removing dns entries for db's 32,39 4[2-6] 5[1-3][5-6][8-9] [operations/dns] - 10https://gerrit.wikimedia.org/r/94142 [15:45:43] hi ottomata [15:46:03] so I have 2 things to work out with ya this morn [15:46:08] ipv6 and varnishkafka deb [15:47:09] hmm [15:47:10] for the .deb [15:47:29] the varnishkafka.log file stuff doesn't seem quite write [15:47:30] right* [15:47:31] :p [15:48:22] !log moved morebots back to the grid [15:48:39] Logged the message, Master [15:49:08] (03PS2) 10Cmjohnson: Removing dns entries for db's 32,39 4[2-6] 5[1-3][5-6][8-9] [operations/dns] - 10https://gerrit.wikimedia.org/r/94142 [15:49:45] (03PS1) 10AzaToth: Merging upstream changes Conflicts: src/com/facebook/buck/android/RobolectricTestRule.java src/com/facebook/buck/java/JUnitStep.java src/com/facebook/buck/java/JarDirectoryStep.java src/com/facebook/buck/java/JavaBinaryRule.java src/com/facebook/buck [operations/debs/buck] - 10https://gerrit.wikimedia.org/r/94143 [15:49:46] (03PS1) 10AzaToth: Update debian specific code [operations/debs/buck] - 10https://gerrit.wikimedia.org/r/94144 [15:51:52] (03CR) 10Cmjohnson: [C: 032] Removing dns entries for db's 32,39 4[2-6] 5[1-3][5-6][8-9] [operations/dns] - 10https://gerrit.wikimedia.org/r/94142 (owner: 
10Cmjohnson) [15:52:35] !log dns udpate [15:52:50] Logged the message, Master [15:53:27] ottomata: was there a question there? :) [15:53:36] ha, yes [15:53:40] https://gerrit.wikimedia.org/r/#/c/78782/ [15:53:43] chekc commetns here [15:54:00] so, i'm currently attempting to make sure the varnishkafka.log file is created and has proper permissions in rules [15:54:04] but 1. this seems hacky [15:54:11] and 2. it doesn't seem to even work correctly [15:54:27] is there a better way? perhaps a postinst file? [15:55:06] to do what? [15:58:43] so [15:58:52] remember when we were working on varnishkafka rsyslog stuff back in SF? [15:59:06] (03CR) 10Faidon Liambotis: [C: 04-1] "Let's do the formatting/variable naming changes in a different commit." [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 (owner: 10Akosiaris) [15:59:10] not really, no [15:59:19] /var/log/varnish is not writeable by the syslog user [15:59:32] we've configured vanrishkafka to use rsyslog to write ot /var/log/varnish/varnishkafka.log [15:59:50] so, that file needs to be in place and writeable by the syslog user upon installation of varnishkafka [16:00:38] i attempted to do this by creating and chowning the file in debian/rules [16:00:43] https://gerrit.wikimedia.org/r/#/c/78782/16/debian/rules [16:01:00] ewww [16:01:08] yes, you won the bet [16:01:12] this isn't the right way [16:02:33] the package has no business writing a logfile [16:03:46] right [16:03:54] but we weren't sure how to make this work otherwise [16:04:05] since rsyslog doesn't seem to be able to create the file itself if it can't write to the directory [16:06:14] !log hello world [16:06:30] Logged the message, Master [16:07:46] !log hello jeremyb [16:07:59] Logged the message, Master [16:08:35] kids [16:09:20] heh [16:11:43] !log hi hashar! [16:12:00] Logged the message, Master [16:12:05] hrmmmm [16:12:47] !log jenkins : reducing number of executor on gallium from 8 to 5, we have lanthanum now. 
[16:12:54] hashar: again please :) [16:13:05] (03CR) 10Jgreen: [C: 032 V: 031] rm otrs-test.wm.o (testing is over) [operations/dns] - 10https://gerrit.wikimedia.org/r/94114 (owner: 10Jeremyb) [16:13:15] !log jenkins : reducing number of executor on gallium from 8 to 5, we have lanthanum now. [16:13:32] Logged the message, Master [16:14:05] !log morebots running on tools-login again. will troubleshoot more later [16:14:20] greg-g: ping [16:14:22] thank you [16:14:23] Logged the message, Master [16:14:44] gwicke: yo yo [16:14:45] twitter just doesn't want to work on the grid... :/ [16:15:03] greg-g: we are prepping for a deploy, so you know.. [16:15:06] (03PS1) 10Mark Bergsma: On amssq47..62, sda is a HD, not an SSD [operations/puppet] - 10https://gerrit.wikimedia.org/r/94147 [16:15:08] :) [16:15:10] (03PS1) 10Ottomata: Cleaning up analytics role classes [operations/puppet] - 10https://gerrit.wikimedia.org/r/94148 [16:15:13] gwicke: when do you plan on going out? [16:15:26] greg-g: about ~30 minutes from now [16:15:32] gwicke: the big "new version of mw" goes out at 11am pacific [16:15:33] I'm starting prep [16:15:34] ok [16:15:39] cool, thanks [16:15:56] (03PS2) 10Hashar: Merging upstream changes Conflicts: src/com/facebook/buck/android/RobolectricTestRule.java src/com/facebook/buck/java/JUnitStep.java src/com/facebook/buck/java/JarDirectoryStep.java src/com/facebook/buck/java/JavaBinaryRule.java src/com/facebook/buck [operations/debs/buck] - 10https://gerrit.wikimedia.org/r/94143 (owner: 10AzaToth) [16:16:40] (03PS1) 10Andrew Bogott: Backup dynamicproxy-api's data.db [operations/puppet] - 10https://gerrit.wikimedia.org/r/94149 [16:16:56] paravoid, what should we do if we want to make the logfile work? 
[16:18:43] * mark kicks jenkins in the nuts [16:20:10] greg-g: I'm ready to go, we are starting a bit earlier [16:20:17] (03CR) 10Mark Bergsma: [C: 032] On amssq47..62, sda is a HD, not an SSD [operations/puppet] - 10https://gerrit.wikimedia.org/r/94147 (owner: 10Mark Bergsma) [16:20:26] gwicke: go forth [16:22:19] !log deployed Parsoid 986c1e787088 [16:22:34] Logged the message, Master [16:22:59] AzaToth: the debian-glue job was not being triggered. It is now https://integration.wikimedia.org/ci/job/operations-debs-buck-debian-glue/1/console :-] [16:23:32] AzaToth: the upstream branch is set to "upstream-google" which debian-glue does not checkout locally leading to: fatal: Not a valid object name upstream-google [16:23:46] (03PS2) 10Hashar: Update debian specific code [operations/debs/buck] - 10https://gerrit.wikimedia.org/r/94144 (owner: 10AzaToth) [16:28:45] saving takes a long time for some reason [16:29:03] feels like selser cache misses to me [16:29:22] sorry, wrong channel ;) [16:29:55] i lost paravoid :) :/, mark, IPv6 questions for you! [16:30:36] 1. since I want varnishkafka to address the brokers explicitly by the IPv6 addies, I should probably give them a different name than their usual A record, correct? [16:32:49] could anybody with the rights purge the parsoid caches for us? [16:33:05] hashar: I wondered if you has commited the changes (gbp-pull etc) you talked about before [16:34:12] mark: ^^ [16:34:41] AzaToth: nop [16:34:55] hashar: I see jenkins-bot says succeeded... 
[16:34:59] AzaToth: we use Jenkins git plugin to fetch the repo then run debian glue on it [16:35:52] AzaToth: well Jenkins bot does +1 but the job still fail [16:36:05] AzaToth: that is because the debian-glue jobs are ignored when figuring out the voting score [16:36:11] ah [16:36:24] this way folks are not blocked by the job while it is being tweaked [16:36:32] well, need to push upstream-google somehow then, or rename it to something common [16:36:50] must be named 'upstream' to match debian glue expectations [16:37:56] so "upstream" for upstream branch [16:38:02] which for debian branch? [16:38:04] (03CR) 10Faidon Liambotis: "(1 comment)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 (owner: 10Akosiaris) [16:38:09] "debian" or "master"? [16:38:23] debian iirc [16:38:39] any roots around? [16:38:56] pristine-tar for pristine-tar I assume [16:39:09] gwicke: salty roots! [16:39:50] jeremyb: ;) [16:40:38] we need somebody to purge the Parsoid cache to avoid some minor diffs we are seeing [16:41:18] AzaToth: in https://github.com/mika/jenkins-debian-glue/blob/master/scripts/generate-git-snapshot look for create_local_branch calls [16:41:26] create_local_branch upstream [16:41:27] create_local_branch debian [16:41:27] create_local_branch pristine-tar [16:41:28] AzaToth: ^^^^ [16:41:29] what's the roadmap for not having to do that any mroe? [16:41:33] more* [16:42:00] I am off, will be back on tuesday [16:42:02] have fun folks [16:42:05] bye! 
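The branch-naming expectation described above (debian-glue creating local branches named upstream, debian and pristine-tar) could be checked before pushing with something like the following. This is only a sketch: the helper name and the sample branch list are hypothetical, and the only thing taken from the log is the three branch names from the quoted create_local_branch calls.

```python
# Branch names taken from the create_local_branch calls quoted in the log.
EXPECTED_BRANCHES = ("upstream", "debian", "pristine-tar")


def missing_branches(remote_branches, expected=EXPECTED_BRANCHES):
    """Return the expected branch names that are absent from remote_branches.

    remote_branches is an iterable of names such as "origin/upstream-google";
    a leading "remote/" prefix is stripped before comparing.
    """
    present = {name.split("/", 1)[-1] for name in remote_branches}
    return [name for name in expected if name not in present]


if __name__ == "__main__":
    # Hypothetical repository state resembling the failure discussed above,
    # where the upstream branch is called "upstream-google".
    print(missing_branches(["origin/master", "origin/upstream-google"]))
```

An empty result would mean all three branches exist under the names the job looks for.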
[16:42:07] jeremyb: we'll have HTML upgrades once we go for HTML storage [16:42:18] ETA this quarter [16:42:34] now please do a ban on those parsoid caches [16:42:41] (i'm not a root) [16:43:02] i mean even cache busting with version in query string would be great [16:43:14] maybe i don't understand all the issues [16:44:08] normally we would not need to clear the cache, but testing against old HTML systematically is relatively difficult without storage [16:44:22] so there are minor issues that sneak through [16:44:37] maybe it used to be more common [16:44:58] and this release changed the DOM spec in several places [16:45:25] mostly it's working with the old HTML from cache [16:59:11] gwicke: what do i need to run? [17:01:13] ori-l: I have just reverted the deploy because of the missing cache purge [17:01:29] oh :/ [17:01:35] !log roll back Parsoid deploy as we are not able to purge the cache [17:01:51] Logged the message, Master [17:02:21] ori-l: the command would have been basically a varnish ban [17:04:05] now old code gets some new content from cache [17:05:40] gwicke: ohhh, i was confused, for some reason i was thinking you were talking about mobile [17:05:47] ori-l: if you could run 'varnishadm ban.url .' on {cp1045,cp1058} after deploy then I'd give it another try [17:06:07] gwicke: sure [17:06:24] ori-l: great, thanks [17:06:47] I'm here now [17:06:55] was this a scheduled deployment? [17:07:11] first time I'm hearing of a requirement for help for this deployment [17:07:47] paravoid: sounds like the purge need was unexpected [17:07:59] it's fine to be adhoc -to the extent that greg-g doesn't mind- but maybe try sourcing the ops person to help you before the deployment starts [17:08:17] paravoid: we asked for the rights to purge those caches a long time ago [17:08:24] nothing has happened so far [17:08:37] the issue seems to be VE interaction with cached content [17:08:42] so, since it hasn't happened... 
might want to source a root before hand ;) [17:08:56] paravoid: would you like to take over? i don't mind either way. [17:09:33] gwicke: the ticket was rejected iirc [17:09:34] ori-l || paravoid: could you run 'varnishadm ban.url .' on {cp1045,cp1058} now? [17:09:40] doing so [17:09:45] ok, thanks ori-l [17:10:23] !log Ran 'varnishadm ban.url .' on cp1045 & cp1058 [17:10:23] !log deployed Parsoid 986c1e78708 [17:10:37] ori-l: thanks! [17:10:37] and even if it was still considered, this is no excuse to unplanned deployments :) [17:10:40] Logged the message, Master [17:10:48] paravoid: this is a planned deployment [17:10:54] Logged the message, Master [17:11:14] s/planned/coordinated with the relevant people/ [17:11:36] that's the same thing, isn't it? [17:11:46] gwicke: sorry for poking you on this, but I assume you had an incling that'd you'd deploy today before this morning, why not let me know? [17:11:47] I don't see it at https://wikitech.wikimedia.org/wiki/Deployments [17:12:03] greg-g: it depended on test results over night [17:12:08] that list is being prepared with love and we regularly look at it [17:12:12] I do at least [17:12:13] (03CR) 10Akosiaris: "All the changes were for pep8 to pass. Nothing more. Thanks for the error in the manifest though" [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 (owner: 10Akosiaris) [17:12:17] sure, everythihng depends on test results [17:12:36] and a red wheelbarrow? [17:12:45] so A) put on calendar pending test results B) go or no go depending on test results [17:12:48] seems reasonable [17:12:57] nod [17:13:00] no w.c.williams fans? oh well. 
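The purge requested above amounts to running the same ban on two cache hosts. A minimal sketch of how that could be scripted is below; the use of ssh and the helper name are assumptions for illustration, and only the command 'varnishadm ban.url .' and the host names cp1045 and cp1058 come from the log. The function only builds the command lines and does not execute anything.

```python
# The ban command exactly as quoted in the log, split into arguments.
BAN_COMMAND = ["varnishadm", "ban.url", "."]


def ban_commands(hosts, ban=BAN_COMMAND):
    """Build one argument list per host for running the ban remotely.

    Assumption: the hosts are reachable over ssh by the person running
    this. The log does not say how the command was actually run.
    """
    return [["ssh", host] + list(ban) for host in hosts]


if __name__ == "__main__":
    for argv in ban_commands(["cp1045", "cp1058"]):
        # Printed for review only; pass argv to subprocess.run to execute.
        print(" ".join(argv))
```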
[17:13:01] * gwicke shrugs [17:13:09] having on the deployments page helps on outages too [17:13:11] if that makes a difference [17:13:33] it does [17:13:40] it does, and we've been bugging you for your release cycle/schedule for a while, with no reponse (robla asked for it at least once this week in email) [17:14:47] we don't do time-based deploys normally [17:15:02] instead we deploy when the code is ready and VE dependencies allow / require [17:15:05] so it's "whenever we feel like it with 30 minutes notice"? not going to work long term [17:15:50] * paravoid agrees with greg-g [17:16:12] I'd completely agree with you guys if this was PHP [17:16:29] what does language has to do with anything? [17:16:30] language doesn't matter, here, gabriel [17:16:45] Elastic Search is in Java, still on calendar [17:16:57] Elasticsearch [17:17:00] whatever [17:17:04] :) [17:17:04] it does matter for debugging etc [17:17:05] hehe [17:17:21] and for deploy conflicts [17:17:23] gwicke: sure, you don't use gdb, but it doesn't matter for deploy overlaps etc [17:17:28] so, here's the thing [17:17:38] is e.g. switching DCs or switching from squid to varnish on the calendar? should it be? [17:17:40] we have a pretty hard rule of using windows for deploys, no matter the langauge or servers affected [17:17:51] jeremyb: it's getting better, I'm bugging mark as much as I can :) [17:17:57] hehe [17:17:59] greg-g: i was scared you meant MS Windows?! [17:18:04] momentarily [17:18:16] mark does give me >1 week heads up on varnish changes, though [17:18:30] greg-g: what does that mean for you? [17:18:35] greg-g: ok, but is it actually on the calendar? [17:18:36] do you prepare somehow? 
[17:18:57] jeremyb: 1yes [17:19:00] -1 [17:19:14] gwicke: you isolate changes so they don't temporally overlap [17:19:31] we scheduled the DNS changes [17:19:34] so, hi, here's the thing: see what I said above about windows [17:19:37] well, there are large changes that can affect anything [17:19:44] gwicke: you announce (here probably) that you're doing a deploy and actively coordinate with greg-g before starting [17:19:48] and there are small changes that can affect only very limited things [17:19:50] gwicke: why can't you let me know? [17:19:54] we scheduled all the switching Swift masters around too (simple one line config change, but larger impact) [17:19:55] what is so hard about it? [17:19:56] and a whole lot in between [17:20:12] gwicke: i suppose you could even use an LD window if it's so small [17:20:14] the small changes normally don't make it on the calendar [17:20:21] we are somewhere in between IMO [17:20:30] whatever I'm on a call [17:20:33] so the argument is about the threshold [17:20:47] somewhere in between is "purging caches"? [17:21:04] in some cases, yes [17:21:12] it is not a very complicated command to run [17:21:25] and those are pure Parsoid caches [17:21:36] they affect nothing else [17:21:38] gwicke: please don't ignore my and robla's request for your roadmap and release plans, and please give me more than 10 minutes headsup about deploys. All other dev teams are able to give me at least 1 more, more like 2 or *months* on their plans, you're a black box, and it isn't healthy and it's causing pain. 
[17:22:13] i tend to give notice of upcoming large changes to both greg and in SoS/ops meetings etc [17:22:24] ok, I think there's some disagreement here and a spontaneous IRC discussion during a deploy is probably the wrong way to have this [17:22:25] i just don't commit to small time windows, as it doesn't make much sense for most stuff I do [17:22:53] the rules are "coordinate greg-g", so let's stick with them; if you disagree about the rules, raise them to the appropriate channels and change them [17:22:54] greg-g: I proposed to have two time slots per week for potential parsoid deploys [17:23:07] gwicke: write myself and robla an email, we're on a call right now [17:23:11] with wikidata [17:23:18] and maybe you have good arguments, and I'd be willing to support you if that's the case [17:23:39] but this isn't clearly not working, you just had to rollback a deploy because of poor coordination [17:23:43] just please don't ignore our questions with answers that are tangential [17:23:47] so let's fix the root cause so it won't happen again [17:24:00] give gwicke varnishadm rights on parsoid machines, you mean? :) [17:24:07] IMO somebody deploying should have the rights to do everything that is part of the deploy [17:24:21] yup [17:24:49] give everyone root everywhere and let them everyone (or noone) dealing with outages, who needs coordination [17:25:15] "I deploy whenever I want to" assumes that you can produce an emergency whenever you want to [17:25:17] nah, i broadly agree with you and greg-g, but i've also been on gwicke's side a lot [17:25:24] * subbu switches over to operations and sees a discussion [17:25:29] are we proposing the 'rollback to 2006' plan of giving everyone root? cuz that never had issues! [17:25:29] which means you can page me whenever you want to [17:25:44] would you text me at random hours within your day to come to irc and help you with something? 
[17:25:48] * subbu is reading scrollback [17:26:07] paravoid: the point is that gwicke's dependency on ops here is somewhat artificial [17:26:16] paravoid: we asked for a way to run a single command on those two machines [17:26:25] and we asked you to coordinate with us [17:26:45] sounds like we might have a way forward! but switching to a different medium (lists/wiki) seems better [17:26:53] I did ping greg here, and he gave me the go-ahead [17:27:04] you're basing this request on the assumption that noone is going to be around on your deploys, so you should just be able to do it [17:27:49] I'm basing it on the assumption that somebody deploying should have the rights to do the things needed during the deploy [17:27:50] gwicke: because I couldn't tell you no, I was going to have a sit down with you next week, but I didn't want to block you today. [17:28:14] or, someone deploying should have someone from ops on board to handle anything that needs handling, which is a good idea for other reasons [17:28:25] I was supposed to ignore this channel while on the call, /me goes back to ignoring [17:28:26] I think i see two issues here .. one is that greg-g and ops are asking to have advance notification before deploys + gwicke not having rights on the caches. [17:28:35] this covers the case that something goes wrong beyond the foreseen, too [17:28:39] I was actually supportive on giving you the rights to purge caches [17:28:44] this discussion is actually convincing me otherwise though :) [17:29:35] we didn't anticipate the cache purge requirement on this deploy .. which led to this. [17:29:41] well, IMO it is fairly silly to rely on Roan to be around for each deploy [17:29:51] just to run the purge in case that is needed [17:30:00] gwicke, but i do think it is reasonable that we should probably notify ops beforehand. 
[17:30:09] well, I did [17:30:27] anyway, lets add a parsoid timeslot to the calendar too [17:30:33] and also that we should have rights for purging the cache. [17:30:54] (03Abandoned) 10Siebrand: Enable EducationProgram on Dutch language Wikipedia [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/71605 (owner: 10Siebrand) [17:32:40] i dont see a reason to get into a debate over this .. we should just add our deploys to the calendar unless we have emergency hot fixes. [17:32:48] +1 [17:33:17] sure, it certainly does not harm [17:33:33] there used to be a recurring section in there- is that gone now? [17:33:33] right, so, let us do that going forward. [17:34:09] gwicke: not now, was redundant (heh), send me an email with a proposed slot and I'll find you one [17:35:05] 2 hours is the default length, btw, to guard against time needed for fixes/etc so others aren't delayed [17:35:18] you don't need to use it all, obviously, it's just a "just in case" thing [17:35:30] so we'd hold up php deploys? [17:35:33] paravoid, RobH do you think it is unreasonable for us to have access to cache purges? [17:35:46] gwicke: no, it's so no one else will go at the same time as you [17:35:55] these are scheduled, ie, you arne't holding anyone up [17:36:09] I feel like you're still testing me, and I don't appreciate it [17:36:30] I could be misunderstanding, so apologies if not [17:38:24] we also sometimes deploy puppet changes to the job queue etc [17:38:30] how are those scheduled? [17:40:02] one offs? 
those are done either in a team's normal window, or during some other time that is free on the calendar and approved by me (not to sound restrictive, that's actually a safe guard against a few corner cases, like security deploys) [17:40:22] (which are sekrit/not on the calendar) [17:41:57] so far it seemed to be more related to when the puppet change was merged [17:43:10] ok, I have to really go now, looking forward to the email from you/subbu about your release cycle/stuff and roadmap and proposed time slot [17:43:43] thanks greg-g will do. [17:46:00] greg-g: mail sent [17:48:48] mark: About? [17:48:48] [17:47:14] Reedy: When did you deploy to dewiki? [17:48:48] [17:47:47] Since about 16:55 people report users editing through esams squids [17:48:48] [17:48:32] eg. 2620:0:862:1:91:198:174:70 [17:49:19] wtf [17:49:23] that's the /other/ ipv6 ip [17:50:09] (03PS1) 10Ottomata: Updating varnishkafka module with recent varnishkafka.conf changes [operations/puppet/varnishkafka] - 10https://gerrit.wikimedia.org/r/94164 [17:50:12] we really need CIDR support for XFF [17:51:02] (03CR) 10Ottomata: [C: 032 V: 032] Updating varnishkafka module with recent varnishkafka.conf changes [operations/puppet/varnishkafka] - 10https://gerrit.wikimedia.org/r/94164 (owner: 10Ottomata) [17:53:13] Guess that should go onto the platform todo list [17:54:07] amssq60 and amssq54 I think... [17:54:17] and amssq57 [18:00:33] well, crap [18:00:43] I guess I'll just rollback to squid and fix that properly tomorrow [18:01:33] or... [18:01:38] a quick and dirty fix on the routers [18:01:43] heh [18:03:15] Can't one just revert https://gerrit.wikimedia.org/r/94147 for now? 
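The "CIDR support for XFF" wish above means treating whole address ranges, not individual IPs, as trusted proxies when reading X-Forwarded-For. A rough sketch of that idea using Python's standard ipaddress module follows. It is not MediaWiki's implementation, and the /64 range is a hypothetical value chosen only because it covers the example address quoted in the log.

```python
import ipaddress

# Hypothetical trusted-proxy ranges; the /64 is picked only because it
# covers the example address from the log, not taken from any real config.
TRUSTED_PROXY_RANGES = [ipaddress.ip_network("2620:0:862:1::/64")]


def is_trusted_proxy(addr, ranges=TRUSTED_PROXY_RANGES):
    """Return True if addr falls inside any trusted CIDR range."""
    ip = ipaddress.ip_address(addr)  # raises ValueError on malformed input
    return any(ip in net for net in ranges)


def client_ip(remote_addr, xff_header, ranges=TRUSTED_PROXY_RANGES):
    """Pick the client address from the connecting IP and an XFF header.

    Walks the hop list from the nearest hop outward and returns the first
    address that is not a trusted proxy. If every hop is trusted, the
    outermost (leftmost) address is returned.
    """
    hops = [h.strip() for h in xff_header.split(",") if h.strip()]
    hops.append(remote_addr)
    for hop in reversed(hops):
        if not is_trusted_proxy(hop, ranges):
            return hop
    return hops[0]


if __name__ == "__main__":
    # A request arriving via the address from the log, with a documentation
    # range address (203.0.113.7) standing in for the real editor.
    print(client_ip("2620:0:862:1:91:198:174:70", "203.0.113.7"))
```

With range matching, a proxy that starts sending from a different address in the same prefix would still be recognised, which is the failure mode being discussed.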
[18:05:57] "just" [18:05:57] ;) [18:07:23] !log reedy synchronized php-1.23wmf3 'Staging' [18:07:42] Logged the message, Master [18:08:25] (03PS1) 10Reedy: Add/update symlinks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94166 [18:08:55] (03CR) 10Reedy: [C: 032] Add/update symlinks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94166 (owner: 10Reedy) [18:09:26] !log reedy synchronized docroot and w [18:09:39] Logged the message, Master [18:12:16] <^d> Reedy: 1.23wmf2 & 1.23wmf3 are deployed versions? I need to sneak in a pretty major bugfix for Cirrus [18:12:29] <^d> And wanted to ride your scap [18:12:35] haha [18:12:39] I was just starting to scap now [18:12:45] wmf1 will be out of use in about an hour [18:12:53] <^d> Meh, I'll do it later, no worries. [18:14:00] !log reedy Started syncing Wikimedia installation... : testwiki to 1.23wmf3 and build l10n cache [18:14:16] Logged the message, Master [18:17:33] (03PS1) 10Mark Bergsma: Disable prefer_ipv6 until IPv6 interface address selection are sorted [operations/puppet] - 10https://gerrit.wikimedia.org/r/94168 [18:19:09] (03Merged) 10jenkins-bot: Add/update symlinks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94166 (owner: 10Reedy) [18:20:16] (03CR) 10Mark Bergsma: [C: 032 V: 032] Disable prefer_ipv6 until IPv6 interface address selection are sorted [operations/puppet] - 10https://gerrit.wikimedia.org/r/94168 (owner: 10Mark Bergsma) [18:20:53] (03PS3) 10AzaToth: Update rebased debian code [operations/debs/buck] - 10https://gerrit.wikimedia.org/r/94144 [18:22:49] (03Abandoned) 10AzaToth: Merging upstream changes [operations/debs/buck] - 10https://gerrit.wikimedia.org/r/94143 (owner: 10AzaToth) [18:23:06] ^d: is arsenic your cirrus script host? [18:23:08] arsenic: Copying to arsenic from mw1070.eqiad.wmnet...cannot delete non-empty directory: multiversion/.git/refs/heads [18:23:10] etc etc etc [18:23:30] <^d> Yes, it is. I was experimenting with something. 
[18:23:32] <^d> Lemme fix. [18:23:53] Reedy: Are you doing the config change for pushing MMV/CMD/BF to Commons and Meta, or d'you need me to? [18:24:13] The Multimedia Extension Triad [18:24:14] Which? [18:24:18] <^d> Reedy: Deleted that .git dir [18:24:29] MultimediaViewer, CommonsMetadata, BetaFeatures [18:24:47] (03PS1) 10Ottomata: Setting up varnishkafka on mobile varnish caches. [operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 [18:24:58] No, which config change? [18:25:19] (03CR) 10Ottomata: [C: 04-1] "Not ready to merge, varnishkafka package and a handful of other things need done first. Submitting this for early review." [operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 (owner: 10Ottomata) [18:25:39] PROBLEM - Apache HTTP on mw1070 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [18:26:29] RECOVERY - Apache HTTP on mw1070 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.206 second response time [18:27:02] Hey opsen, could I get https://gerrit.wikimedia.org/r/#/c/94063/ merged please? [18:27:14] It's a Gerrit replication change that has a +1 from Chad :) [18:27:28] !log reedy Finished syncing Wikimedia installation... : testwiki to 1.23wmf3 and build l10n cache [18:27:35] ottomata: If the topic is accurate you're on duty :) ----^^ [18:27:36] (03PS2) 10Ottomata: Add custom replication rules for oojs/core and oojs/ui [operations/puppet] - 10https://gerrit.wikimedia.org/r/94063 (owner: 10Catrope) [18:27:42] Logged the message, Master [18:27:44] (03CR) 10Ottomata: [C: 032 V: 032] Add custom replication rules for oojs/core and oojs/ui [operations/puppet] - 10https://gerrit.wikimedia.org/r/94063 (owner: 10Catrope) [18:27:58] gotcha! [18:28:46] paravoid, i guess you might not be around any more, but any thoughts on the varnishkafka.log discussion? [18:28:47] what should I do? [18:29:05] ottomata: Wow that was quick, thanks man [18:29:11] yup ;) [18:30:11] ottomata: (Also, can I haz it deployed?) 
[18:31:25] oof what machien is that on, do you know? [18:31:29] ytterbium ? [18:31:35] Probably [18:31:37] ^d: ? [18:31:52] (03CR) 10Edenhill: [C: 04-1] "(1 comment)" [operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 (owner: 10Ottomata) [18:31:58] yup! [18:32:02] RoanKattouw: its there [18:32:15] Awesome [18:32:29] <^d> Huh? [18:32:37] <^d> Oh, replication thing. [18:32:55] Yeah [18:33:01] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki back to 1.23wmf2 [18:33:07] Sorry for providing literally the smallest amount of context possible [18:33:19] Logged the message, Master [18:33:28] ^d: Does that replication change need a puppet run anywhere else other than ytterbium? [18:33:29] <^d> !log reloading replication plugin [18:33:40] <^d> !log for gerrit [18:33:44] Logged the message, Master [18:33:52] <^d> (considering replication is vague :p) [18:34:04] Logged the message, Master [18:34:37] haha [18:34:39] Thanks guys [18:34:44] I'm gonna create these repos now [18:35:03] (03PS1) 10Reedy: Wikipedias to 1.22wmf2 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94172 [18:35:04] (03PS1) 10Reedy: phase1 wikis to 1.23wmf3 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94173 [18:36:23] (03PS2) 10Reedy: (bug 31068) Set up NS_PROJECT for azwikibooks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91665 (owner: 10Odder) [18:36:27] (03CR) 10Reedy: [C: 032] (bug 31068) Set up NS_PROJECT for azwikibooks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91665 (owner: 10Odder) [18:37:32] (03Merged) 10jenkins-bot: (bug 31068) Set up NS_PROJECT for azwikibooks [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91665 (owner: 10Odder) [18:38:03] (03PS2) 10Reedy: Added filemover user group to bnwiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91886 (owner: 10Vogone) [18:38:07] (03CR) 10Reedy: [C: 032] Added filemover user group to bnwiki 
[operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91886 (owner: 10Vogone) [18:38:21] (03Merged) 10jenkins-bot: Added filemover user group to bnwiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/91886 (owner: 10Vogone) [18:39:01] (03PS2) 10Reedy: (bug 56198) Additional user right for sysops on eswikivoyage [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92055 (owner: 10Odder) [18:39:05] (03CR) 10Reedy: [C: 032] (bug 56198) Additional user right for sysops on eswikivoyage [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92055 (owner: 10Odder) [18:39:22] (03Merged) 10jenkins-bot: (bug 56198) Additional user right for sysops on eswikivoyage [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92055 (owner: 10Odder) [18:40:29] (03PS2) 10Reedy: Added import source for 'wikidata' [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92178 (owner: 10Vogone) [18:40:36] (03CR) 10Reedy: [C: 032] Added import source for 'wikidata' [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92178 (owner: 10Vogone) [18:40:49] (03Merged) 10jenkins-bot: Added import source for 'wikidata' [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92178 (owner: 10Vogone) [18:41:32] (03PS1) 10Ottomata: Adding config for topic_request_timeout_ms [operations/puppet/varnishkafka] - 10https://gerrit.wikimedia.org/r/94175 [18:41:48] (03CR) 10Ottomata: [C: 032 V: 032] Adding config for topic_request_timeout_ms [operations/puppet/varnishkafka] - 10https://gerrit.wikimedia.org/r/94175 (owner: 10Ottomata) [18:42:36] (03PS2) 10Ottomata: Setting up varnishkafka on mobile varnish caches. [operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 [18:42:59] (03CR) 10Ottomata: [C: 04-1] Setting up varnishkafka on mobile varnish caches. 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 (owner: 10Ottomata) [18:43:56] (03PS2) 10Reedy: Remove $wgSubversionProxy [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92606 [18:44:21] (03CR) 10Edenhill: [C: 031] Setting up varnishkafka on mobile varnish caches. [operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 (owner: 10Ottomata) [18:44:25] (03CR) 10Reedy: [C: 032] Remove $wgSubversionProxy [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92606 (owner: 10Reedy) [18:45:25] (03PS3) 10Reedy: (bug 56384) Configure $wgImportSources for dewikisource [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92797 (owner: 10Odder) [18:45:29] (03CR) 10Reedy: [C: 032] (bug 56384) Configure $wgImportSources for dewikisource [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92797 (owner: 10Odder) [18:46:48] (03PS2) 10Reedy: Follow-up to I357d3ed1c1: log a timestamp. Untested. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93020 (owner: 10MZMcBride) [18:46:53] (03CR) 10Reedy: [C: 032] Follow-up to I357d3ed1c1: log a timestamp. Untested. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93020 (owner: 10MZMcBride) [18:47:45] (03Merged) 10jenkins-bot: Remove $wgSubversionProxy [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92606 (owner: 10Reedy) [18:47:59] (03PS3) 10Reedy: Follow-up to I357d3ed1c1: log a timestamp. Untested. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93020 (owner: 10MZMcBride) [18:48:02] (03Merged) 10jenkins-bot: (bug 56384) Configure $wgImportSources for dewikisource [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92797 (owner: 10Odder) [18:48:03] (03CR) 10Reedy: [C: 032] Follow-up to I357d3ed1c1: log a timestamp. Untested. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93020 (owner: 10MZMcBride) [18:50:00] (03PS4) 10Reedy: Follow-up to I357d3ed1c1: log a timestamp. Untested. 
[operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93020 (owner: 10MZMcBride) [18:50:07] (03CR) 10Reedy: [C: 032] Follow-up to I357d3ed1c1: log a timestamp. Untested. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93020 (owner: 10MZMcBride) [18:50:28] * Reedy kicks grrrit-wm [18:50:39] (03Merged) 10jenkins-bot: Follow-up to I357d3ed1c1: log a timestamp. Untested. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93020 (owner: 10MZMcBride) [18:50:45] gj Elsie [18:51:14] (03PS2) 10Reedy: (bug 56570) Set up 'accountcreator' user group on cawiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93591 (owner: 10Odder) [18:51:18] (03CR) 10Reedy: [C: 032] (bug 56570) Set up 'accountcreator' user group on cawiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93591 (owner: 10Odder) [18:51:36] (03Merged) 10jenkins-bot: (bug 56570) Set up 'accountcreator' user group on cawiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93591 (owner: 10Odder) [18:52:18] (03PS2) 10Reedy: Increase upload size limit for chunked and URL uploads to 1000MB. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93900 (owner: 10Eloquence) [18:52:26] (03CR) 10Reedy: [C: 032] Increase upload size limit for chunked and URL uploads to 1000MB. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93900 (owner: 10Eloquence) [18:52:38] (03Merged) 10jenkins-bot: Increase upload size limit for chunked and URL uploads to 1000MB. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93900 (owner: 10Eloquence) [18:52:39] (03PS3) 10Ottomata: Setting up varnishkafka on mobile varnish caches. 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 [18:53:19] (03PS3) 10Reedy: DynamicPageList extension configuration maintenance [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92314 (owner: 10Dereckson) [18:53:23] (03CR) 10Reedy: [C: 032] DynamicPageList extension configuration maintenance [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92314 (owner: 10Dereckson) [18:54:23] (03Merged) 10jenkins-bot: DynamicPageList extension configuration maintenance [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92314 (owner: 10Dereckson) [18:56:38] (03PS3) 10Reedy: Move Wikibase settings to own file [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94069 (owner: 10Aude) [18:56:59] (03PS1) 10MaxSem: Extend OpenSearchXml with images from PageImages [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94179 [18:57:03] (03CR) 10Ottomata: [C: 04-1] Setting up varnishkafka on mobile varnish caches. [operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 (owner: 10Ottomata) [18:57:18] (03CR) 10MaxSem: [C: 04-2] "Waiting for dependency." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94179 (owner: 10MaxSem) [18:57:40] (03CR) 10jenkins-bot: [V: 04-1] Extend OpenSearchXml with images from PageImages [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94179 (owner: 10MaxSem) [18:59:03] (03PS2) 10MaxSem: Extend OpenSearchXml with images from PageImages [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94179 [19:00:36] (03Abandoned) 10MaxSem: WIP: OSM module [operations/puppet] - 10https://gerrit.wikimedia.org/r/36222 (owner: 10MaxSem) [19:00:55] (03Abandoned) 10MaxSem: Initial commit of osm2pgsql [operations/debs/osm2pgsql] - 10https://gerrit.wikimedia.org/r/48605 (owner: 10MaxSem) [19:00:59] Reedy, mark: Did that issue with 2620:0:862:1:91:198:174:70 get fixed? 
[19:01:21] (03Abandoned) 10MaxSem: Initial commit of mod_title [operations/debs/mod_tile] - 10https://gerrit.wikimedia.org/r/48606 (owner: 10MaxSem) [19:01:37] (03Abandoned) 10MaxSem: Initial commit of osm-mapnik-style [operations/debs/osm-mapnik-style] - 10https://gerrit.wikimedia.org/r/48607 (owner: 10MaxSem) [19:02:14] Krenair: not sure, but bd808 is going to set up alerting for edits from loopback / wmf ip address spaces [19:02:35] ori-l: I am? [19:02:48] (03Abandoned) 10MaxSem: Check mobile site's HTTP status [operations/puppet] - 10https://gerrit.wikimedia.org/r/57419 (owner: 10MaxSem) [19:03:00] * bd808 learns new things all the time :P [19:03:03] (03PS4) 10Reedy: Move Wikibase settings to own file [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94069 (owner: 10Aude) [19:03:36] ori-l, bd808: We already have some logging for this in CommonSettings.php , search for 'localhost' [19:03:45] (03CR) 10Reedy: [C: 032] Move Wikibase settings to own file [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94069 (owner: 10Aude) [19:03:46] we do? [19:03:51] * ori-l learns new things all the time, too :P [19:03:54] We do now [19:03:56] It's "new" [19:04:03] that's "cool" [19:04:08] oh, symlinks [19:04:15] (03Merged) 10jenkins-bot: Move Wikibase settings to own file [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94069 (owner: 10Aude) [19:04:18] Krenair: it was fixed at least twice by now :> [19:04:29] * aude missed that.... 
[19:04:43] Everyone does ;) [19:04:50] I put it in after the most recent 127.0.0.1 thing [19:04:58] (03PS2) 10Reedy: (bug 56398) Update logo for wikimania2014wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92882 (owner: 10Odder) [19:04:59] i know for next time [19:05:02] (03CR) 10Reedy: [C: 032] (bug 56398) Update logo for wikimania2014wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92882 (owner: 10Odder) [19:05:42] Reedy: https://gerrit.wikimedia.org/r/#/c/94180/ has our bug fixes [19:05:46] (03Merged) 10jenkins-bot: (bug 56398) Update logo for wikimania2014wiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92882 (owner: 10Odder) [19:05:52] test.wikidata should be better with that [19:07:21] !log reedy synchronized wmf-config/ [19:07:33] Logged the message, Master [19:07:51] * aude goes to check that wikidata is alive :) [19:08:03] all good [19:08:22] !log reedy synchronized docroot and w [19:09:49] !log reedy synchronized php-1.23wmf3/extensions/Wikibase [19:09:59] (03PS2) 10Reedy: Wikipedias to 1.22wmf2 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94172 [19:10:03] Logged the message, Master [19:10:14] (03CR) 10Reedy: [C: 032] Wikipedias to 1.22wmf2 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94172 (owner: 10Reedy) [19:10:23] (03Merged) 10jenkins-bot: Wikipedias to 1.22wmf2 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94172 (owner: 10Reedy) [19:12:29] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: all wikipedias to 1.23wmf2 [19:12:47] Logged the message, Master [19:16:05] (03PS2) 10Reedy: phase1 wikis to 1.23wmf3 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94173 [19:16:20] (03CR) 10Reedy: [C: 032] phase1 wikis to 1.23wmf3 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94173 (owner: 10Reedy) [19:16:32] (03Merged) 10jenkins-bot: phase1 wikis to 1.23wmf3 
[operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94173 (owner: 10Reedy) [19:18:11] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: phase1 wikis to 1.23wmf3 [19:18:26] Logged the message, Master [19:19:17] (03CR) 10Edenhill: [C: 031] Setting up varnishkafka on mobile varnish caches. [operations/puppet] - 10https://gerrit.wikimedia.org/r/94169 (owner: 10Ottomata) [19:19:29] Unhappy APC is unhappy [19:20:53] <^d> Poor apc [19:24:47] (03PS2) 10Reedy: Minor changes to robots.php [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92566 [19:24:53] (03CR) 10Reedy: [C: 032] Minor changes to robots.php [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92566 (owner: 10Reedy) [19:25:06] (03Merged) 10jenkins-bot: Minor changes to robots.php [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92566 (owner: 10Reedy) [19:26:28] !log reedy synchronized w/robots.php [19:26:48] Logged the message, Master [19:27:23] <^d> Reedy: Waiting was beneficial. 
Only one branch to update now :p [19:27:31] heh [19:27:36] wheee [19:28:25] (03PS3) 10Dzahn: bugzilla module - WIP [operations/puppet] - 10https://gerrit.wikimedia.org/r/94075 [19:29:16] (03PS4) 10Dzahn: bugzilla module - WIP [operations/puppet] - 10https://gerrit.wikimedia.org/r/94075 [19:32:06] (03PS1) 10Akosiaris: Cleanup check-raid.py and PEP8 compliance [operations/puppet] - 10https://gerrit.wikimedia.org/r/94184 [19:33:28] (03CR) 10Akosiaris: [C: 032] Cleanup check-raid.py and PEP8 compliance [operations/puppet] - 10https://gerrit.wikimedia.org/r/94184 (owner: 10Akosiaris) [19:36:55] (03PS3) 10Reedy: Update RC2UDP config to use $wgRCFeeds [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92455 [19:44:43] (03PS4) 10Reedy: Update RC2UDP config to use $wgRCFeeds [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92455 [19:45:48] (03PS3) 10Akosiaris: Adding mptsas support in check-raids [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 [19:45:50] (03PS5) 10Reedy: Update RC2UDP config to use $wgRCFeeds [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92455 [19:46:42] (03CR) 10jenkins-bot: [V: 04-1] Adding mptsas support in check-raids [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 (owner: 10Akosiaris) [19:47:07] (03PS6) 10Reedy: Update RC2UDP config to use $wgRCFeeds [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92455 [19:47:12] (03CR) 10Reedy: [C: 032] Update RC2UDP config to use $wgRCFeeds [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92455 (owner: 10Reedy) [19:47:25] (03Merged) 10jenkins-bot: Update RC2UDP config to use $wgRCFeeds [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92455 (owner: 10Reedy) [19:49:17] !log reedy synchronized wmf-config/ [19:52:31] (03PS4) 10Akosiaris: Adding mptsas support in check-raids [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 [19:53:59] (03CR) 10Akosiaris: [C: 032] Adding mptsas support in 
check-raids [operations/puppet] - 10https://gerrit.wikimedia.org/r/94134 (owner: 10Akosiaris) [19:58:50] (03PS3) 10Reedy: Update CentralAuth RC2UDP config [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92463 [19:59:18] (03CR) 10Krinkle: "(1 comment)" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/92566 (owner: 10Reedy) [20:00:01] (03PS1) 10Cmjohnson: Missed a dsh file for db42 [operations/puppet] - 10https://gerrit.wikimedia.org/r/94185 [20:01:06] (03CR) 10Cmjohnson: [C: 032] Missed a dsh file for db42 [operations/puppet] - 10https://gerrit.wikimedia.org/r/94185 (owner: 10Cmjohnson) [20:02:03] Reedy: Sorry I disappeared, it looks like the deploy is done and none of those extensions went out, right? [20:02:25] Like, that's fine, but it probably means I'll need to snake into the LD to deploy a config change [20:02:51] None of the extensions went out? [20:02:57] I don't see them on Commons [20:03:05] MultimediaViewer, BetaFeatures, CommonsMetadata [20:03:20] I didn't know they were supposed to be.. [20:03:39] BetaFeatures with CommonsMetadata and MultimediaViewer to Commons and MetaWiki [20:03:42] https://wikitech.wikimedia.org/wiki/Deployments [20:03:46] <^d> Reedy: I'm going to sync-dir Cirrus now if that's cool [20:03:53] Also me pinging you before the deploy [20:03:57] But it's fine [20:04:32] There's also another hour of this deploy window left anyway [20:04:36] ^d: sure [20:04:39] Oh, cool beans. [20:06:34] !log put sampling configuration on cr1-sdtpa - possible problems = high RE cpu utilization, which could cause network instability [20:06:51] Logged the message, Mistress of the network gear. [20:09:01] Reedy: To confirm, you're going to push it out? [20:09:16] (03PS1) 10Reedy: Move extension-list-1.23wmf2 into extension-list [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94187 [20:09:34] Aha, apparently yes. 
[20:10:15] (03CR) 10Reedy: [C: 032] Move extension-list-1.23wmf2 into extension-list [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94187 (owner: 10Reedy) [20:14:32] what's up with the raid status changes ? [20:14:35] !log demon synchronized php-1.23wmf2/extensions/CirrusSearch 'Cirrus to master' [20:14:58] Logged the message, Master [20:15:05] (03Merged) 10jenkins-bot: Move extension-list-1.23wmf2 into extension-list [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94187 (owner: 10Reedy) [20:15:13] akosiaris broke it :) [20:15:24] 1 PHP Fatal error: Class 'CirrusSearchLinksUpdateJob' not found in /usr/local/apache/common-local/php-1.23wmf2/extensions/CirrusSearch/includes/CirrusSearchUpdater.php on line 109 [20:15:52] Nov 7 20:14:33 [20:15:54] <^d> And not 1.23wmf3? Odd. [20:15:56] <^d> Fixing. [20:16:34] There's also a load of Error message is: SearchPhaseExecutionException[Failed to execute phase [dfs], all shards failed; shardFailures [20:17:21] manybubbles: ^d random question - is CirrusSearch deployed for enwiki? Behind some sort of a URL parameter, perhaps? [20:17:31] <^d> No, it's not, bad time. [20:17:52] ok [20:17:57] YuviPanda: What's wrong with special:Version? https://en.wikipedia.org/wiki/Special:Version [20:18:10] Reedy: i'm an idiot. nevermind me [20:18:25] :D [20:19:52] <^d> Reedy: Dafuq? It's in $wgAutoloadClasses. [20:20:14] Still only that one at that time [20:20:22] 10.64.16.117 [20:20:37] mw1137.eqiad.wmnet [20:20:38] PROBLEM - MySQL Recent Restart on db1047 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [20:20:47] <^d> Hmm. [20:21:28] RECOVERY - MySQL Recent Restart on db1047 is OK: OK 2523468 seconds since restart [20:21:34] Only worry if they come back [20:21:49] <^d> The actual file is missing from mw1137 [20:21:54] <^d> Everything else is there though. [20:22:14] <^d> Or not. [20:22:18] <^d> Nvm, ignoring. [20:22:22] <^d> So, the other error. 
[20:23:46] jdlrobson: Could you (or somebody else) reply inline https://gerrit.wikimedia.org/r/93490 ? ;) [20:24:01] Wrong channel [20:24:38] (03PS1) 10Akosiaris: Fix bug introduced in 064974ed4b [operations/puppet] - 10https://gerrit.wikimedia.org/r/94232 [20:26:45] !log reedy Started syncing Wikimedia installation... : Ensure l10n cache is up to date [20:27:01] That was quick.. [20:27:03] Logged the message, Master [20:28:36] (03CR) 10Akosiaris: [C: 032 V: 032] Fix bug introduced in 064974ed4b [operations/puppet] - 10https://gerrit.wikimedia.org/r/94232 (owner: 10Akosiaris) [20:34:50] !log reedy Finished syncing Wikimedia installation... : Ensure l10n cache is up to date [20:35:09] Logged the message, Master [20:35:27] <^d> !log elastic: in-place reindexing for all wikis running cirrus as primary [20:35:44] Logged the message, Master [20:47:07] [09f20cf3] 2013-11-07 20:46:44: Fatal exception of type MWException [20:47:18] @skwiktionary [20:50:30] (03PS1) 10Reedy: Enable MultimediaViewer, BetaFeatures, CommonsMetadata on commonswiki and metawiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94234 [20:50:38] (03CR) 10jenkins-bot: [V: 04-1] Enable MultimediaViewer, BetaFeatures, CommonsMetadata on commonswiki and metawiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94234 (owner: 10Reedy) [20:52:04] (03PS2) 10Reedy: Enable MultimediaViewer, BetaFeatures, CommonsMetadata on commonswiki and metawiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94234 [20:53:45] !log reedy synchronized wmf-config/InitialiseSettings.php [20:54:09] Logged the message, Master [20:55:15] (03CR) 10Reedy: [C: 032] Enable MultimediaViewer, BetaFeatures, CommonsMetadata on commonswiki and metawiki [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94234 (owner: 10Reedy) [20:58:09] (03Merged) 10jenkins-bot: Enable MultimediaViewer, BetaFeatures, CommonsMetadata on commonswiki and metawiki [operations/mediawiki-config] - 
10https://gerrit.wikimedia.org/r/94234 (owner: 10Reedy) [20:58:16] paravoid: the cleanupStash script deleted 100k+ things and it still going [20:58:25] oh, cool! [20:58:31] I saw something on the backlog [20:58:32] that really should delete in batches...really slow [20:58:37] about the dates being changed? [20:58:51] date format [20:58:56] I guess it won't be as bad when the backlog isn't so huge [20:59:06] not sure how long that was broken...maybe a long time [20:59:43] it was tested against ceph, which worked OK...maybe it was always broken since that adviseStat flag was there [21:00:04] !log Created betafeatures database table on metawiki and commonswiki [21:00:06] anyway, merging the fix for that (plus updating some stash regex) seems to work [21:00:23] Reedy: Done with the deploy? If not, please ping me when you are. [21:00:23] Logged the message, Master [21:00:27] * Aaron|home likes all the different file name formats the stash uses [21:01:15] it's at commons shard 0d now [21:01:30] 243 to go [21:01:50] does it delete anything else but the contents of temp containers? 
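A minimal sketch of the batching idea raised above ("that really should delete in batches"), written in Python rather than the PHP of the actual cleanupStash script. Nothing here is taken from the MediaWiki code: `delete_batch` is a hypothetical callback standing in for whatever bulk-delete call the storage backend offers, and the batch size of 100 is an arbitrary assumption.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List


def batched(items: Iterable[str], size: int) -> Iterator[List[str]]:
    """Yield successive lists of at most `size` items from `items`."""
    if size < 1:
        raise ValueError("size must be >= 1")
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


def delete_in_batches(
    paths: Iterable[str],
    delete_batch: Callable[[List[str]], None],  # hypothetical bulk-delete hook
    size: int = 100,
) -> int:
    """Hand `paths` to `delete_batch` in chunks; return how many were passed on."""
    deleted = 0
    for batch in batched(paths, size):
        delete_batch(batch)  # one backend round trip per chunk, not per file
        deleted += len(batch)
    return deleted
```

Because `batched` consumes its input lazily, a long container listing does not have to be held in memory before the first chunk is deleted.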
[21:03:22] anomie: Should be, yeah [21:03:34] Reedy: Thanks, I'll start my bugfix deploy then [21:13:30] !log anomie synchronized php-1.23wmf2/skins/common/upload.js 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:13:43] !log anomie synchronized php-1.23wmf2/skins/common/protect.js 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:13:45] Logged the message, Master [21:13:57] !log anomie synchronized php-1.23wmf2/resources/Resources.php 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:14:04] Logged the message, Master [21:14:10] !log anomie synchronized php-1.23wmf2/maintenance/jsduck/config.json 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:14:21] Logged the message, Master [21:14:24] !log anomie synchronized php-1.23wmf2/maintenance/jsduck/external.js 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:14:28] (03PS10) 10Dr0ptp4kt: Add an extra header for cache variance of W0 banners for proxies. 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/88261 [21:14:39] !log anomie synchronized php-1.23wmf2/resources/jquery/jquery.spinner.js 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:14:40] Logged the message, Master [21:14:53] !log anomie synchronized php-1.23wmf3/skins/common/upload.js 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:14:58] Logged the message, Master [21:15:07] !log anomie synchronized php-1.23wmf3/skins/common/protect.js 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:15:16] Logged the message, Master [21:15:20] !log anomie synchronized php-1.23wmf3/resources/Resources.php 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:15:32] Logged the message, Master [21:15:36] !log anomie synchronized php-1.23wmf3/maintenance/jsduck/config.json 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:15:46] Logged the message, Master [21:15:49] !log anomie synchronized php-1.23wmf3/maintenance/jsduck/external.js 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:15:58] (03PS11) 10Dr0ptp4kt: Add an extra header for cache variance of W0 banners for proxies. [operations/puppet] - 10https://gerrit.wikimedia.org/r/88261 [21:16:04] !log anomie synchronized php-1.23wmf3/resources/jquery/jquery.spinner.js 'Backport gerrit change 94161 to fix regression since 1.23wmf1' [21:16:04] Logged the message, Master [21:16:16] * anomie is done deploying now [21:16:25] Logged the message, Master [21:16:44] Logged the message, Master [21:17:04] Logged the message, Master [21:20:32] (03CR) 10Chad: "Silly MediaWiki." [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93988 (owner: 10Jdlrobson) [21:22:51] dr0ptp4kt: mind comenting on https://gerrit.wikimedia.org/r/#/c/92288/ please? [21:25:51] matanya, you'll want to check with Mr. Liambotis and yurik on that. i'm not in a good position to comment on that patchset. 
[21:26:33] dr0ptp4kt: funny, yurik added you as the point to contact :) thanks anyway [21:27:38] matanya, apologies. i guess that means it would be best if Mr. Liambotis had the final word on the patchset. he'd need to do the approving (i don't have approval rights, and don't feel comfortable approving it without Mr. Liambotis's approval, anyway). [21:28:58] dr0ptp4kt: he already commented, so i guess, there is someone who knows :) [21:30:20] yeah :( [21:59:45] (03PS1) 10Dr0ptp4kt: Load ZeroRatedMobileAccess only where currently supported. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94250 [22:00:10] PROBLEM - MySQL Processlist on db1010 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 65 copy to table, 3 statistics [22:01:01] PROBLEM - MySQL Processlist on db1019 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 71 copy to table, 4 statistics [22:01:20] PROBLEM - MySQL Processlist on db1003 is CRITICAL: CRIT 1 unauthenticated, 0 locked, 66 copy to table, 1 statistics [22:02:30] PROBLEM - MySQL Slave Delay on db1035 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [22:02:30] PROBLEM - MySQL Processlist on db1035 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [22:02:30] PROBLEM - puppet disabled on db1035 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [22:02:40] PROBLEM - RAID on db1035 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. 
[22:03:10] PROBLEM - MySQL Processlist on db1010 is CRITICAL: CRIT 3 unauthenticated, 0 locked, 79 copy to table, 2 statistics [22:03:10] PROBLEM - Apache HTTP on mw1151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [22:03:20] PROBLEM - MySQL Processlist on db1003 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 75 copy to table, 7 statistics [22:03:21] RECOVERY - MySQL Slave Delay on db1035 is OK: OK replication delay 98 seconds [22:03:21] RECOVERY - MySQL Processlist on db1035 is OK: OK 0 unauthenticated, 0 locked, 2 copy to table, 0 statistics [22:03:21] RECOVERY - puppet disabled on db1035 is OK: OK [22:03:30] RECOVERY - RAID on db1035 is OK: OK: optimal, 1 logical, 2 physical [22:03:31] PROBLEM - Apache HTTP on mw1150 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [22:03:40] mutante: ^ [22:03:49] suddenly things seem slow [22:04:01] RECOVERY - Apache HTTP on mw1151 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 2.573 second response time [22:04:01] PROBLEM - MySQL Processlist on db1019 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 77 copy to table, 3 statistics [22:04:01] maybe it will fix itself or it's just me [22:04:53] Failed to load resource: the server responded with a status of 503 (Service Unavailable) https://bits.wikimedia.org/en.wikipedia.org/load.php... [22:05:18] seems loading now [22:05:21] RECOVERY - Apache HTTP on mw1150 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.054 second response time [22:07:22] (03CR) 10MaxSem: [C: 04-1] "(2 comments)" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94250 (owner: 10Dr0ptp4kt) [22:09:16] (03PS1) 10Andrew Bogott: Move android::sdk and packages::ant18 into contint module. [operations/puppet] - 10https://gerrit.wikimedia.org/r/94257 [22:09:31] Reedy, thanks for the merge, but I see no logo: https://wikimania2014.wikimedia.org/wiki/Main_Page - I don't know if it's just me? 
[22:13:05] TimStarling: same here [22:13:26] also, the weird logos and font styles make it look like scam [22:13:48] * Thehelpfulone [22:13:53] (03PS2) 10Andrew Bogott: Move android::sdk and packages::ant18 into contint module. [operations/puppet] - 10https://gerrit.wikimedia.org/r/94257 [22:13:55] sorry ts [22:14:56] on the main page? I think that's Ed playing aroudn with things [22:15:19] hmm The source file 'Wikimania_2014_S hard_logo_v3_with_logotype_and_date_(small).svg' does not exist. [22:17:03] ah nvm that was a copy/paste fail: the new logo's at https://upload.wikimedia.org/wikipedia/commons/thumb/0/06/Wikimania_2014_Shard_logo_v3_with_logotype_and_date_(small).svg/135px-Wikimania_2014_Shard_logo_v3_with_logotype_and_date_(small).svg.png I believe [22:17:28] (03CR) 10Chad: "Actually if we can abstract away all the MW dependencies, I'd like to just symlink the files. The logic hasn't changed in ages and isn't l" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93622 (owner: 10Chad) [22:20:16] !log Reloading zuul config for new oojs-core and oojs-ui pipelines [22:20:35] Logged the message, Mr. Obvious [22:25:04] PROBLEM - MySQL Processlist on db1021 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 72 copy to table, 2 statistics [22:26:57] (03PS4) 10Faidon Liambotis: Add all Asian countries in the list [operations/dns] - 10https://gerrit.wikimedia.org/r/80974 [22:27:58] Thehelpfulone: Chrome says it's an invalid property value [22:28:04] RECOVERY - MySQL Processlist on db1021 is OK: OK 0 unauthenticated, 0 locked, 27 copy to table, 19 statistics [22:29:06] Seems to be suggesting it's only using //upload.wikimedia.org/wikipedia/commons/thumb/0/06/Wikimania_2014_Shard_logo_v3_with_logotype_and_date_(small) [22:30:05] RECOVERY - MySQL Processlist on db1010 is OK: OK 0 unauthenticated, 0 locked, 32 copy to table, 3 statistics [22:30:28] anyone know, off the top of their head, a 10G eqiad machine that is unused? (RobH cmjohnson1?) 
[22:30:40] by unused i mean not serving user traffic and if it became congested would not hurt the site [22:31:25] RECOVERY - MySQL Processlist on db1003 is OK: OK 0 unauthenticated, 0 locked, 31 copy to table, 1 statistics [22:33:23] Thehelpfulone: I'm guessing it's related to teh () [22:34:52] Reedy: heh [22:34:58] Reedy: it probably is [22:35:04] PROBLEM - MySQL Processlist on db1019 is CRITICAL: CRIT 0 unauthenticated, 0 locked, 40 copy to table, 79 statistics [22:35:06] definitely* [22:35:08] okay, do we need to convert that to %28 then in the patch, or should we rename the image? [22:35:24] can try encoding them first [22:35:25] encode them [22:35:38] or quote the patch [22:36:06] path* [22:37:04] RECOVERY - MySQL Processlist on db1019 is OK: OK 0 unauthenticated, 0 locked, 32 copy to table, 4 statistics [22:40:34] !log reedy synchronized wmf-config/InitialiseSettings.php [22:40:50] Logged the message, Master [22:41:23] (03PS1) 10Reedy: Fix wikimania2014wiki logo [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94272 [22:41:52] (03CR) 10Reedy: [C: 032] Fix wikimania2014wiki logo [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94272 (owner: 10Reedy) [22:49:03] https://www.mediawiki.org/wiki/CI/JJB [22:49:06] GET https://bits.wikimedia.org/static-current/extensions/UniversalLanguageSelector/data/fontrepo/fonts/Autonym/Autonym.woff?version=20131104 404 (Not Found) JJB:1 [22:49:06] GET https://bits.wikimedia.org/static-current/extensions/UniversalLanguageSelector/data/fontrepo/fonts/Autonym/Autonym.ttf?version=20131104 404 (Not Found) JJB:1 [22:49:14] Nemo_bis: [22:49:35] Recent regression. Did anyone deploy this recently? [22:51:24] I bet I know what it is... [22:51:26] ori-l: ^^ [22:51:37] fonts are there in master [22:51:58] lrwxrwxrwx 1 reedy wikidev 54 Nov 7 19:11 extensions -> /usr/local/apache/common-local/php-1.23wmf1/extensions [22:52:08] Fonts are not there in 1.23wmf1 [22:54:00] autonym died? 
good riddance [22:54:11] No [22:54:14] It's there in master [22:54:29] https://git.wikimedia.org/history/mediawiki%2Fextensions%2FUniversalLanguageSelector.git/9dd4072d3d36722aed91bab29d0769a6dc152df3/data%2Ffontrepo%2Ffonts%2FAutonym [22:54:33] Added 6 days ago [22:55:04] santhosh was supposedly doing something to load it in a different way [22:55:31] where these static-(current|stable) symlinks come in.. [22:55:46] hence why I pinged ori-l [22:56:00] where's that bug… [22:56:56] ori-l [22:56:57] ori-l [22:56:57] ori-l [22:57:21] https://bugzilla.wikimedia.org/show_bug.cgi?id=56514#c2 [22:57:29] related? [23:00:35] Dunno [23:00:38] * Reedy looks at mw.o [23:03:45] (03CR) 10jenkins-bot: [V: 04-1] Fix wikimania2014wiki logo [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94272 (owner: 10Reedy) [23:04:14] (03PS2) 10Reedy: Fix wikimania2014wiki logo [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94272 [23:04:42] (03CR) 10Reedy: [C: 032] Fix wikimania2014wiki logo. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94272 (owner: 10Reedy) [23:05:12] Reedy: sorry, missed your ping, looking now [23:06:19] so deployment instructions should be updated to say that -current symlinks must point to the most recent (e.g. current or next) version, e.g. the version of group0? [23:06:47] Krinkle: the symlinks are updated automatically [23:06:52] Reedy: Why not? static-current is a hack, but whatever. it does rely on it being the most recent version available in deployments. [23:07:06] Apparently not, Reedy says it points to wmf1, we've got wmf3 live on mw.o [23:07:20] i'll run the update script, then. [23:07:54] Uh [23:08:02] It is pointing at 1.23wmf2 [23:08:06] But the fonts are still not there [23:08:12] because they were added in 1.23wmf3 [23:08:38] reedy@tin:/a/common$ ls -al docroot/bits/static-current/ [23:08:38] total 8 [23:08:38] drwxrwxr-x 2 reedy wikidev 4096 Nov 7 19:11 . 
[23:08:38] drwxrwxr-x 22 midom wikidev 4096 Nov 7 19:11 .. [23:08:38] lrwxrwxrwx 1 reedy wikidev 54 Nov 7 19:11 extensions -> /usr/local/apache/common-local/php-1.23wmf2/extensions [23:08:39] lrwxrwxrwx 1 reedy wikidev 53 Nov 7 19:11 resources -> /usr/local/apache/common-local/php-1.23wmf2/resources [23:08:40] why isn't static-current pointing at 1.23wmf3? [23:08:41] lrwxrwxrwx 1 reedy wikidev 49 Nov 7 19:11 skins -> /usr/local/apache/common-local/php-1.23wmf2/skins [23:08:43] reedy@tin:/a/common$ ls -al docroot/bits/static-stable/ [23:08:46] total 8 [23:08:48] drwxrwxr-x 2 reedy wikidev 4096 Nov 7 19:11 . [23:08:50] drwxrwxr-x 22 midom wikidev 4096 Nov 7 19:11 .. [23:08:52] lrwxrwxrwx 1 reedy wikidev 54 Nov 7 19:11 extensions -> /usr/local/apache/common-local/php-1.23wmf1/extensions [23:08:54] lrwxrwxrwx 1 reedy wikidev 53 Nov 7 19:11 resources -> /usr/local/apache/common-local/php-1.23wmf1/resources [23:08:57] lrwxrwxrwx 1 reedy wikidev 49 Nov 7 19:11 skins -> /usr/local/apache/common-local/php-1.23wmf1/skins [23:09:15] Reedy: ori-l: so the -current symlink should be updated when checking out a new version, right? [23:09:20] yes [23:09:34] what is static-stable? [23:09:36] It is updated [23:09:37] multiversion/updateBitsBranchPointers [23:09:49] wait, we have both? when was that approved. [23:09:58] static-stable should die [23:10:04] k [23:10:08] https://git.wikimedia.org/commit/operations%2Fmediawiki-config.git/d73e0ff0e5f6ea9c7c9d476c8e01dee516c102d7 [23:10:10] i added it, but nothing is using it, and the idea was ill-conceived [23:10:30] 'static-current' works, though, and should point at the latest branch [23:10:42] k. I can see a use for it, but this is the first I've seen it, so just a bit surprised :) [23:11:20] Reedy: that commit sohuld've updated to wmf3 instead of wmf2 I guess? 
[23:11:25] Maybe [23:11:27] I didn't do it [23:11:30] checkoutMediaWiki did [23:11:41] current meaning the newest doesn't make sense [23:11:56] we could replace both with 'latest', if you prefer [23:11:59] Reedy: maybe, but it does if you know what it is for. [23:12:06] Not really [23:12:35] It's meant for static resources that rarely change (e.g. append-only if you will). ULS fonts only for the moment. [23:12:46] static-static [23:13:06] which means it must point at the latest, otherwise you're bound to get 404 errors everytime you add somethign. [23:14:02] * ori-l looks to see if i can retrace why it updated to wmf2 and not wmf3 [23:14:22] You've got the output in PM ;) [23:14:28] the problem is basically that we don't have a proper versioning for resources outside api.php and load.php, because those are only accessible via extensions-{version} or static-{version}, which invalites each wmf version bump [23:19:39] Reedy: what do you run after checkoutMediaWiki? [23:19:59] Nothing [23:20:03] just sync it etc [23:23:48] (03Merged) 10jenkins-bot: Fix wikimania2014wiki logo. [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/94272 (owner: 10Reedy) [23:24:39] (03CR) 10Aaron Schulz: "I wouldn't symlink them to the utils/ files unless those were moved to /lib first" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93622 (owner: 10Chad) [23:26:08] (03CR) 10Aaron Schulz: "Actually I'm not sure how you'd know where to symlink to though (which MW version)...unless it was updated in checkoutMediaWiki" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/93622 (owner: 10Chad) [23:26:39] !log ori synchronized docroot/bits/static-current 'Update static-current symlinks to 1.23wmf3' [23:26:56] Logged the message, Master [23:28:07] PROBLEM - MySQL Slave Running on db1047 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. 
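The expectation stated above, that static-current should follow the newest deployed branch, can be sketched as a version-aware comparison of branch directory names. This is an illustrative Python sketch, not the logic of multiversion/updateBitsBranchPointers: the function names are hypothetical, and the only thing taken from this log is the `php-1.23wmfN` naming seen in the ls output.

```python
import re
from typing import Iterable, Optional, Tuple

# Matches directory names such as "php-1.23wmf3" (pattern assumed from the log).
BRANCH_RE = re.compile(r"^php-(\d+)\.(\d+)wmf(\d+)$")


def branch_key(name: str) -> Optional[Tuple[int, ...]]:
    """Return (major, minor, wmf) as integers, or None if `name` is not a branch."""
    match = BRANCH_RE.match(name)
    if match is None:
        return None
    return tuple(int(part) for part in match.groups())


def latest_branch(names: Iterable[str]) -> Optional[str]:
    """Pick the newest branch by numeric comparison, ignoring unrelated names."""
    best_key = None
    best_name = None
    for name in names:
        key = branch_key(name)
        if key is not None and (best_key is None or key > best_key):
            best_key, best_name = key, name
    return best_name
```

Comparing integer tuples rather than raw strings matters once a branch number reaches two digits: as strings, "wmf10" would sort before "wmf9".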
[23:28:16] (PS1) Ori.livneh: Update static-current symlinks to 1.23wmf3 [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94277 [23:28:57] RECOVERY - MySQL Slave Running on db1047 is OK: OK replication Slave_IO_Running: Yes Slave_SQL_Running: Yes Last_Error: [23:30:51] ori-l and RoanKattouw, is https://gerrit.wikimedia.org/r/94250 okay? we only currently want w0 banners showing on wikipedia (sister projects to be supported later). and we only want configuration pages on meta. i was trying to figure out whether the page calls to RL at http://bits.wikimedia.org/.wikipedia.org/load.php?... would "just work" or if there was additional necessary configuration to consider. [23:32:00] It's hard to know what you're asking without knowing what that configuration variable controls [23:33:50] (CR) Andrew Bogott: "tested on labs, should be a no-op." [operations/puppet] - https://gerrit.wikimedia.org/r/94257 (owner: Andrew Bogott) [23:45:01] ori-l, just walked over to talk with RoanKattouw about it. it's used in the other mediawiki-config's wmf-config/mobile.php to include the extension in the first place. i didn't notice it before i walked over to Roan's desk, but MaxSem noted that we may need to still explicitly disable wikidata. i'll step through the code to figure it out. Roan said the RL stuff should just work fine, otherwise. [23:46:50] (PS1) Ori.livneh: Fix how 'current' branch is determined in updateBitsBranchPointers [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94280 [23:47:12] dr0ptp4kt, although... there's a safeguard now that prevents Zero from being enabled on wikis without MF [23:47:19] ^ Krinkle, Reedy [23:48:00] (CR) Reedy: [C: 2] Fix how 'current' branch is determined in updateBitsBranchPointers [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94280 (owner: Ori.livneh) [23:48:59] MaxSem, which is good.
want to add even another layer of checks just to be sure :) [23:49:37] Don't add too many [23:49:48] It'll be more expensive to find out if it can run than running [23:50:31] hey, i'm going to remove ulsfo from dns [23:53:43] (PS1) Lcarr: draining ulsfo of all traffic until TiNet link is confirmed working [operations/dns] - https://gerrit.wikimedia.org/r/94283 [23:54:16] !log draining ulsfo of all traffic [23:54:21] (Merged) jenkins-bot: Fix how 'current' branch is determined in updateBitsBranchPointers [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/94280 (owner: Ori.livneh) [23:54:31] Logged the message, Mistress of the network gear. [23:56:19] (CR) Lcarr: [C: 2] draining ulsfo of all traffic until TiNet link is confirmed working [operations/dns] - https://gerrit.wikimedia.org/r/94283 (owner: Lcarr) [23:56:25] (CR) Lcarr: [V: 2] draining ulsfo of all traffic until TiNet link is confirmed working [operations/dns] - https://gerrit.wikimedia.org/r/94283 (owner: Lcarr)