[00:00:43] paravoid: there is a list of bogus files in aaron/commons-boguslistings on terbium [00:01:18] every one I look at seems to have been deleted by an admin [00:01:31] maybe the containers just failed to update [00:08:08] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 00:07:59 UTC 2013 [00:08:18] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [00:09:18] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 00:09:09 UTC 2013 [00:09:18] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [00:10:18] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 00:10:11 UTC 2013 [00:11:18] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [00:11:58] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 00:11:56 UTC 2013 [00:12:18] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [00:12:48] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 00:12:39 UTC 2013 [00:13:18] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [00:13:18] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 00:13:13 UTC 2013 [00:14:18] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [00:14:48] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 00:14:46 UTC 2013 [00:15:18] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [00:16:21] !log installing package-upgrades on zirconium [00:16:29] Logged the message, Master [00:20:05] !log added myself to root email alias at mchenry [00:20:13] :) there you go Alex [00:20:13] Logged the message, Master [00:31:25] akosiaris: Next you can get it setup so you get a LOADS of messages to your phone from icinga! ;) [00:32:13] Reedy: getting there.... but i think I will create some filters first ;-) [00:38:38] Reedy (you vampire!) any idea about the 1.22wmf4 messages? Is there some update localization cache step for 1.22wmf4 that has to run? [00:41:52] New patchset: Tim Starling; "Fix rsyncd.conf filename" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63600 [00:43:24] spagewmf: There shouldn't be. At this point I think it's worth waiting for localisation update to run tonight (next couple of hours?) and see if that fixes it [00:44:08] Reedy thanks for the update. [00:44:23] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63600 [00:59:23] New patchset: Bsitu; "Add new eventlogging schema:EchoMail" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63602 [00:59:48] !log tstarling synchronized README [00:59:56] Logged the message, Master [01:03:07] New patchset: Tim Starling; "Copy the new rsyncd "hosts allow" line from nfs1 to tin" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63603 [01:04:33] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63603 [01:12:36] !log tstarling synchronized README [01:12:44] Logged the message, Master [01:25:37] argh [01:26:14] running dsh pegs the CPU at 100% on my ssh-agent instance [01:27:10] for a long time [02:11:07] !log LocalisationUpdate failed: mwversionsinuse returned empty list [02:11:15] Logged the message, Master [02:12:11] !log LocalisationUpdate completed (1.22wmf3) at Tue May 14 02:12:10 UTC 2013 [02:12:18] Logged the message, Master [02:31:15] !log LocalisationUpdate completed (1.22wmf4) at Tue May 14 02:31:15 UTC 2013 [02:31:23] Logged the message, Master [02:51:07] New patchset: Jforrester; "Enable VisualEditor on all content namespaces for MW.org" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63621 [03:09:49] !log LocalisationUpdate ResourceLoader cache refresh completed at Tue May 14 03:09:49 UTC 2013 [03:09:56] Logged the message, Master [03:13:32] New patchset: coren; "Tool Labs: Email! (@toollabs.org)" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63624 [03:15:39] New review: coren; "I can haz domain!" [operations/puppet] (production) C: 2; - https://gerrit.wikimedia.org/r/63624 [03:15:39] Change merged: coren; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63624 [03:19:07] New patchset: Tim Starling; "Switch back to fenari agent" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63625 [03:20:23] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63625 [03:36:14] New patchset: Tim Starling; "Remove some node lists" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/56108 [03:39:01] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/56108 [03:51:38] New patchset: Tim Starling; "Add puppetized dsh on fenari and bast1001" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63626 [03:53:36] New patchset: Tim Starling; "Add puppetized dsh on fenari and bast1001" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63626 [03:54:24] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63626 [03:57:39] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:58:29] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 1.086 second response time [04:01:39] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [04:02:26] !log tstarling synchronized cgi-bin [04:02:29] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.125 second response time [04:02:34] Logged the message, Master [04:08:04] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 04:07:58 UTC 2013 [04:08:14] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [04:09:14] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 04:09:07 UTC 2013 [04:09:14] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [04:10:14] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 04:10:11 UTC 2013 [04:11:14] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [04:12:04] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 04:11:57 UTC 2013 [04:12:14] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [04:12:44] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 04:12:41 UTC 2013 [04:13:14] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [04:13:24] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 04:13:15 UTC 2013 [04:14:14] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [04:15:14] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 04:15:04 UTC 2013 [04:15:14] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [04:34:06] !log moved scap to tin. Copied source from NFS to tin. Fixed up broken git submodule config using sed. [04:34:14] Logged the message, Master [04:34:56] PROBLEM - Puppet freshness on db45 is CRITICAL: No successful Puppet run in the last 10 hours [04:37:56] PROBLEM - Puppet freshness on db26 is CRITICAL: No successful Puppet run in the last 10 hours [05:06:41] New patchset: Tim Starling; "Add tin to mediawiki-installation" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63627 [05:09:35] New patchset: Tim Starling; "Migrate scap-1, scap-2, & sync-common from wikimedia-task-appserver" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/57854 [05:18:23] PROBLEM - Puppet freshness on virt3 is CRITICAL: No successful Puppet run in the last 10 hours [05:18:55] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/57854 [05:19:54] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63627 [05:24:41] New patchset: Tim Starling; "Update rsyncd location in scap client scripts" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63629 [05:25:39] Change merged: Tim Starling; [operations/debs/wikimedia-task-appserver] (master) - https://gerrit.wikimedia.org/r/58671 [05:26:13] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63629 [05:26:33] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:27:23] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.142 second response time [05:33:43] New patchset: Tim Starling; "Don't put scap scripts in the root directory" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63630 [05:34:02] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63630 [05:36:36] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [05:37:26] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.131 second response time [05:57:16] PROBLEM - Puppet freshness on ms-fe3001 is CRITICAL: No successful Puppet run in the last 10 hours [06:01:36] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [06:02:26] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.126 second response time [06:22:04] New patchset: Tim Starling; "Move in the remaining scap scripts from wikimedia-task-appserver" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63632 [06:25:03] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63632 [06:30:29] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 06:30:21 UTC 2013 [06:30:29] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [06:31:09] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 06:31:06 UTC 2013 [06:31:29] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [06:31:49] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 06:31:46 UTC 2013 [06:31:49] New patchset: Tim Starling; "Remove duplicate of mwversionsinuse" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63633 [06:32:29] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [06:40:15] !log tstarling synchronized README [06:40:24] Logged the message, Master [06:51:43] PROBLEM - Puppet freshness on db44 is CRITICAL: No successful Puppet run in the last 10 hours [07:02:33] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:03:23] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.124 second response time [07:03:41] New patchset: Tim Starling; "Updates for migration to tin" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63635 [07:06:44] New review: Tim Starling; "Sorry, can't wait for review. Will deploy carefully." [operations/mediawiki-config] (master); V: 2 C: 2; - https://gerrit.wikimedia.org/r/63635 [07:07:06] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63635 [07:11:26] !log tstarling synchronized docroot/noc/db.php [07:11:28] Logged the message, Master [07:13:48] !log tstarling synchronized multiversion [07:13:56] Logged the message, Master [07:14:51] !log tstarling synchronized refresh-dblist [07:14:59] Logged the message, Master [07:15:29] !log tstarling synchronized w/MWVersion.php [07:15:37] Logged the message, Master [07:28:58] New patchset: Tim Starling; "Running commands as apache is required for deployment" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63637 [07:29:21] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63633 [07:29:38] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63637 [07:33:27] !log tstarling Started syncing Wikimedia installation... : [07:33:35] Logged the message, Master [07:36:34] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:37:24] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.126 second response time [07:43:58] !log tstarling Started syncing Wikimedia installation... : [07:44:06] Logged the message, Master [07:47:26] scap is not quite working yet [07:47:34] back shortly [07:57:34] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [07:57:54] apergos: and finally here I am [07:58:05] hey [07:58:12] well our slot isn't for a while yet [07:58:24] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.137 second response time [07:59:22] we have 10 am utc, which is in a couple hours, the i18n thing should be happening soon [08:00:09] apergos: I guess we can start as soon as i18n as finished [08:00:25] yep [08:02:16] Nikerabbit: hi, are you using your i18n deployment slot today? :) [08:07:15] you know that scap isn't working right? [08:07:51] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 08:07:47 UTC 2013 [08:08:01] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [08:08:28] TimStarling: I guess they are not deploying any i18n changes today [08:08:31] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 08:08:27 UTC 2013 [08:09:01] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [08:09:01] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 08:08:59 UTC 2013 [08:09:04] when do you want it working by? [08:09:22] we won't be using scap for our window [08:09:34] (puppet, a bit of dsh) [08:10:01] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [08:11:10] !log tstarling Started syncing Wikimedia installation... : [08:11:18] Logged the message, Master [08:15:11] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 08:15:03 UTC 2013 [08:16:01] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [08:18:27] New patchset: Tim Starling; "Update location of find-nearest-rsync in sudoers" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63639 [08:18:43] Change merged: Tim Starling; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63639 [08:22:31] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [08:23:22] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.125 second response time [08:24:01] PROBLEM - Puppet freshness on mc15 is CRITICAL: No successful Puppet run in the last 10 hours [08:25:01] PROBLEM - Puppet freshness on colby is CRITICAL: No successful Puppet run in the last 10 hours [08:25:59] !log tstarling Started syncing Wikimedia installation... : [08:26:03] 4th time lucky? [08:26:07] Logged the message, Master [08:33:42] !log tstarling Finished syncing Wikimedia installation... : [08:33:49] Logged the message, Master [08:34:28] woot. [08:36:23] !log tstarling synchronized php-1.22wmf3/LocalSettings.php 'remove testwiki special case' [08:37:37] !log tstarling synchronized php-1.22wmf4/LocalSettings.php 'remove testwiki special case' [08:45:40] ok, I'm done for the day [08:45:59] oh, except I should chase that apt issue I guess [08:46:20] congrats tim ! [08:46:26] thank you [08:49:36] yay [08:50:28] Logged the message, Master [08:50:37] Logged the message, Master [08:55:32] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [08:57:23] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.132 second response time [09:03:09] AaronSchulz: I'm looking at that list of yours [09:03:26] I've tried to find a bunch of those on ms7 and have found 0 so far [09:03:42] I'm about to script it, but I don't have much hope [09:06:44] have you tried ms1001? [09:10:32] found 44 out of 5079 [09:16:39] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [09:17:29] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.127 second response time [09:36:35] !log depooled ssl1 in pmtpa for testing [09:36:43] Logged the message, Master [09:51:14] testing? [09:51:24] hashar's work? [09:52:08] paravoid: yeah Ariel is deploying the new manifests for the proto proxies :-] [09:57:43] !log puppet temp disabled on all ssl terminators except ssl1 in pmtpa [09:57:50] Logged the message, Master [09:58:51] PROBLEM - Puppet freshness on lvs1004 is CRITICAL: No successful Puppet run in the last 10 hours [09:58:51] PROBLEM - Puppet freshness on lvs1005 is CRITICAL: No successful Puppet run in the last 10 hours [09:58:51] PROBLEM - Puppet freshness on lvs1006 is CRITICAL: No successful Puppet run in the last 10 hours [10:00:08] ppooor puppet [10:05:23] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/62966 [10:06:10] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/62973 [10:08:03] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/62976 [10:18:20] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/62977 [10:20:55] New patchset: Hashar; "protoproxy: mobile + beta support" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63431 [10:22:54] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63431 [10:42:10] !log repooled ssl1 [10:42:19] Logged the message, Master [10:51:07] !log re-enabling puppet on ssl1001 for a bit of real traffic [10:51:15] Logged the message, Master [10:56:04] http://ganglia.wikimedia.org/latest/?c=SSL%20cluster%20eqiad&h=ssl1001.wikimedia.org&m=cpu_report&r=hour&s=by%20name&hc=4&mc=2 [10:56:07] still working :D [11:07:56] New patchset: Hashar; "beta: enable HTTPS on all projects" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63644 [11:13:35] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:14:25] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.124 second response time [11:15:35] PROBLEM - DPKG on mc15 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [11:16:35] RECOVERY - DPKG on mc15 is OK: All packages OK [11:20:16] !reenabling puppet and reloading nginx on all ssl terminators [11:23:25] PROBLEM - RAID on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [11:23:35] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:23:40] hm [11:24:01] dos on stafford? :D [11:24:12] no, stafford flaps a lot sadly [11:24:16] it is likely overworked [11:24:28] I'm just yeing the snapshot2 message, will have to check on that [11:24:34] yeah as I said, the catalog are not cached :-D [11:24:52] we should cache them by git revision instead of an ever changing timestamp [11:25:26] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.129 second response time [11:26:47] apergos: final step for beta is https://gerrit.wikimedia.org/r/63644 :-D [11:26:59] that will enable the nginx config for domains besides bits.beta.wmflabs.org [11:27:03] snap looks ok [11:27:17] greedyguts :-P [11:27:26] let's see it [11:28:46] fine fine :-P [11:29:47] hashar: nope [11:29:54] Nikerabbit: :-] [11:30:03] didn't it read in the calendar? [11:30:03] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63644 [11:30:07] Nikerabbit: asked because we had a window just after your [11:31:09] no, the calendar listed you guys with the regular deployment slot [11:31:16] weird [11:31:21] I did email greg [11:31:31] no worries, looks like we are done now [11:32:35] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:33:25] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.125 second response time [11:33:35] PROBLEM - DPKG on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [11:34:22] andre__: does https://bugzilla.wikimedia.org/show_bug.cgi?id=48072#c2 mean we should be removing patch-in-gerrit after marking bugs RESOLVED FIXED? [11:34:25] RECOVERY - DPKG on snapshot2 is OK: All packages OK [11:35:19] odder: I wouldn't - I don't see any win, and I didn't do that mass removing myself [11:36:35] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:37:25] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.124 second response time [11:41:05] PROBLEM - HTTP radosgw on ms-fe1004 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 758 bytes in 0.003 second response time [11:42:22] New patchset: Hashar; "labs: hardcode nginx server_names_hash_bucket_size to 64" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63646 [11:45:26] New patchset: Hashar; "labs: hardcode nginx server_names_hash_bucket_size to 64" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63646 [11:46:13] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63646 [11:46:41] \O/ [11:46:42] andre__: thanks [11:50:35] PROBLEM - RAID on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [11:52:22] hm, interesting to see we now have 300 more reports open than a month ago, andre__ [11:53:35] PROBLEM - RAID on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [11:53:35] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:53:40] odder, normal growth I'd say [11:53:48] odder, https://bugzilla.wikimedia.org/reports.cgi?product=-All-&datasets=UNCONFIRMED&datasets=NEW&datasets=ASSIGNED&datasets=REOPENED&banner=1 [11:54:03] oh, that's neat [11:54:25] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.124 second response time [11:57:35] PROBLEM - RAID on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [11:59:17] PROBLEM - SSH on snapshot2 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [11:59:35] PROBLEM - Disk space on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [11:59:35] PROBLEM - DPKG on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [12:03:11] crap [12:03:25] RECOVERY - Disk space on snapshot2 is OK: DISK OK [12:03:26] RECOVERY - DPKG on snapshot2 is OK: All packages OK [12:03:26] memory. hopefully it will oom one of those [12:04:05] RECOVERY - SSH on snapshot2 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [12:04:35] PROBLEM - RAID on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [12:25:39] New patchset: Faidon; "Ceph: move osd min down reporters to [mon]" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63652 [12:25:49] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63651 [12:26:01] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63652 [12:26:21] RECOVERY - SSH on snapshot2 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [12:28:07] PROBLEM - SSH on snapshot2 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [12:33:07] RECOVERY - SSH on snapshot2 is OK: SSH OK - OpenSSH_5.3p1 Debian-3ubuntu7 (protocol 2.0) [12:35:27] RECOVERY - Disk space on snapshot2 is OK: DISK OK [12:35:28] RECOVERY - DPKG on snapshot2 is OK: All packages OK [12:36:39] PROBLEM - RAID on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [12:38:02] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/61636 [12:38:31] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/61558 [12:39:39] PROBLEM - Disk space on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [12:40:39] PROBLEM - DPKG on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [12:40:39] PROBLEM - RAID on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [12:40:46] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/62434 [12:44:39] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [12:45:29] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.135 second response time [12:47:33] New patchset: Hashar; "contint: install colordiff" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63130 [12:48:19] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63130 [12:48:39] PROBLEM - RAID on snapshot2 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [12:51:30] RECOVERY - Disk space on snapshot2 is OK: DISK OK [12:51:30] RECOVERY - DPKG on snapshot2 is OK: All packages OK [13:02:43] New review: Faidon; "allow_xff is currently all of the Wikimedia networks. I don't think we want to do such magic on all ..." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/62103 [13:10:31] New review: Faidon; "Thanks for doing this! FTR, I'm not actually reviewing all that, I'll just trust your script :)" [operations/puppet] (production) C: 2; - https://gerrit.wikimedia.org/r/63500 [13:10:32] Change merged: Faidon; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63500 [13:14:13] New review: Mark Bergsma; "Looks very good, very thorough and extensive!" [operations/software/varnish/vhtcpd] (master) C: 1; - https://gerrit.wikimedia.org/r/60390 [13:49:05] apergos: if you have some time can you have a look at https://gerrit.wikimedia.org/r/#/c/63220/ [13:52:04] can I look at this after my lunch actually? [13:52:19] need brain food [13:52:44] !log Zuul: applying a patch to prevent it from fetching a change multiple time. [13:52:53] Logged the message, Master [13:53:05] !log running dns update [13:53:12] Logged the message, Master [13:54:14] apergos:of course! [13:57:42] Change merged: Ottomata; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63220 [14:07:18] New patchset: Reedy; "Set Thai wikis to uca-default collation" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63661 [14:08:58] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63661 [14:12:25] !log reedy synchronized wmf-config/InitialiseSettings.php 'thwiki collation' [14:12:33] Logged the message, Master [14:16:13] someone didn't want to wait :-D [14:16:39] but your input is still very much appreciated :) [14:17:39] sreedy@tin:~$ sql wikidatawiki [14:17:39] /usr/local/bin/sql: line 19: mysql: command not found [14:18:16] Can someone put the mysql client stuff on tin please? It's not on bast1001 either, though I'm not sure it should be [14:20:45] Ah, I can make the changeset [14:21:56] New patchset: Reedy; "Add generic::mysql::packages::client to tin" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63663 [14:22:53] Reedy: [14:22:57] mysql on terbium [14:22:59] not on tin [14:23:15] eh? [14:23:17] tin is apparently just for deployment [14:23:27] not for random mysql queries [14:23:53] <^demon> What about maintenance scripts? [14:23:58] terbium [14:24:01] it's the new hume [14:24:04] all that stuff will be there [14:24:10] <^demon> Mmk. [14:24:30] Ugh. So I have to pop another shell and/or change host just to run an sql query? :/ [14:24:35] Long live fenari! ;) [14:24:43] just leave a window open over there [14:24:45] no biggie [14:26:07] Hmm. sql.php? :D [14:26:37] * apergos glowers [14:27:02] so I guess I shouldn't talk about the dark launch in here then... [14:27:08] or everyone will want it [14:27:10] :-P [14:27:44] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [14:29:11] Maybe i should just setup one of the putty window management tools. AFAIK they can automatically open numerous windows [14:29:35] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.126 second response time [14:29:39] sorta like your channel autojoin yeah [14:30:35] also that stringutils entry is hilarious [14:31:06] And typically there's at least 5 options.. [14:35:04] PROBLEM - Puppet freshness on db45 is CRITICAL: No successful Puppet run in the last 10 hours [14:37:14] <^demon> apergos: We should do that for mediawiki...don't have a static skin, but have it figure out things randomly based on page title :) [14:37:35] :-D :-D [14:37:40] now there is an apr 1 idea [14:38:02] PROBLEM - Puppet freshness on db26 is CRITICAL: No successful Puppet run in the last 10 hours [15:01:44] New patchset: Hashar; "jenkins::slave and a basic role applied gallium" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63666 [15:02:10] 666 [15:02:17] yeah that sounds evil enough [15:13:37] !log Created EducationProgram tables on dewikiversity [15:13:44] Logged the message, Master [15:19:18] PROBLEM - Puppet freshness on virt3 is CRITICAL: No successful Puppet run in the last 10 hours [15:22:52] New patchset: Reedy; "Cache loaded dblists when tagged. Reuse for SiteMatrix, CentralAuth and Incubator" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/57173 [15:26:42] New patchset: Diederik; "Two fixes to rolematcher.py" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63668 [15:28:24] hashar: Regarding the RT bug about puppet and apt… does your last comment mean that the issue is now fixed? [15:28:34] and the RT is ? :D [15:28:50] for php5 ? [15:29:00] Sorry, my client crashed [15:29:02] https://rt.wikimedia.org/Ticket/Display.html?id=5141 [15:30:56] ah yeah that one [15:31:03] we were talking about it with Ariel [15:31:13] I just copy pasted my investigation results [15:31:33] not sure what happened, but I suspect our pinning to prefer Wikimedia release is/was not working [15:31:38] New patchset: Diederik; "Two fixes to rolematcher.py" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63668 [15:31:51] ok. I just want to make sure someone is on the case :) [15:32:02] someone need to rebuild our package on top of latest ubuntu version I guess [15:32:11] and I am not working on it. [15:32:28] I think Tim logged that RT to make sure the issue will not get forgotten. [15:32:32] Change merged: Ottomata; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63668 [15:32:41] and I have no who manage the php package :( [15:32:56] ok… paravoid, when I hear 'rebuild the package' I think of you :) [15:39:23] andrewbogott: yeah sorry, I am not very helpful on this topic [15:41:56] * andrewbogott -> dentist [15:42:41] New patchset: Ottomata; "Fixing one more PacketLossLogtailer error" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63669 [15:42:56] Change merged: Ottomata; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63669 [15:46:31] New patchset: Reedy; "Enable Collection on lbwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63670 [15:46:45] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63670 [15:47:09] noc is going to be out of date unless we run git pull on fenari /home.. [15:47:57] noc needs to get moved to some good location (and have an appropriate syncing mechanism) [15:48:02] I dunno where that is though [15:48:03] mhmm [15:48:22] having cronjob git pull is probably enough.. depending on where it's hosted [15:48:43] Else have it in mediawiki-installation and symlink if necessary [15:49:02] there's already some symlinks to stuff not in the mw tree [15:49:17] well er not in git anyways [15:58:15] PROBLEM - Puppet freshness on ms-fe3001 is CRITICAL: No successful Puppet run in the last 10 hours [16:01:33] New patchset: Reedy; "Move RightsUrl variables to InitialiseSettings.php" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63673 [16:01:35] PROBLEM - SSH on lvs1001 is CRITICAL: Server answer: [16:02:22] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63673 [16:03:35] RECOVERY - SSH on lvs1001 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [16:06:44] PROBLEM - SSH on lvs1001 is CRITICAL: Server answer: [16:07:57] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 16:07:56 UTC 2013 [16:08:27] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [16:08:39] New review: Greg Grossmeier; "(1 comment)" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63673 [16:09:03] Reedy: ^^ [16:09:07] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 16:09:06 UTC 2013 [16:09:21] Yup, I'd just noticed that myself :D [16:09:27] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [16:09:47] heh, I didn't notice until I went to wikidata to check something else :) [16:10:17] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 16:10:10 UTC 2013 [16:10:27] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [16:11:17] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 16:11:07 UTC 2013 [16:11:27] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [16:11:37] RECOVERY - SSH on lvs1001 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [16:12:12] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 16:11:57 UTC 2013 [16:12:25] greg-g: Is that display text otherwise correct? [16:12:27] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [16:12:47] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 16:12:41 UTC 2013 [16:13:27] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [16:14:09] New patchset: Reedy; "Wrong link rel="copyright" for main namespace of Wikidata: set $wgRightsUrl and $wgRightsText to CC0" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63675 [16:14:57] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 16:14:50 UTC 2013 [16:14:58] New patchset: Reedy; "Wrong link rel="copyright" for main namespace of Wikidata: set $wgRightsUrl and $wgRightsText to CC0" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63675 [16:15:27] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [16:15:37] PROBLEM - SSH on lvs1001 is CRITICAL: Server answer: [16:16:53] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63675 [16:17:01] Reedy: Ideally it'd say something like: "To the extent possible under law, asdf has waived all copyright and related or neighboring rights to asdf." with a link to the CC0 waiver deed page. But, to fit the style of the rest of the wikis... [16:17:41] "Creative Commons Zero - Public Domain" or some such [16:17:45] !log reedy synchronized wmf-config/ [16:17:53] Logged the message, Master [16:19:08] $msg = 'Creative Commons Public Domain 1.0'; [16:19:09] 'wgRightsText' => array( [16:19:09] 'default' => 'Creative Commons Attribution-Share Alike 3.0 Unported', [16:19:09] 'huwikinews' => 'Creative Commons Attribution 3.0 Unported', [16:19:09] 'wikinews' => 'Creative Commons Attribution 2.5', [16:19:09] ), [16:19:37] RECOVERY - SSH on lvs1001 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [16:20:08] So, to fit that: "Creative Commons Zero" [16:20:26] er: "Creative Commons Zero 1.0" [16:52:27] PROBLEM - Puppet freshness on db44 is CRITICAL: No successful Puppet run in the last 10 hours [16:55:34] New patchset: Reedy; "$wgTranslateBlacklist of zh-* on metawiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63678 [16:57:42] New patchset: Cmjohnson; "adding parsoid servers to site.pp" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63680 [16:57:47] PROBLEM - Puppetmaster HTTPS on stafford is CRITICAL: CRITICAL - Socket timeout after 10 seconds [16:58:17] apergos: could your review plz [16:58:23] already see it [16:58:38] RECOVERY - Puppetmaster HTTPS on stafford is OK: HTTP OK: Status line output matched 400 - 336 bytes in 0.124 second response time [16:58:45] jenkins likes it! [16:59:45] that's good! [17:00:15] Change merged: ArielGlenn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63680 [17:00:36] !log reedy synchronized wmf-config/InitialiseSettings.php [17:00:44] Logged the message, Master [17:01:35] !log reedy synchronized wmf-config/InitialiseSettings.php [17:01:43] Logged the message, Master [17:02:46] !log reedy synchronized wmf-config/InitialiseSettings.php [17:02:54] Logged the message, Master [17:03:10] New patchset: Reedy; "Fix bug 37523" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63682 [17:03:26] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63682 [17:05:03] New patchset: Reedy; "Fix bug: 37522" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63683 [17:05:22] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63683 [17:05:38] !log reedy synchronized wmf-config/InitialiseSettings.php [17:05:45] Logged the message, Master [17:15:05] !log reedy synchronized wmf-config/CommonSettings.php [17:15:12] Logged the message, Master [17:15:21] New patchset: Reedy; "Enable translation import" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63684 [17:15:32] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63684 [17:18:57] New patchset: Reedy; "Namespace configuration for ur.wikipedia" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63685 [17:19:28] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63685 [17:19:39] !log reedy synchronized wmf-config/InitialiseSettings.php 'Namespace configuration for ur.wikipedia' [17:19:47] Logged the message, Master [17:22:07] Wow ready, good pace again :) [17:22:12] Reedy: * [17:22:29] There was seemingly quite a few rotting trivial bugs [17:22:52] * odder swears about some bloody keyboard under his nose [17:23:58] Reedy: this trivial shell request is 2 years old :p https://bugzilla.wikimedia.org/show_bug.cgi?id=30228 [17:24:42] odder: why don't you submit a patch for https://bugzilla.wikimedia.org/show_bug.cgi?id=33513 ? :) [17:27:21] tfinc: milimetric attached a screenshot (https://mingle.corp.wikimedia.org/attachments/0f9b5c8fca4964e51d8a08eacfd65d21/299/Limn_Problem_Mingle__628.png) to https://mingle.corp.wikimedia.org/projects/analytics/cards/628 [17:27:55] mutante: I'm dumb and didn't send the email warning people about the RT outage. So we should reschedule… does Thursday afternoon work OK for you? [17:28:03] thanks drdee [17:28:13] !log reedy synchronized wmf-config/CommonSettings.php 'Enable upload of XML files on Incubator (for MediaWiki dumps)' [17:28:20] Logged the message, Master [17:28:35] New patchset: Reedy; "Enable upload of XML files on Incubator (for MediaWiki dumps)" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63686 [17:29:05] andrewbogott: that's ok. Thursday it is then [17:29:14] I will send the email right now so I don't forget. [17:29:18] Nemo_bis: you just submitted it? [17:29:20] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63686 [17:29:20] thanks [17:29:23] Will we want notpeter in the loop to deal with db stuff? [17:29:43] Mutante, also i have a new version of the puppet stuff awaiting your review, here: https://gerrit.wikimedia.org/r/#/c/63213/ [17:30:14] odder: no, wrong bug in commit message [17:30:18] Damn it [17:30:19] Wrong bug [17:30:36] New review: Reedy; "Actually bug 30228" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63686 [17:33:39] mutante, any preference about timing on Thursday? (I was about to schedule it for noon but probably you will want lunch :) ) [17:33:58] andrewbogott: that one doesn't touch the existing RT? [17:34:21] andrewbogott: did that run on labs in puppetmaster::self? [17:34:55] andrewbogott: 11? [17:37:24] PROBLEM - Parsoid on wtp1013 is CRITICAL: Connection refused [17:37:24] PROBLEM - Parsoid on wtp1008 is CRITICAL: Connection refused [17:37:24] PROBLEM - Parsoid on wtp1020 is CRITICAL: Connection refused [17:37:24] PROBLEM - Parsoid on wtp1024 is CRITICAL: Connection refused [17:37:34] PROBLEM - Parsoid on wtp1010 is CRITICAL: Connection refused [17:37:44] PROBLEM - Parsoid on wtp1014 is CRITICAL: Connection refused [17:37:44] PROBLEM - Parsoid on wtp1021 is CRITICAL: Connection refused [17:37:44] PROBLEM - Parsoid on wtp1011 is CRITICAL: Connection refused [17:37:54] PROBLEM - Parsoid on wtp1015 is CRITICAL: Connection refused [17:38:04] PROBLEM - Parsoid on wtp1006 is CRITICAL: Connection refused [17:38:04] PROBLEM - Parsoid on wtp1016 is CRITICAL: Connection refused [17:38:14] PROBLEM - Parsoid on wtp1007 is CRITICAL: Connection refused [17:38:14] PROBLEM - Parsoid on wtp1018 is CRITICAL: Connection refused [17:38:31] RoanKattouw: I take it thats ok? ^ [17:39:34] gwicke: did you break parsoid again? ;) [17:39:54] those are all new machines [17:40:11] brand spanking [17:40:14] Roan was probably working on setting them or the monitoring up [17:40:18] * greg-g nods [17:40:29] thanks gwicke [17:40:46] he is in a stand-up right now [17:41:26] mutante, I have an apt that ends at 11 but we can start as soon as I get home. [17:42:17] andrewbogott: ok, 11.30..or just make it 12, it's ok [17:46:27] terbium foundationwiki Error connecting to db1025.eqiad.wmnet: Unknown MySQL server host 'db1025.eqiad.wmnet' (0) [17:46:33] * Aaron|home wonders what that is [17:48:49] andrewbogott: 10.in-addr.arpa:100 1H IN PTR db1025.frack.eqiad.wmnet. [17:49:06] argg, i meant Aaron|home [17:49:10] note the "fracK" part [17:49:17] means it's fundraising [17:50:19] Jeff_Green: ^ [17:53:11] !log reedy synchronized wmf-config/ [17:53:18] Logged the message, Master [17:55:35] !log reedy synchronized wmf-config/ [17:55:43] Logged the message, Master [17:57:06] paravoid: you around? [17:57:09] RobH: The monitoring panic is normal. I haven't touched the boxes yet so they don't work yet [18:03:38] mutante: where is that query from? [18:03:54] terbium--that's the eqiad cron/scriptwhatever box? [18:04:40] sbernardin [18:04:48] Hey [18:04:54] New patchset: Reedy; "First attempt at starting to cleanup wiki specific file extensions" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63694 [18:04:55] can you confirm if there is an orange led for the bad drives? [18:05:19] i want to make sure we're pulling the right ones [18:06:38] Jeff_Green: Aaron|home reported it, i don't know yet [18:07:02] but that db is in frack and something expects it to be in just eqiad [18:07:20] mutante: yeah--it moved last week [18:07:43] terbium is a Wikimedia Misc - Maintenance Server: [18:07:50] Reedy: https://gerrit.wikimedia.org/r/#/c/63696/ [18:07:52] and currently it's firewalled blocking terbium. short term I suggest switching whatever it is to db78.pmtpa.wmnet [18:07:56] foundationwiki, pagetriage extension and parser cache purging [18:08:28] and after that we need to evaluate whether it's ok to continue to allow non-frack hosts to access fundraising mysql [18:08:39] Reedy: can you review that? [18:09:37] PROBLEM - Parsoid on wtp1022 is CRITICAL: Connection refused [18:09:47] PROBLEM - Parsoid on wtp1017 is CRITICAL: Connection refused [18:09:57] PROBLEM - Parsoid on wtp1009 is CRITICAL: Connection refused [18:10:07] PROBLEM - Parsoid on wtp1012 is CRITICAL: Connection refused [18:10:07] PROBLEM - Parsoid on wtp1023 is CRITICAL: Connection refused [18:10:17] PROBLEM - Parsoid on wtp1005 is CRITICAL: Connection refused [18:10:17] PROBLEM - Parsoid on wtp1019 is CRITICAL: Connection refused [18:11:04] Change merged: Kaldari; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63602 [18:11:25] !log reedy synchronized wmf-config/ [18:11:33] Logged the message, Master [18:12:17] New patchset: Reedy; "First attempt at starting to cleanup wiki specific file extensions" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63694 [18:13:14] New patchset: Reedy; "First attempt at starting to cleanup wiki specific file extensions" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63694 [18:13:22] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63694 [18:13:25] cmjohnson1: ms-be11 shows amber on a drive ...ms-be5 has a drive with no leds at all...and ms-be10 has all green activity leds [18:14:27] Change merged: Andrew Bogott; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/62840 [18:15:02] cmjohnson1: can I pull the drives from 5 & 11? [18:15:11] not yet [18:15:53] OK [18:17:47] PROBLEM - RAID on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:17:59] Change merged: Dzahn; [operations/apache-config] (master) - https://gerrit.wikimedia.org/r/59580 [18:18:27] PROBLEM - swift-object-replicator on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:36] sbernardin: replace the disk on ms-be11 [18:18:37] PROBLEM - DPKG on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:37] PROBLEM - swift-account-auditor on ms-be11 is CRITICAL: Timeout while attempting connection [18:18:38] PROBLEM - swift-container-updater on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:38] PROBLEM - swift-object-server on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:38] PROBLEM - swift-account-replicator on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:38] PROBLEM - swift-container-server on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:38] PROBLEM - swift-container-auditor on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:38] PROBLEM - swift-account-reaper on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:39] PROBLEM - swift-object-auditor on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:47] PROBLEM - swift-object-updater on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:47] PROBLEM - swift-container-replicator on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:18:47] PROBLEM - swift-account-server on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. [18:21:32] ACKNOWLEDGEMENT - DPKG on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:33] ACKNOWLEDGEMENT - Disk space on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:33] ACKNOWLEDGEMENT - RAID on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:33] ACKNOWLEDGEMENT - swift-account-auditor on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:33] ACKNOWLEDGEMENT - swift-account-reaper on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:33] ACKNOWLEDGEMENT - swift-account-replicator on ms-be11 is CRITICAL: Timeout while attempting connection daniel_zahn disks being replaced - RT-5088 [18:21:33] ACKNOWLEDGEMENT - swift-account-server on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:34] ACKNOWLEDGEMENT - swift-container-auditor on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:34] ACKNOWLEDGEMENT - swift-container-replicator on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:35] ACKNOWLEDGEMENT - swift-container-server on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:35] ACKNOWLEDGEMENT - swift-container-updater on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:36] ACKNOWLEDGEMENT - swift-object-auditor on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:36] ACKNOWLEDGEMENT - swift-object-replicator on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:37] ACKNOWLEDGEMENT - swift-object-server on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:37] ACKNOWLEDGEMENT - swift-object-updater on ms-be11 is CRITICAL: CHECK_NRPE: Socket timeout after 10 seconds. daniel_zahn disks being replaced - RT-5088 [18:21:46] heh, showing that feature to Alex:) [18:22:17] RECOVERY - swift-object-replicator on ms-be11 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-replicator [18:22:28] RECOVERY - DPKG on ms-be11 is OK: All packages OK [18:22:28] RECOVERY - swift-account-auditor on ms-be11 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-auditor [18:22:28] RECOVERY - swift-object-server on ms-be11 is OK: PROCS OK: 101 processes with regex args ^/usr/bin/python /usr/bin/swift-object-server [18:22:28] RECOVERY - swift-account-replicator on ms-be11 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-replicator [18:22:28] RECOVERY - swift-container-updater on ms-be11 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-updater [18:22:28] RECOVERY - swift-container-server on ms-be11 is OK: PROCS OK: 13 processes with regex args ^/usr/bin/python /usr/bin/swift-container-server [18:22:28] RECOVERY - swift-container-auditor on ms-be11 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-auditor [18:22:29] RECOVERY - swift-account-reaper on ms-be11 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-account-reaper [18:22:29] RECOVERY - swift-object-auditor on ms-be11 is OK: PROCS OK: 2 processes with regex args ^/usr/bin/python /usr/bin/swift-object-auditor [18:22:38] RECOVERY - swift-object-updater on ms-be11 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-object-updater [18:22:38] RECOVERY - swift-container-replicator on ms-be11 is OK: PROCS OK: 1 process with regex args ^/usr/bin/python /usr/bin/swift-container-replicator [18:22:38] RECOVERY - swift-account-server on ms-be11 is OK: PROCS OK: 13 processes with regex args ^/usr/bin/python /usr/bin/swift-account-server [18:22:47] RECOVERY - RAID on ms-be11 is OK: OK: State is Optimal, checked 1 logical device(s) [18:23:55] sbernardin: you there ? [18:24:27] PROBLEM - Puppet freshness on mc15 is CRITICAL: No successful Puppet run in the last 10 hours [18:24:31] !log kaldari synchronized php-1.22wmf4/extensions/Echo 'syncing Echo for 1.22wmf4' [18:24:39] Logged the message, Master [18:24:40] cmjohnson1: yup...just replaced disk in ms-be11 [18:24:49] okay [18:25:27] PROBLEM - Puppet freshness on colby is CRITICAL: No successful Puppet run in the last 10 hours [18:27:42] !log reedy synchronized wmf-config/ [18:27:50] Logged the message, Master [18:28:31] Nemo_bis, Reedy https://commons.wikimedia.org/wiki/Commons:Upload [18:28:44] odder: ? [18:29:13] defining $wgUploadWizardConfig['altUploadForm'] => array [18:29:21] for that many languages just doesn't scale. [18:29:37] RECOVERY - Host mw1173 is UP: PING OK - Packet loss = 0%, RTA = 1.07 ms [18:29:38] New patchset: Reedy; "Mostly remove need for $wmgPrivateWikiUploads" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63702 [18:29:48] !log aaron synchronized php-1.22wmf3/includes/filerepo/file/LocalFile.php '0cf6b27aa43ad2b65725081ee4d36ccb0f8a34c6' [18:29:55] Logged the message, Master [18:29:59] odder: copy source HTML + regex [18:30:46] Nemo_bis: I /know/ how to get the addresses & et al., but the idea of defining that in a variable is just… weird. [18:30:48] !log reedy synchronized wmf-config/ [18:30:56] Logged the message, Master [18:32:22] http://p.defau.lt/?6EZNo9Tr1vvtA6wzDodr9A [18:32:24] maybe [18:32:28] PROBLEM - Apache HTTP on mw1173 is CRITICAL: Connection refused [18:33:27] RECOVERY - Apache HTTP on mw1173 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 747 bytes in 0.057 second response time [18:33:47] !log kaldari synchronized php-1.22wmf4/extensions/Echo 'syncing Echo for 1.22wmf4' [18:33:55] Logged the message, Master [18:35:10] !log replacing bad drive in ms-be11 [18:35:17] Logged the message, Master [18:38:59] New patchset: Ottomata; "Putting nginx webrequest logs back in the main udp2log multicast stream." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63704 [18:39:36] Change merged: Ottomata; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63704 [18:39:56] didn't we used to have a redirect at http://wmflabs.org/ to labsconsole? [18:42:12] New patchset: Odder; "(bug 48457) Set abusefilter-modify-restricted for cawiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63705 [18:43:08] robla: originally and for a long time there was none, dunno if it existed at some point [18:43:34] !log deployed puppet change to make nginx send webrequest logs back to main multicast firehose.  NOTE: These logs are not sent to the udp2log instance on emery. [18:43:53] robla: btw https://bugzilla.wikimedia.org/show_bug.cgi?id=43580 [18:44:53] thanks [18:45:39] !log running an apache-graceful-all [18:45:46] Logged the message, Master [18:47:12] cmjohnson1: are we good with ms-be11? [18:49:17] he had to step out for a minute, should be back shortly [18:49:37] don't disappear on us, just in case sbernardin [18:50:27] apergos: I'll be here [18:50:34] ok cool [18:51:25] !log gracefulling apaches with dsh [18:51:33] Logged the message, Master [18:51:45] Nemo_bis: o__0 [18:51:53] New patchset: Reedy; "Fixup leading whitespace" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63709 [18:52:09] odder: ?? [18:52:34] https://gerrit.wikimedia.org/r/gitweb?p=operations/mediawiki-config.git;a=blame;f=wmf-config/CommonSettings.php;hb=015f5b7131eea4309864691dbc1f4266ac57ad18#l2208 [18:53:11] so? [18:53:54] eh, I guess I'll just edit that. [18:53:55] New patchset: Reedy; "Fixup leading whitespace" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63709 [18:54:10] still gracefulling [18:54:16] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63709 [18:55:03] odder: yes, we have some endless switches in CommonSettings anyway :) [18:56:20] done [18:56:36] PROBLEM - LVS HTTPS IPv6 on wikinews-lb.esams.wikimedia.org_ipv6 is CRITICAL: Connection refused [18:56:37] PROBLEM - HTTPS on ssl3001 is CRITICAL: Connection refused [18:56:58] Nemo_bis: maybe it's better to move the variable to InitialiseSettings though? [18:59:36] RECOVERY - HTTPS on ssl3001 is OK: OK - Certificate will expire on 01/20/2016 12:00. [18:59:37] RECOVERY - LVS HTTPS IPv6 on wikinews-lb.esams.wikimedia.org_ipv6 is OK: HTTP OK: HTTP/1.1 200 OK - 64427 bytes in 0.728 second response time [19:02:20] odder: nah [19:02:21] New patchset: Catrope; "Add wtp1005-wtp1024 to Parsoid deployment group" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63711 [19:02:38] I am about to run scap [19:02:51] andrewbogott: Could you look at https://gerrit.wikimedia.org/r/63711 real quick please? 1-line change [19:05:53] New patchset: Reedy; "Fixup mixed leading whitespace" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63712 [19:06:11] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63712 [19:11:05] New review: Andrew Bogott; "I hate regexp" [operations/puppet] (production); V: 2 C: 2; - https://gerrit.wikimedia.org/r/63711 [19:11:05] Change merged: Andrew Bogott; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63711 [19:11:35] !log bsitu Started syncing Wikimedia installation... : Update Echo to master [19:11:43] Logged the message, Master [19:16:38] PROBLEM - SSH on lvs6 is CRITICAL: Server answer: [19:17:38] RECOVERY - SSH on lvs6 is OK: SSH OK - OpenSSH_5.9p1 Debian-5ubuntu1.1 (protocol 2.0) [19:18:50] New patchset: Reedy; "Fixup a few more indenting differences" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63715 [19:19:00] New patchset: Asher; "prep for planned s4 master swap" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63716 [19:19:22] Change merged: Reedy; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63715 [19:19:30] sbernardin: ms-be11 is good..thx for waiting for me to get back [19:20:03] cmjohnson1: no problem ...which one is next? [19:20:24] not sure [19:20:48] Change merged: Asher; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63716 [19:21:45] !log bsitu Finished syncing Wikimedia installation... : Update Echo to master [19:21:53] Logged the message, Master [19:21:58] New patchset: Andrew Bogott; "Several minor changes to the openstack manifest:" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/51798 [19:23:28] New patchset: Diederik; "Add version info to umapi puppet template." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63717 [19:32:29] cmjohnson1: ms-be5 has a drive with no activity led going at all [19:33:14] cmjohnson1: ms-be10 has green activity led's on everything [19:33:32] sbernardin: thx...still trying to get ms-be11 right...disk is there but not shows as /sdo not /sdh [19:33:37] Change merged: Ottomata; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63717 [19:37:43] New patchset: Andrew Bogott; "Several minor changes to the openstack manifest:" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/51798 [19:46:22] !log Accepted salt keys for wtp1005-1024 on sockpuppet [19:46:31] Logged the message, Mr. Obvious [19:49:36] New patchset: Andrew Bogott; "Several minor changes to the openstack manifest:" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/51798 [19:50:28] RECOVERY - Disk space on ms-be11 is OK: DISK OK [19:51:12] !log aaron synchronized php-1.22wmf3/includes/filerepo/file/LocalFile.php 'd58dc596a25607deaecf0aa1c3be877f834da533 ' [19:51:20] Logged the message, Master [19:53:00] !log Running salt-call state.highstate on wtp1005-24 [19:53:08] Logged the message, Mr. Obvious [19:54:36] http://commons.wikimedia.org/wiki/Category:1877_books \o/ [19:54:44] so now I need to run sync-common on srv193 to test stuff on testwiki? [19:55:03] New patchset: Reedy; "$wgTranslateBlacklist of zh-* on metawiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63678 [19:55:40] MaxSem: yes [19:55:47] cool, thanks [19:59:08] PROBLEM - Puppet freshness on lvs1004 is CRITICAL: No successful Puppet run in the last 10 hours [19:59:08] PROBLEM - Puppet freshness on lvs1005 is CRITICAL: No successful Puppet run in the last 10 hours [19:59:08] PROBLEM - Puppet freshness on lvs1006 is CRITICAL: No successful Puppet run in the last 10 hours [20:00:06] aieee [20:00:15] maxsem@tin:/a/common/php-1.22wmf4$ git pull [20:00:16] error: insufficient permission for adding an object to repository database .git/objects [20:01:04] drwxr-xr-x 2 maxsem wikidev 4096 May 14 19:59 ba [20:01:27] You and kaldari by the looks of it [20:01:41] sbernardin: around? [20:03:07] Aaron|home: hah, that category has been annoying you for ages! [20:03:39] meh, of course - our .bashrc's were left on fenari:] [20:04:04] oops ;) [20:04:12] but, I suppose that makes sense [20:04:28] cmjohnson1: yup....what's up [20:04:53] okay we are going to get ms-be10 here in a few minutes [20:04:58] are you ready? [20:05:05] kaldari, can you fix your perms? [20:05:12] MaxSem: Not on fenari, on NFS [20:05:33] effectively, on fenari;) [20:05:46] MaxSem, greg-g: sure, how do I do that again? [20:05:46] just go ahead and mount nfs on tin ;) [20:05:57] find -user kaldari -exec chmod g+w {} \; [20:06:01] in wmf4 [20:06:19] do I need to set some sort of mask for the future? [20:06:20] and anywhere else you updated stuff [20:06:33] umask 22 in your ~/.bashrc [20:06:42] seems the default mask should make sense, if it doesn't, that should be an RT [20:07:02] I thought Tim fixed this issue ages ago [20:07:08] * greg-g shrugs [20:07:13] MaxSem: thanks [20:07:19] hopefully fixed now [20:07:26] mutante did fix it at one point [20:07:32] kaldari, thanks:) [20:07:57] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 20:07:54 UTC 2013 [20:08:03] * Aaron|home converts his mwscript and sync script autocompletion code to tin [20:08:07] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [20:09:07] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 20:09:05 UTC 2013 [20:09:07] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [20:09:47] Reedy: MaxSem : gerrit changes 34223 and 22111 and RT-804 [20:10:17] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 20:10:09 UTC 2013 [20:10:34] class generic::wikidev-umask [20:11:04] it is used on node fenari in site.pp, you may want it on the new host [20:11:07] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [20:11:17] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 20:11:08 UTC 2013 [20:12:07] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [20:12:30] mutante: Aha, that sounds like a good idea. Same for terbium if it's not got it already [20:12:47] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 20:12:45 UTC 2013 [20:12:47] so tin and terbium? [20:13:05] mhm, sync-common on srv193 doesn't pull tyhe newest code [20:13:07] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [20:13:18] or it does... [20:13:27] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 20:13:21 UTC 2013 [20:13:30] mutante: please! [20:13:39] note this FIXME in there "# FIXME: remove this once fenari became precise or there is a new deploy host "":) [20:14:07] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [20:14:47] RECOVERY - Puppet freshness on ms2 is OK: puppet ran at Tue May 14 20:14:44 UTC 2013 [20:15:07] PROBLEM - Puppet freshness on ms2 is CRITICAL: No successful Puppet run in the last 10 hours [20:17:28] Change merged: MaxSem; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63081 [20:17:29] mutante: https://bugzilla.wikimedia.org/show_bug.cgi?id=47204 is resolved then? [20:18:00] odder: i wasn't sure, i was just sure the other one to add the redirect was resolved [20:18:09] but this is about removing it somewhere [20:18:11] didnt do that [20:19:55] !log maxsem synchronized wmf-config/InitialiseSettings.php [20:20:03] Logged the message, Master [20:20:56] mutante: Liagent did that back in April already, so I closed the bug [20:21:21] !log maxsem synchronized docroot/bits/mobile/W.png [20:21:29] Logged the message, Master [20:21:35] New patchset: Asher; "new s4 master" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63727 [20:21:44] odder: great, thx [20:24:43] New patchset: Dzahn; "use generic::wikidev-umask on tin and terbium to ensure wikidev users have umask 0002 as used on fenari." [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63728 [20:25:49] Change merged: Asher; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63727 [20:26:15] New review: Dzahn; "feel free to move into role for deployment server if you want to. this is just like it was on fenari..." [operations/puppet] (production) C: 2; - https://gerrit.wikimedia.org/r/63728 [20:26:16] Change merged: Dzahn; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63728 [20:26:50] New patchset: MaxSem; "Fix group name fail" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63729 [20:27:24] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63729 [20:28:11] Reedy: MaxSem kaldari .. applying the umask thin on tin and terbium [20:28:16] (as used on fenari) [20:28:17] wee [20:28:55] thanks mutante [20:29:42] notice: /Stage[main]/Generic::Wikidev-umask/File[/etc/profile.d/umask-wikidev.sh]/ensure: [20:29:46] on tin [20:30:20] !log asher synchronized wmf-config/db-eqiad.php 'setting s4 to read-only for a sec' [20:30:22] that script just does ... if groups | grep -w -q wikidev; then umask 0002 else umask 0022 ... fi [20:30:28] Logged the message, Master [20:30:34] so after you relogin ,it should do it [20:30:40] as long as you are wikidev member [20:31:01] same on terbium now [20:31:10] !log asher synchronized wmf-config/db-eqiad.php 'setting s4 to writeable on new master' [20:31:17] Logged the message, Master [20:31:39] !log apply umask 0002 for wikidev users fix on tin and terbium, class generic::wikidev-umask [20:31:48] Logged the message, Master [20:32:06] unrelated issue on terbium, fyi: [20:32:09] err: /Stage[main]/Misc::Maintenance::Refreshlinks/File[/home/mwdeploy/refreshLinks]/ensure: change from absent to directory failed: Cannot create /home/mwdeploy/refreshLinks; parent directory /home/mwdeploy does not exist [20:32:20] !log maxsem synchronized wmf-config/InitialiseSettings.php [20:32:28] Logged the message, Master [20:34:46] sbernardin: please replace the disk slot 2 on ms-be10 that remember they start from 0 [20:35:07] @seen brion [20:35:15] hm, possibly wrong channel. [20:35:28] cmjohnson1: starts from 0 and goes down ...correct? [20:35:42] yep..there are numbers [20:38:24] mutante: what does that do? [20:39:06] !log maxsem synchronized php-1.22wmf3/extensions/Gadgets/ 'https://gerrit.wikimedia.org/r/#/c/60954/' [20:39:14] Logged the message, Master [20:40:20] PROBLEM - Disk space on ms-be9 is CRITICAL: DISK CRITICAL - /srv/swift-storage/sde1 is not accessible: Input/output error [20:40:28] MaxSem: you deploying right now? [20:40:53] not at the precise second [20:41:02] though in general yes I do [20:41:38] MaxSem: any chance you can sync this one for me? (https://gerrit.wikimedia.org/r/#/c/63596/) i can +2 it if so [20:41:43] 1 line change, adds a log group [20:42:22] sure [20:42:26] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63596 [20:42:29] Aaron|home: which one? umask thing? [20:42:48] MaxSem: thanks! [20:42:53] Misc::Maintenance::Refreshlinks/File [20:43:42] cmjohnson1: changed drive on ms-be10 [20:44:04] !log changing drives on ms-be10 [20:44:11] Logged the message, Master [20:44:23] sbernardin: you log before you make the change :-P [20:44:28] thx [20:44:39] !log maxsem synchronized wmf-config/InitialiseSettings.php 'https://gerrit.wikimedia.org/r/#/c/63596/' [20:44:47] Logged the message, Master [20:44:55] cmjohnson1: I know...just remembered when I came back down [20:44:58] ori-l, ^^ [20:45:10] MaxSem: <3 [20:45:46] cmjohnson1: so ms-be5 should be disk slot 1 [20:48:37] probably but we'll get to that in a few mins sbernardin [20:50:00] Aaron|home: command => "/usr/local/bin/mwscriptwikiset refreshLinks.php ${cluster}.dblist --dfn-only ... [20:50:06] bbl [20:51:47] !log s4 now all mariadb, new master info = MASTER_LOG_FILE='db1059-bin.000012', MASTER_LOG_POS=473448023 [20:51:55] Logged the message, Master [20:53:18] New patchset: Catrope; "No longer run and deploy Parsoid on the Parsoid Varnish machines" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63735 [20:59:28] sbernardin: let's get ms-be5....it is going to be slot 1 [20:59:35] log it [21:00:23] !log changing HD in ms-be5 [21:00:31] Logged the message, Master [21:02:40] RECOVERY - Parsoid on wtp1021 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:02:41] RECOVERY - Parsoid on wtp1014 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.008 second response time [21:02:41] RECOVERY - Parsoid on wtp1011 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:02:41] RECOVERY - Parsoid on wtp1012 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.008 second response time [21:03:00] RECOVERY - Parsoid on wtp1018 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:03:20] RECOVERY - Parsoid on wtp1008 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:03:20] RECOVERY - Parsoid on wtp1016 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:03:20] RECOVERY - Parsoid on wtp1020 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:03:20] RECOVERY - Parsoid on wtp1024 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:03:20] RECOVERY - Parsoid on wtp1013 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:03:28] cmjohnson1: new HD is in ms-be5 [21:03:30] RECOVERY - Parsoid on wtp1010 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:03:31] RECOVERY - Parsoid on wtp1007 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.011 second response time [21:03:31] RECOVERY - Parsoid on wtp1023 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.010 second response time [21:03:31] RECOVERY - Parsoid on wtp1009 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:03:31] RECOVERY - Parsoid on wtp1017 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.009 second response time [21:03:40] RECOVERY - Parsoid on wtp1022 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.008 second response time [21:03:40] RECOVERY - Parsoid on wtp1015 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.008 second response time [21:03:40] RECOVERY - Parsoid on wtp1005 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.010 second response time [21:06:37] New patchset: Catrope; "Remove apachebench on wtp1004, we don't need it any more" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63738 [21:11:20] RECOVERY - Parsoid on wtp1006 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:12:20] RECOVERY - Parsoid on wtp1019 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.007 second response time [21:14:21] hmm [21:14:22] from within function "GlobalUsage::deleteLinksFromPage". Database returned error "1290: The MySQL server is running with the --read-only option so it cannot execute this statement (10.64.16.27)". [21:14:25] on testwiki? [21:14:34] binasher: ---^^ [21:14:50] the bit above that was [21:14:50] A database query syntax error has occurred. This may indicate a bug in the software. The last attempted database query was: [21:14:51] (SQL query hidden) [21:16:47] hmm seems to have worked on a refresh [21:19:20] RECOVERY - Parsoid on wtp1004 is OK: HTTP OK: HTTP/1.1 200 OK - 1373 bytes in 0.002 second response time [21:25:29] New patchset: Odder; "(bug 48237) Remove redundant namespaces from testwiki" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63756 [21:29:50] !log Repooled wtp1004, added wtp1005-wtp1024 to the pool [21:29:56] Logged the message, Mr. Obvious [21:45:02] scapping [21:45:17] "Copying to tin from tin.eqiad.wmnet..." [21:45:21] brilliant [21:46:19] You never noticed it copying to fenari? [21:47:39] previously, it at least tried to hide its nature behind IPs [21:49:23] Ryan_Lane: When I add an SSH key at wikitech.wikimedia.org, how long does it take before that key lets me into bastion? [21:49:38] should be within 5 minutes [21:49:41] OK [21:49:43] Also [21:49:50] The display of private IPs on wikitech seems to be broken [21:49:55] yep [21:50:11] you mean stuff showing up in recent changes and such? [21:50:15] Which is why I was gonna log in and do host , and realized I had to reset my key [21:50:17] No, I mean https://wikitech.wikimedia.org/wiki/Nova_Resource:I-00000736 [21:50:39] There is no way to figure out the instance's IP from that page other than logging into bastion and doing a DNS lookup for its DNS name [21:50:47] ah. [21:50:48] hm [21:51:16] Oh neeever mind [21:51:18] It's still building [21:51:23] :D [21:51:27] The other one has finished building and now displays its IP [21:51:30] yeah. it'll update that info when it's done [21:51:39] Which is a bit weird because surely it's assigned earlier than that, but it's fine [21:51:49] RoanKattouw: it's not [21:52:30] it schedules, then networks, then builds [21:52:34] then goes active [21:53:11] !log maxsem Started syncing Wikimedia installation... : Weekly mobile deployment [21:53:14] I'm not totally sure when the mediawiki editing call is made, but I have a feeling it's when it goes active [21:53:19] Logged the message, Master [21:56:21] New patchset: Reedy; "Move squid config to realm specific config files" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63781 [21:57:00] New patchset: Reedy; "Move squid config to realm specific config files" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63781 [22:02:37] is anybody working on the database errors on image upload on test? [22:02:55] !log upgrading mediawiki to 1.22wmf4 on wikitech [22:03:02] Logged the message, Master [22:03:14] New patchset: Reedy; "Move squid config to realm specific config files" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63781 [22:03:23] https://gist.github.com/brion/5580000 <- here's the api error response i get [22:03:37] unfortunately all the error messages in the backtrace are cropped so you can't tell what's going on [22:03:46] and it only returned a generic 'database query error' message to the api [22:03:47] I think it's a dbreadonly [22:03:51] ah [22:03:52] fun [22:03:53] Thehelpfulone mentioned it somewhere [22:03:59] !log maxsem Finished syncing Wikimedia installation... : Weekly mobile deployment [22:04:00] can we….. fix that ? :) [22:04:06] Logged the message, Master [22:04:13] * Reedy hands brion a mallet and points towards commons [22:04:14] earlier in here Reedy [22:04:21] ~1 hour ago [22:04:32] well i can save edits on test… http://test.wikipedia.org/wiki/Foo just updated [22:04:33] brion: test2! [22:04:36] so it's not read-only as a whole [22:04:42] brion: I think the key is globalusage -> commons master [22:05:00] hrmfllll [22:05:15] Hang on.. Wasn't binasher prepping for a master switch earlier? [22:05:42] 20:52 binasher: s4 now all mariadb, new master info = MASTER_LOG_FILE='db1059-bin.000012', MASTER_LOG_POS=473448023 [22:05:45] uh oh [22:07:01] Hah, we've have nah wiki [22:07:26] binasher: [22:07:26] Tue May 14 22:01:18 UTC 2013 srv193 testwiki GlobalUsage::deleteLinksToFile 10.64.16.27 1290 The MySQL server is running with the --read-only option so it cannot execute this statement (10.64.16.27) DELETE FROM `globalimagelinks` WHERE gil_wiki = 'testwiki' AND gil_to = 'Superman.jpeg' [22:07:50] you found my error! [22:07:57] heh [22:07:59] and yes… fucker's read-only on that table. niiice [22:08:00] long live superman [22:08:23] looks like srv193 doesn't get deployed to from tin? [22:08:28] New patchset: Sanja pavlovic; "Per bug #48012. Compressed possible errors into for loop; program exits if there is one or more errors and writes all of them." [operations/dumps] (ariel) - https://gerrit.wikimedia.org/r/63782 [22:08:28] 10.64.16.27 = the old master [22:08:36] test.wp is not srv193 anymore [22:08:44] since when? [22:08:49] mutante: read what reedy pasted [22:09:01] it probably just needs to be added to mediawiki-installation [22:09:08] i don't know, but i noticed it a couple days ago, it points to wikipedia-lb [22:09:21] ah [22:09:31] it's always pointed to wikipedia-lb [22:09:32] !log Running sync-common on srv193 [22:09:42] Logged the message, Master [22:09:45] then wikipedia-lb goes to squid, and squid has an ACL which sends it to srv193 [22:09:54] TimStarling: oh, then ignore me, i thought it was directly srv193 in the past for some reason [22:10:21] brion: Try again? :D [22:10:43] trying now... [22:11:03] Reedy: still getting same err :( [22:11:12] I see srv193 in mediawiki-installation already [22:11:17] srv193 is in mediawiki-installation on fenari [22:11:39] and tin (noting it's now puppetised) [22:12:31] It's still trying for 10.64.16.27 [22:14:21] it's still the master in db-pmtpa.php [22:14:31] I guess binasher only changed it in db-eqiad.php [22:14:58] oh pmtpa [22:15:01] it's a silly place [22:15:05] ah, sorrry about that [22:16:00] are you fixing it? [22:16:56] !log finished upgrading wikitech [22:17:04] Logged the message, Master [22:17:10] i'm interviewing a candidate right now [22:17:27] ok, I'll fix it [22:19:40] thanx! [22:19:51] New patchset: Tim Starling; "Fix db-pmtpa.php, new s4 master" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63784 [22:20:16] oops, no hosts entry [22:21:19] New patchset: Tim Starling; "Fix db-pmtpa.php, new s4 master" [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63784 [22:21:57] New review: Tim Starling; "PS2: added hosts entry" [operations/mediawiki-config] (master); V: 2 C: 2; - https://gerrit.wikimedia.org/r/63784 [22:21:58] Change merged: Tim Starling; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63784 [22:23:08] !log tstarling synchronized wmf-config/db-pmtpa.php [22:23:15] Logged the message, Master [22:24:35] \o/ [22:24:37] thx TimStarling [22:27:46] New patchset: Andrew Bogott; "First pass at a labsconsole puppet setup" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/53989 [22:27:47] New patchset: Andrew Bogott; "Several minor changes to the openstack manifest:" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/51798 [22:29:06] New review: Andrew Bogott; "WIP" [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/53989 [22:31:42] !log DNS update - removing owa servers [22:31:50] Logged the message, Master [22:37:30] !log authdns-update for a bunch of decom updates [22:37:37] Logged the message, RobH [22:46:14] hey, are we using this? [22:46:20] "mobile::vumi" [22:46:25] on server "zhen" [22:46:54] 'vumi' is fun to say out loud [22:47:03] that's my contribution to your inquiry, mutante [22:51:43] New patchset: Aaron Schulz; "Cleaned up profiler config." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63793 [23:06:34] PROBLEM - LVS HTTP IPv4 on m.wikimedia.org is CRITICAL: HTTP CRITICAL: HTTP/1.1 200 OK - pattern not found - 21524 bytes in 0.002 second response time [23:06:46] mutante: i belief vumi is being used by the wikipedia zero team [23:07:13] i belief that zhen is the hot-backup node for the other vumi server (there are 2 in total) [23:07:26] yea we got info on it in -mobile [23:07:36] we have to inform partner (just one) to swap over [23:07:40] zero is different servers i think. [23:07:40] vumi is a service where people can send a text message to [23:07:45] but dunno [23:07:52] drdee: the other would be "silver" [23:07:55] thanks [23:07:56] rigght [23:08:23] i see there is a big spring cleanup going on :D [23:08:47] so "silver" is fine, it is in eqiad [23:08:53] all we want is move out of Tampa [23:15:32] New patchset: coren; "Tool Labs: restore tools.wmflabs.org" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63797 [23:15:35] * TimStarling reads http://blog.zorinaq.com/?e=74 [23:15:54] "Oh god, the NTFS code is a purple opium-fueled Victorian horror novel that uses global recursive locks and SEH for flow control." [23:15:56] TimStarling: yeah that was an interesting read [23:16:58] TimStarling: oh, and totally unrelated but this might be of interest to you: http://www.techempower.com/benchmarks/ [23:19:02] Change merged: coren; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63797 [23:19:56] New patchset: Demon; "Don't bother replicating to formey anymore" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63801 [23:22:22] New patchset: coren; "Tool Labs: Actually use puppet syntax" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63802 [23:23:13] !log maxsem synchronized php-1.22wmf3/resources/startup.js [23:23:20] Logged the message, Master [23:23:23] New review: coren; "*facepalm*" [operations/puppet] (production) C: 2; - https://gerrit.wikimedia.org/r/63802 [23:23:24] Change merged: coren; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/63802 [23:26:20] Change merged: jenkins-bot; [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63793 [23:26:27] binasher: can you give a quick glance at the sql patch in https://gerrit.wikimedia.org/r/#/c/61801/ ? [23:29:53] pgehres: sure [23:31:02] ugh, I forgot one minor change to the patch [23:31:08] "That's literally the explanation for PowerShell. Many of us wanted to improve cmd.exe, but couldn't." [23:31:15] I always wondered [23:31:16] should be ADD INDEX /*i*/aa_method (aa_method) [23:32:36] pgehres: does 0 = web login? [23:33:01] web or anything that is not explicitly defined [23:33:06] e.g. API [23:33:30] pgehres: will > 50% of rows have aa_method = 0? [23:33:53] maybe?, is there a better default? [23:34:13] and there will be how many values? [23:34:33] Login on mobile will be a lot less.. [23:34:39] currently only mobile, but there was interest expressed in API, OAuth, etc [23:35:04] New review: MZMcBride; "Whee." [operations/mediawiki-config] (master) - https://gerrit.wikimedia.org/r/63756 [23:35:10] sounds like cardinality will be too low for an index on aa_method to be of use [23:35:31] ah, interesting [23:35:40] it can always be added later [23:36:26] happy to remove it if you'd liek [23:36:28] New patchset: Ori.livneh; "Set common rsync and dsh parameters in mw-deployment-vars" [operations/puppet] (production) - https://gerrit.wikimedia.org/r/57890 [23:36:37] Change merged: Andrew Bogott; [operations/puppet] (production) - https://gerrit.wikimedia.org/r/51798 [23:37:34] !log aaron synchronized wmf-config/StartProfiler.php [23:37:42] Logged the message, Master [23:38:16] usefulness depends on how it will be queried though. it's not impossible that it will be useful, but that's more likely to be the case in the magical future [23:38:29] pgehres: i'd vote on removing for now though [23:38:45] k, commiting PS6 in just a sec [23:39:29] binasher: done [23:45:26] pgehres: if i was being pedantic, i'd say s/UNIQUE INDEX/PRIMARY KEY/ but meh. does this need to be applied to production before being merged? [23:45:33] hmm chrome's telling me that foundationwiki wants to use my computer's location - any idea why? [23:45:45] that's on https://wikimediafoundation.org/wiki/Special:SpecialPages [23:46:29] binasher: I will happily fix that too, technically it only needs to be applied before the next branch in 10 days, so as long as it is run today or tomorrow, should be fine [23:46:54] pgehres: tomorrow should be ok. this is on all wikis, right? [23:47:00] correct [23:47:08] Thehelpfulone: Location would probably suggest it's something to do with mobile [23:47:25] on the desktop version? that sounds like something's broken [23:47:46] They're the only people who have any use for location at the browser level [23:47:49] and they deployed earlier [23:48:04] MaxSem: awjr jdlrobson yurik ^ [23:48:08] thanks [23:48:32] looks like it only needs to be done as an osc on enwiki, and can be done as a regular alter statement against the master everywhere else [23:48:45] Noting I can also replicate it on chrome [23:48:55] Reedy: ? [23:48:59] wow, enwiki has 124k users! [23:49:08] awjr: [23:49:09] [00:45:33] hmm chrome's telling me that foundationwiki wants to use my computer's location - any idea why? [23:49:09] [00:45:45] that's on https://wikimediafoundation.org/wiki/Special:SpecialPages [23:49:09] lol [23:49:11] Thehelpfulone: seems like nearby code is running on Special:SpecialPages o_O [23:49:22] that is weird. [23:49:26] jdlrobson, on the *desktop* version of the site too.. [23:49:37] Thehelpfulone: Special:Nearby actually works on the desktop version of the site. [23:49:41] it should only run on https://wikimediafoundation.org/wiki/Special:Nearby [23:50:05] it works on that page jdlrobson too [23:50:26] binasher: updated to PRIMARY KEY in PS7 [23:50:40] jdlrobson: i just noticed that on Special:SpecialPages on my local instance, in mobile mode, I see the 'refresh' icon in the upper right corner [23:50:46] what's special about https://en.wikipedia.org/wiki/Special:SpecialPages ? [23:50:51] * jdlrobson greps code [23:50:54] some of the nearby code must not be totaly isolated [23:50:56] pgehres: thanks! i haven't reviewed the php but since others have, i will +2 after applying the changes [23:51:07] binasher: thank you very much! [23:51:09] jdlrobson: Nothing [23:51:13] I get it on enwiki too [23:51:31] https://en.wikipedia.org/wiki/Special:SpecialPages [23:51:58] * MaxSem prepares his revert wrench [23:52:11] Reedy: - so what's happening is for some reason mobile.nearby.scripts is being added to Special:SpecialPages AND Special:Nearby [23:52:16] it doesn't seem to effect other special pages.. [23:52:33] does Special:SpecialPages construct special pages? [23:53:07] this makes no sense.. [23:53:38] jdlrobson, it's a list of all the special pages [23:53:49] wait a minute.. that page is adding mobile styles as well [23:54:29] New review: Tim Starling; "There's no mw-deployment-vars.sh installed on scap client servers, so you can't use MW_RSYNC_ARGS in..." [operations/puppet] (production) C: -1; - https://gerrit.wikimedia.org/r/57890 [23:54:42] jdlrobson: Looks might to some extent [23:54:43] $pages = SpecialPageFactory::getUsablePages( $this->getUser() ); [23:54:47] why would a class extending UnlistedSpecialPage get constructed on Special:SpecialPages ? [23:54:51] foreach ( $pages as $page ) { [23:55:02] Reedy: where's that code.. ? [23:55:21] includes/specials/SpecialSpecialPages.php [23:55:38] @return Array( String => Specialpage ) [23:55:45] but the styles get added class UnlistedSpecialMobilePage extends UnlistedSpecialPage { [23:55:45] [23:55:45] public function __construct( $name, $restriction = '', $function = false, $file = 'default' ) { [23:55:45] parent::__construct( $name, $restriction, false, $function, $file ); [23:55:45] $this->clearPageMargins(); [23:55:46] $this->addModules( $name ); [23:55:53] lolololol [23:56:10] ouch that's a nasty side effect [23:56:31] fixing [23:56:36] well reverting today's deployment won't fix that problem in whole - this has been present for a few weeks [23:57:03] MaxSem: how you planning to fix? [23:57:51] MaxSem: ? [23:58:15] by moving that "magic" outta constructor [23:58:56] where to? [23:59:36] execute ? [23:59:43] yes