[00:01:49] anomie|awayish: So, lighting deploy? Shall I go first given that you're |awayish ?
[00:01:55] PROBLEM - gitblit.wikimedia.org on antimony is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Server Error - 1703 bytes in 6.343 second response time
[00:02:09] RoanKattouw: I'm just waiting for Jenkins to merge, I'll be done in a few minutes
[00:02:17] OK you go first then
[00:02:19] My prep will be slower
[00:03:31] <^d> !log restarted gitblit service on antimony
[00:03:36] !log anomie synchronized php-1.23wmf15/includes/htmlform/ 'Backport [[gerrit:117038]] to fix regression'
[00:03:40] Logged the message, Master
[00:03:47] Logged the message, Master
[00:04:02] !log anomie synchronized php-1.23wmf16/includes/htmlform/ 'Backport [[gerrit:117038]] to fix regression'
[00:04:09] RoanKattouw: Ok, I'm done
[00:04:09] Logged the message, Master
[00:04:15] Sweet
[00:04:18] <^d> RoanKattouw: It's booting.
[00:05:37] <^d> RoanKattouw: Back.
[00:05:44] Cool
[00:05:46] Thanks man
[00:05:55] RECOVERY - gitblit.wikimedia.org on antimony is OK: HTTP OK: HTTP/1.1 200 OK - 179008 bytes in 7.271 second response time
[00:11:39] !log catrope updated /a/common/php-1.23wmf16 to {{Gerrit|I706751cd2}}: Update VisualEditor to wmf16 branch for cherry-pick
[00:11:48] Logged the message, Master
[00:20:05] !log catrope updated /a/common/php-1.23wmf16 to {{Gerrit|Id675e756a}}: Update VisualEditor extension, for the right cherry-pick this time
[00:20:14] Logged the message, Master
[00:20:51] !log catrope synchronized php-1.23wmf16/extensions/VisualEditor/modules/ve-mw/dm/nodes/ve.dm.MWBlockImageNode.js 'Fix 2-pixel image bug'
[00:20:59] Logged the message, Master
[01:12:37] (CR) Catrope: [C: 1] Remove old Tampa srv* and mw* apaches from dsh groups [operations/puppet] - https://gerrit.wikimedia.org/r/108070 (owner: Chad)
[01:27:40] (CR) Chad: Elasticsearch upgrade starting (1 comment) [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117095 (owner: Manybubbles)
[01:47:04] !log s6 xtrabackup clone db1022 to db1010
[01:47:13] Logged the message, Master
[02:11:40] (PS2) Manybubbles: Elasticsearch upgrade starting [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117095
[02:11:55] (CR) Manybubbles: "Good idea, ^d" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117095 (owner: Manybubbles)
[02:25:35] PROBLEM - Host mw31 is DOWN: PING CRITICAL - Packet loss = 100%
[02:26:16] !log LocalisationUpdate completed (1.23wmf15) at 2014-03-06 02:26:16+00:00
[02:26:24] Logged the message, Master
[02:26:25] RECOVERY - Host mw31 is UP: PING OK - Packet loss = 0%, RTA = 35.92 ms
[02:50:59] !log LocalisationUpdate completed (1.23wmf16) at 2014-03-06 02:50:59+00:00
[02:51:08] Logged the message, Master
[03:05:58] (CR) Ryan Lane: [C: 1] Adding git-fat support for trebuchet deployment [operations/puppet] - https://gerrit.wikimedia.org/r/112944 (owner: Ottomata)
[03:20:43] !log db1034 testing /a ext4 noatime,barrier=0
[03:20:52] Logged the message, Master
[03:29:47] !log springle synchronized wmf-config/db-eqiad.php 's1 depool db1049'
[03:29:55] Logged the message, Master
[03:30:35] !log s1 xtrabackup clone db1049 to db1034
[03:30:44] Logged the message, Master
[03:33:45] (CR) Dzahn: turn planet into a module (3 comments) [operations/puppet] - https://gerrit.wikimedia.org/r/108674 (owner: Dzahn)
[03:34:41] !log LocalisationUpdate ResourceLoader cache refresh completed at 2014-03-06 03:34:41+00:00
[03:34:49] (PS10) Dzahn: turn planet into a module [operations/puppet] - https://gerrit.wikimedia.org/r/108674
[03:34:51] Logged the message, Master
[04:19:58] !log springle synchronized wmf-config/db-eqiad.php 's6 repool db1010 warm up'
[04:20:05] Logged the message, Master
[05:24:44] (PS2) Nemo bis: [gdash] Add some yearly graphs [operations/puppet] - https://gerrit.wikimedia.org/r/117020
[05:59:38] (CR) Ori.livneh: [C: 2] "Thank you dearly for the yearly graphs," [operations/puppet] - https://gerrit.wikimedia.org/r/117020 (owner: Nemo bis)
[06:02:38] ori-l, gerrit poet
[06:06:27] mutante|away: :)
[06:08:38] ori: did you see anomie's mail?
[06:08:45] (good morning)
[06:44:05] (CR) Dzahn: [C: 1] "tested one more time on fresh eqiad labs instance. as long as i comment the installation of the certificate, all fine, no warnings, create" [operations/puppet] - https://gerrit.wikimedia.org/r/108674 (owner: Dzahn)
[06:45:10] (CR) Dzahn: "paravoid, could you give this one more review given the comments above?" [operations/puppet] - https://gerrit.wikimedia.org/r/108070 (owner: Chad)
[06:45:12] (CR) Nemo bis: "wow, thanks! :)" [operations/puppet] - https://gerrit.wikimedia.org/r/117020 (owner: Nemo bis)
[06:51:33] James_F|Away: fresh graph at the end of https://gdash.wikimedia.org/dashboards/ve/ , is that normal?
[07:34:37] (PS1) Springle: repool db1010 and db1034 [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117166
[07:35:04] (CR) Springle: [C: 2] repool db1010 and db1034 [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117166 (owner: Springle)
[07:35:11] (Merged) jenkins-bot: repool db1010 and db1034 [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117166 (owner: Springle)
[07:44:25] !log springle synchronized wmf-config/db-eqiad.php 's6 db1010 full steam, s1 db1034 warm up'
[07:44:34] Logged the message, Master
[08:24:52] paravoid: no, i hadn't, but i just read it now. thanks for flagging it. i'll look into it right now.
[09:04:29] (PS2) Hashar: beta: vary deployment-bastion by ::site [operations/puppet] - https://gerrit.wikimedia.org/r/116982
[09:04:31] (PS2) Hashar: beta: on eqiad mount /srv using labs_lvm [operations/puppet] - https://gerrit.wikimedia.org/r/116987
[09:37:19] !log springle synchronized wmf-config/db-eqiad.php 's1 db1034 full steam'
[09:37:27] Logged the message, Master
[09:40:24] (CR) Hashar: "On the beta cluster, the call_geoip cookie cause Varnish to eat the central auth cookies preventing new users from login in. See bug 62244" [operations/puppet] - https://gerrit.wikimedia.org/r/113935 (owner: Ori.livneh)
[09:41:14] <_joe_> /win 15
[09:41:18] <_joe_> ooops sorry
[09:41:28] (PS2) Faidon Liambotis: Revert "CentralNotice: set $wgCentralGeoScriptURL to false" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117086 (owner: Ori.livneh)
[09:41:39] (PS2) Faidon Liambotis: Disable cookie-based geolocation on text varnishes [operations/puppet] - https://gerrit.wikimedia.org/r/117004 (owner: Ori.livneh)
[09:41:49] (CR) Faidon Liambotis: [C: 2] Revert "CentralNotice: set $wgCentralGeoScriptURL to false" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117086 (owner: Ori.livneh)
[09:41:57] (Merged) jenkins-bot: Revert "CentralNotice: set $wgCentralGeoScriptURL to false" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117086 (owner: Ori.livneh)
[09:42:52] !log faidon updated /a/common to {{Gerrit|Id235690c6}}: Revert "CentralNotice: set $wgCentralGeoScriptURL to false"
[09:43:01] Logged the message, Master
[09:43:41] !log faidon synchronized wmf-config/CommonSettings.php 'revert wgCentralGeoScriptURL to false'
[09:43:48] Logged the message, Master
[09:44:01] (CR) Faidon Liambotis: [C: 2] Disable cookie-based geolocation on text varnishes [operations/puppet] - https://gerrit.wikimedia.org/r/117004 (owner: Ori.livneh)
[10:00:18] (CR) Hashar: "Geoip in cookie has been disabled with https://gerrit.wikimedia.org/r/#/c/117004/" [operations/puppet] - https://gerrit.wikimedia.org/r/113935 (owner: Ori.livneh)
[10:22:50] who has the keys to change mailman passwords?
[10:23:14] File a bug in bugzilla under Mailing Lists
[10:23:45] yes, but then how do we get the password?
[10:24:01] and what is the turn around time?
[10:25:00] when I ran the mailing lists at RootsWeb we had a hack over the top so admins could do it themselves
[10:26:26] sDrewth: email, depends if people are around
[10:26:29] though I suppose with mail-jacking that is less feasible these days
[10:26:51] okay, so don't request it on the weekend
[10:35:17] sDrewth: i'll handle. the best way is rt request. mutante|away will probably handle
[10:53:23] hi! I often get a 503 error trying to query wikidata
[10:53:34] for example: curl http://www.wikidata.org/wiki/Special:EntityData/Q52.json
[10:54:27] am I doing something wrong? is there a better way to get JSON representations of wikidata entries?
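(Editor's sketch, not part of the log.) The curl request above can be wrapped in a small retry loop for intermittent 503 responses. The function names, retry count, and delay are assumptions made for illustration; only the URL shape comes from the message above. If the 503 is deterministic for a given entity, as the reply that follows suggests, retrying will not help.

```python
import time
import urllib.error
import urllib.request


def entity_data_url(entity_id, base="http://www.wikidata.org"):
    """Build the Special:EntityData JSON URL for an entity such as 'Q52'."""
    return f"{base}/wiki/Special:EntityData/{entity_id}.json"


def fetch_with_retry(url, fetch=None, retries=3, delay=1.0):
    """Fetch url, retrying only on HTTP 503; any other error propagates.

    `fetch` is injectable (hypothetical parameter) so the retry logic can be
    exercised without network access.
    """
    if fetch is None:
        fetch = lambda u: urllib.request.urlopen(u).read()
    for attempt in range(retries):
        try:
            return fetch(url)
        except urllib.error.HTTPError as err:
            # Give up on non-503 errors, or once the last attempt has failed.
            if err.code != 503 or attempt == retries - 1:
                raise
            time.sleep(delay * (attempt + 1))
```

Typical use would be `fetch_with_retry(entity_data_url("Q52"))`, which returns the raw response bytes or raises the last error.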
[12:44:09] ema: IIRC it's a known bug for some entities, check bugzilla
[12:56:09] Nemo_bis: right, bug #60003
[12:56:14] Nemo_bis: thanks
[13:18:33] :)
[13:55:25] PROBLEM - MySQL Slave Running on db1007 is CRITICAL: CRIT replication Slave_IO_Running: Yes Slave_SQL_Running: No Last_Error: Error Table ./metawiki/translate_messageindex is marked as crashed
[13:59:25] RECOVERY - MySQL Slave Running on db1007 is OK: OK replication Slave_IO_Running: Yes Slave_SQL_Running: Yes Last_Error:
[14:13:56] (PS1) Andrew Bogott: Don't hard-code the labs status region to pmtpa [operations/puppet] - https://gerrit.wikimedia.org/r/117179
[14:15:46] (CR) Andrew Bogott: [C: 2] Don't hard-code the labs status region to pmtpa [operations/puppet] - https://gerrit.wikimedia.org/r/117179 (owner: Andrew Bogott)
[14:16:15] (PS5) Aude: Setup test.wikidata as repo for test2 and test.wikipedia [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/115366
[14:17:44] (CR) Mark Bergsma: [C: 2] DNS setup for legalteamwiki [operations/dns] - https://gerrit.wikimedia.org/r/116220 (owner: Jalexander)
[14:26:07] (PS3) Mark Bergsma: Apache config for legalteamwiki [operations/apache-config] - https://gerrit.wikimedia.org/r/116219 (owner: Jalexander)
[14:27:05] (CR) Mark Bergsma: [C: 2] Apache config for legalteamwiki [operations/apache-config] - https://gerrit.wikimedia.org/r/116219 (owner: Jalexander)
[14:31:37] (CR) Hoo man: [C: 2] "Let's do this" [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/115366 (owner: Aude)
[14:32:33] (Merged) jenkins-bot: Setup test.wikidata as repo for test2 and test.wikipedia [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/115366 (owner: Aude)
[14:32:58] :)
[14:34:52] !log hoo synchronized wmf-config/ 'Setup test.wikidata as repo for test2 and test.wikipedia {{Gerrit|I6f4c512}}'
[14:35:00] Logged the message, Master
[14:35:01] \o/
[14:35:34] !log hoo synchronized wikidataclient.dblist 'Setup test.wikidata as repo for test2 and test.wikipedia {{Gerrit|I6f4c512}}'
[14:35:41] Logged the message, Master
[14:35:52] aude: Ok... shall we bump the parser cache epoch now or not needed?
[14:36:23] might not be necessary
[14:36:40] ok :) People can purge, if needed
[14:36:47] not that many items
[14:37:05] we're done, then
[14:37:43] all looks ok
[14:38:01] except i need to make new test items on test wikidata
[14:38:48] aude: Did you already verify?
[14:39:17] i verify test2 does not use wikidata
[14:39:54] wikipedia still uses wikidata :)
[14:40:08] That was the first thing I tested :D
[14:40:38] https://test2.wikipedia.org/wiki/Kitten
[14:40:40] works
[14:41:16] hooray :)
[14:41:54] (PS3) Ottomata: Adding git-fat support for trebuchet deployment [operations/puppet] - https://gerrit.wikimedia.org/r/112944
[14:41:58] (PS6) TTO: Initial setup for legalteamwiki [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/112850
[14:42:12] (CR) Ottomata: [C: 2 V: 2] Adding git-fat support for trebuchet deployment [operations/puppet] - https://gerrit.wikimedia.org/r/112944 (owner: Ottomata)
[14:45:53] (CR) Reedy: [C: 2] Initial setup for legalteamwiki [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/112850 (owner: TTO)
[14:45:59] (PS4) Jforrester: Enable VisualEditor for legalteamwiki by default [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/112717
[14:46:04] (Merged) jenkins-bot: Initial setup for legalteamwiki [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/112850 (owner: TTO)
[14:46:10] (CR) Jforrester: [C: 1] Enable VisualEditor for legalteamwiki by default [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/112717 (owner: Jforrester)
[14:46:38] !log reedy synchronized database lists files:
[14:46:46] Logged the message, Master
[14:48:24] !log reedy rebuilt wikiversions.cdb and synchronized wikiversions files: add legalteamwiki
[14:48:31] Logged the message, Master
[14:48:51] !log reedy updated /a/common to {{Gerrit|I495e02449}}: Initial setup for legalteamwiki
[14:48:56] (PS1) Reedy: Add legalteamwiki [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117188
[14:49:00] Logged the message, Master
[14:49:14] (CR) Reedy: [C: 2] Add legalteamwiki [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117188 (owner: Reedy)
[14:49:16] (CR) jenkins-bot: [V: -1] Add legalteamwiki [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117188 (owner: Reedy)
[14:49:27] stfu jenkins
[14:49:55] (CR) Reedy: [V: 2] Add legalteamwiki [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117188 (owner: Reedy)
[14:49:59] that is most probably Gerrit yielding a 'host mismatch key' :(
[14:50:42] !log reedy synchronized wmf-config/ 'legalteamwiki config and touch of IS'
[14:50:49] Logged the message, Master
[14:51:46] (PS1) Mark Bergsma: Revert "Apache config for legalteamwiki" [operations/apache-config] - https://gerrit.wikimedia.org/r/117190
[14:52:30] (CR) Mark Bergsma: [C: 2] Revert "Apache config for legalteamwiki" [operations/apache-config] - https://gerrit.wikimedia.org/r/117190 (owner: Mark Bergsma)
[14:53:30] (PS2) Hashar: beta: natfixup for eqiad public IP addresses [operations/puppet] - https://gerrit.wikimedia.org/r/116787
[14:55:32] (CR) coren: [C: 2] "That works." [operations/puppet] - https://gerrit.wikimedia.org/r/116987 (owner: Hashar)
[14:55:52] (PS3) Hashar: beta: vary deployment-bastion by ::site [operations/puppet] - https://gerrit.wikimedia.org/r/116982
[14:57:11] (PS3) Hashar: beta: on eqiad mount /srv using labs_lvm [operations/puppet] - https://gerrit.wikimedia.org/r/116987
[14:57:39] (CR) coren: [C: 2] "Trivial enough." [operations/puppet] - https://gerrit.wikimedia.org/r/116787 (owner: Hashar)
[14:57:48] !!
[14:58:49] (PS1) Mark Bergsma: Revert "Revert "Apache config for legalteamwiki"" [operations/apache-config] - https://gerrit.wikimedia.org/r/117191
[14:58:59] (CR) Mark Bergsma: [C: 2] Revert "Revert "Apache config for legalteamwiki"" [operations/apache-config] - https://gerrit.wikimedia.org/r/117191 (owner: Mark Bergsma)
[14:59:35] (CR) coren: [C: 2] "Yeah $::site!" [operations/puppet] - https://gerrit.wikimedia.org/r/116982 (owner: Hashar)
[15:00:33] (CR) coren: [C: 2] "LVM > fixed partitions" [operations/puppet] - https://gerrit.wikimedia.org/r/116987 (owner: Hashar)
[15:01:35] (CR) coren: [C: 2] "Package add." [operations/puppet] - https://gerrit.wikimedia.org/r/112202 (owner: Tim Landscheidt)
[15:02:39] (PS1) Hashar: contint: clone some more repos for labs CI slaves [operations/puppet] - https://gerrit.wikimedia.org/r/117193
[15:03:45] (CR) coren: [C: 2] "What's the worst that could happen?™" [operations/puppet] - https://gerrit.wikimedia.org/r/117193 (owner: Hashar)
[15:03:54] lol
[15:05:00] hashar: pass him over to me when he is done :)
[15:05:05] (CR) coren: [C: -1] "I see no point to this; puppet in pmtpa tools is disabled until it dies." [operations/puppet] - https://gerrit.wikimedia.org/r/115609 (owner: Tim Landscheidt)
[15:09:03] hashar: jenkins is moving to labs?
[15:09:22] the slaves
[15:12:02] (PS1) Hashar: labs_lvm: logoutput => on_failure for all exec calls [operations/puppet] - https://gerrit.wikimedia.org/r/117199
[15:13:27] aude: some yeah
[15:13:37] yay!
[15:13:39] aude: because there is a bunch of stuff I am not willing to package for production
[15:13:58] would be nice to get rid of the wikidata jenkins and use wmf jenkins on labs
[15:14:11] aude: there is no a few slaves in labs for my own use. Would announce that properly once labs is migrated to eqiad
[15:14:21] but been nice to make jenkins work the way we need
[15:14:21] potentially, we could use composer on them.
[15:14:24] yep
[15:14:31] for python script I am already using tox / venv / pip
[15:14:36] * aude nods
[15:14:37] and javascript has a bunch of npm jobs there as well
[15:19:33] !log Jenkins: added phpunit/phpcs/kss on the labs slaves. Flagged them with label hasPhpUnit hasPhpcs
[15:19:41] Logged the message, Master
[15:21:59] (PS1) coren: Labs: Minor fixes to labs_lvm::volume [operations/puppet] - https://gerrit.wikimedia.org/r/117200
[15:22:33] (PS5) Reedy: Remove old Tampa srv* and mw* apaches from dsh groups [operations/puppet] - https://gerrit.wikimedia.org/r/108070 (owner: Chad)
[15:22:45] (CR) jenkins-bot: [V: -1] Remove old Tampa srv* and mw* apaches from dsh groups [operations/puppet] - https://gerrit.wikimedia.org/r/108070 (owner: Chad)
[15:23:32] (CR) coren: [C: 2] "Minor fixes are minor." [operations/puppet] - https://gerrit.wikimedia.org/r/117200 (owner: coren)
[15:27:27] (PS5) Jforrester: Enable VisualEditor for legalteamwiki by default [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/112717
[15:27:39] (CR) Reedy: [C: 2] Enable VisualEditor for legalteamwiki by default [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/112717 (owner: Jforrester)
[15:27:47] (Merged) jenkins-bot: Enable VisualEditor for legalteamwiki by default [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/112717 (owner: Jforrester)
[15:28:58] !log reedy synchronized database lists files: Ia651454e773e6f26f84c334b51fa64cb3dd44762
[15:29:05] Logged the message, Master
[15:29:34] !log reedy synchronized wmf-config/InitialiseSettings.php 'touch'
[15:29:43] Logged the message, Master
[15:30:33] Reedy: Care to review https://gerrit.wikimedia.org/r/#/c/117154/ before I scap a broken l10n file again today?
[15:30:45] (PS1) coren: Labs: more minor fixes to labs_lvm [operations/puppet] - https://gerrit.wikimedia.org/r/117202
[15:31:17] (CR) coren: [C: 2] Labs: more minor fixes to labs_lvm [operations/puppet] - https://gerrit.wikimedia.org/r/117202 (owner: coren)
[15:33:18] (Abandoned) Tim Landscheidt: Tools: Set group for $sysdir according to $::site [operations/puppet] - https://gerrit.wikimedia.org/r/115609 (owner: Tim Landscheidt)
[15:50:08] (CR) Alexandros Kosiaris: [C: 2] Fix ruby stability hash issue in recursor.conf [operations/puppet] - https://gerrit.wikimedia.org/r/116978 (owner: Alexandros Kosiaris)
[15:51:54] (PS1) Reedy: Don't check tampa apaches in apache-fast-test [operations/puppet] - https://gerrit.wikimedia.org/r/117204
[16:01:28] (PS2) Hashar: Configuration for beta cluster caches in eqiad [operations/puppet] - https://gerrit.wikimedia.org/r/115629
[16:01:42] (CR) Mark Bergsma: [C: 2] Don't check tampa apaches in apache-fast-test [operations/puppet] - https://gerrit.wikimedia.org/r/117204 (owner: Reedy)
[16:02:09] bd808: deploying today?
[16:02:25] aude: Yup
[16:03:29] (PS3) Hashar: Configuration for beta cluster caches in eqiad [operations/puppet] - https://gerrit.wikimedia.org/r/115629
[16:03:41] (PS1) Reedy: Remove pmtpa apaches from site.pp,dhcpd [operations/puppet] - https://gerrit.wikimedia.org/r/117206
[16:03:47] (CR) Hashar: "Updated deployment-apache01 instance IP address since it got deleted and recreated." [operations/puppet] - https://gerrit.wikimedia.org/r/115629 (owner: Hashar)
[16:03:54] bd808: nothing of note for wikidata
[16:04:13] aude: I merged the version bump patch from hoo
[16:04:13] well, deploy to testwikidata
[16:04:26] great
[16:04:29] (PS2) Dzahn: remove sockpuppet from puppet [operations/puppet] - https://gerrit.wikimedia.org/r/115527
[16:05:01] hoo: "deploy to testwikidata"? Is that just a normal group0 host?
[16:05:15] it is
[16:05:16] bd808: Yep, nothing to worry about ;)
[16:05:29] :)
[16:07:01] (PS1) Alexandros Kosiaris: Stabilize the result of stdlib's keys function [operations/puppet] - https://gerrit.wikimedia.org/r/117207
[16:09:36] (PS6) Reedy: Remove old Tampa srv* and mw* apaches from dsh groups [operations/puppet] - https://gerrit.wikimedia.org/r/108070 (owner: Chad)
[16:10:48] (CR) Mark Bergsma: [C: 2] Remove old Tampa srv* and mw* apaches from dsh groups [operations/puppet] - https://gerrit.wikimedia.org/r/108070 (owner: Chad)
[16:12:14] (PS3) Dzahn: remove sockpuppet from puppet [operations/puppet] - https://gerrit.wikimedia.org/r/115527
[16:12:27] * bd808 applauds mark for making scap twice as fast (in theory)
[16:12:59] no
[16:13:03] just half as busy ;)
[16:13:43] Think of the poor overworked electrons!
[16:16:42] (PS7) Reedy: Remove db and job queue pmtpa files [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/116036
[16:16:47] (CR) Reedy: [C: 2] Remove db and job queue pmtpa files [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/116036 (owner: Reedy)
[16:17:01] (Merged) jenkins-bot: Remove db and job queue pmtpa files [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/116036 (owner: Reedy)
[16:17:35] (CR) Dzahn: [C: 2] remove sockpuppet from puppet [operations/puppet] - https://gerrit.wikimedia.org/r/115527 (owner: Dzahn)
[16:17:46] !log reedy synchronized docroot and w
[16:17:56] Logged the message, Master
[16:18:10] !log reedy synchronized wmf-config/
[16:18:18] Logged the message, Master
[16:18:40] (CR) Alexandros Kosiaris: [C: 2] Stabilize the result of stdlib's keys function [operations/puppet] - https://gerrit.wikimedia.org/r/117207 (owner: Alexandros Kosiaris)
[16:20:39] akosiaris: hashes shouldn't be assumed to be stable though
[16:20:54] (PS1) Reedy: Remove pmtpa apaches [operations/dns] - https://gerrit.wikimedia.org/r/117210
[16:20:55] mark: depends on the ruby version
[16:21:06] 1.9.3+ has stable hashes IIRC
[16:21:07] oh of course, ruby
[16:21:24] but yeah, safer to not assume stability of hashes anyway
[16:21:42] Reedy: leave that part to the ops decommissioning process
[16:21:43] but this constant change in the catalog was making my differ unhappy
[16:21:49] it would only upset people anyway ;-)
[16:22:11] there's a stringent server lifecycle document around that hehe
[16:22:20] awww
[16:22:34] Reedy: they'll want you to leave the mgmt in there for wiping
[16:22:47] but remove the others.. oh well
[16:23:03] Should I amend it?
[16:23:28] sure, why not, we'll just use it later
[16:24:32] touched the post commit hook and puppet-merge to remove sockpuppet
[16:24:55] (PS2) Reedy: Remove pmtpa apaches, leaving management entries [operations/dns] - https://gerrit.wikimedia.org/r/117210
[16:41:16] Common ruby :(
[16:41:16] - result = hash.keys
[16:41:16] + result = hash.keys.sort
[16:43:12] hashar: That seems like a php-ism. Who besides a php trained dev expects a hash to preserve insertion order?
[16:43:56] All though I guess java has LinkedHashSet and friends
[16:47:28] <_joe_> bd808|deploy, python has collections.OrderedDict as well
[16:47:54] * bd808|deploy nods
[16:48:38] * _joe_ goes back to lurking
[16:50:28] :)
[16:51:11] greg-g: we have a hhvm jenkins job running mw/core unit tests !!!!!
[16:51:15] greg-g: thanks to ori :]
[16:51:51] on our jenkins? sweet!
[16:51:58] before it was just travis
[16:52:01] that's great news
[16:52:09] bd808|deploy: yeah that is why I hate PHP. It has a bunch of awesome features that are not matched by other languages.
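(Editor's sketch, not part of the log.) The `hash.keys.sort` change quoted above makes generated output independent of hash iteration order, which is what keeps a catalog diff quiet. The same idea in Python, with a hypothetical `render_settings` helper and made-up setting names:

```python
def render_settings(settings):
    """Render key=value lines in sorted key order.

    Sorting the keys means the output does not depend on the order in which
    entries were inserted into the mapping.
    """
    return "\n".join(f"{key}={settings[key]}" for key in sorted(settings))


# Two mappings with the same contents but different insertion order
# (illustrative values only) render to identical text.
first = {"forward-zones": "wmnet", "allow-from": "10.0.0.0/8"}
second = {"allow-from": "10.0.0.0/8", "forward-zones": "wmnet"}
same_output = render_settings(first) == render_settings(second)
```

Sorting trades insertion order for determinism; where order is meaningful, an ordered mapping such as the `collections.OrderedDict` mentioned above is the alternative.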
[16:52:22] greg-g: yeah ori got a package and figured out the command to run
[16:52:31] bd808|deploy: holy dependency chain, batman: https://gerrit.wikimedia.org/r/#/c/117014/
[16:52:34] s/awesome features/constant overhead/
[16:52:52] greg-g: Yeah it got a little out of hand
[16:52:55] :)
[16:53:00] I got a bunch of PHP culprit on my laptop
[16:53:03] need to do a pres
[16:53:24] hashar: oh right, the package from fb, since the debian one is stalled (afaict)
[16:53:33] the most hated of all is $myArray + $someArray
[16:53:47] greg-g: I guess so
[16:53:55] (PS3) Alexandros Kosiaris: remove sockpuppet, decom [operations/dns] - https://gerrit.wikimedia.org/r/115688 (owner: Matanya)
[16:54:27] good luck bd808|deploy on the deploy!
[16:57:50] * bd808|deploy is prepping 1.23wmf17 on tin
[17:00:15] PROBLEM - Host mw31 is DOWN: PING CRITICAL - Packet loss = 100%
[17:01:15] RECOVERY - Host mw31 is UP: PING OK - Packet loss = 0%, RTA = 35.60 ms
[17:03:35] PROBLEM - Apache HTTP on mw31 is CRITICAL: HTTP CRITICAL: HTTP/1.0 500 Internal Server Error - 50631 bytes in 0.151 second response time
[17:05:26] I am off, see you tomorrow
[17:09:51] (PS1) BryanDavis: Add 1.23wmf17 symlinks [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117221
[17:09:53] (PS1) BryanDavis: Wikipedias to 1.23wmf16 [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117222
[17:09:55] (PS1) BryanDavis: Group0 wikis to 1.23wmf17 [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117223
[17:12:37] (CR) BryanDavis: [C: 2] Add 1.23wmf17 symlinks [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117221 (owner: BryanDavis)
[17:13:09] * bd808|deploy waits for gerrit
[17:16:03] (Merged) jenkins-bot: Add 1.23wmf17 symlinks [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117221 (owner: BryanDavis)
[17:16:17] 4 minutes?!
[17:16:34] waiting on zuul?
[17:16:59] Yeah. It took 4 minutes to merge the symlink patch
[17:17:06] <^d> gerrit said 3.
[17:17:36] My irc timestamps say 4 but tomato tomato
[17:17:43] I just want to know what hardware I can try to get approved so its never more than 15 seconds ;]
[17:17:48] I was watching zuul when that happened
[17:17:59] Looked like loads of stuff was just sitting queued and then suddenly stuff got run
[17:18:38] even when its slow, im just happy i can actually watch them run now
[17:18:47] recall that wasnt available for a long while =]
[17:19:06] at least now i can see if there is a ton of crap in front of my tests
[17:19:09] https://gerrit.wikimedia.org/r/#/c/104711/ - 13 minutes
[17:20:16] !log sockpuppet - disabling puppet,disabling monitoring,remove from stored configs,revoke puppet cert,delete salt key
[17:20:25] Logged the message, Master
[17:20:34] Reedy: Killing sockpuppet.pmtpa.wmnet...done.
[17:20:39] wooo
[17:20:43] Key for minion sockpuppet.pmtpa.wmnet deleted.
[17:20:57] another one bites the dust
[17:21:10] so that one has a lot of important data on it
[17:21:19] make sure it has its very own wipe ticket please =]
[17:21:32] (they all should, but just sayin)
[17:22:15] (CR) Alexandros Kosiaris: [C: 2] "It is referenced, albeit through a fact." [operations/puppet] - https://gerrit.wikimedia.org/r/116953 (owner: Matanya)
[17:22:41] I am really tired of looking at racks and racks accessories. it may be time to do my backlogged personal expense reports for a bit. (thats how tired of racks i am)
[17:22:50] =P
[17:23:03] we just need one more question answered here
[17:23:06] "Ryan, is multimaster safe to turn on again?"
[17:23:09] for salt
[17:23:23] apergos: that was your question actually, just saw it
[17:23:40] yes, it's my question
[17:31:10] (PS1) Ottomata: Adding submodule update support to git::clone [operations/puppet] - https://gerrit.wikimedia.org/r/117229
[17:32:08] (CR) Ottomata: "Andrew, I am mostly doing this so that puppet::self instances can automatically have git submodules updated in their clones of operations/" [operations/puppet] - https://gerrit.wikimedia.org/r/117229 (owner: Ottomata)
[17:44:32] (CR) Ori.livneh: [C: -1] "There's a '--recurse-submodules' argument you can pass the initial git-clone operations. (This suggests that the name of the Puppet parame" [operations/puppet] - https://gerrit.wikimedia.org/r/117229 (owner: Ottomata)
[17:48:56] greg-g: Ready for the scap and testwiki to 1.23wmf17?
[17:50:03] you're a bit early :) but nothing else is going on. but, one sec... VE may need a backport...
[17:50:45] greg-g: Oh, ok. my notes from Sam say the scap happens before the window.
[17:50:53] oh right...
[17:51:20] yeah, sorry, the testwiki to 17 part confused me, but if that's what normally happens, fine
[17:56:08] greg-g: Imma gonna do it now then
[17:56:17] kk
[17:56:28] !log bd808 Started scap: testwiki to php-1.23wmf17 and rebuild l10n cache
[17:56:36] Logged the message, Master
[17:57:09] !log bd808 scap aborted: testwiki to php-1.23wmf17 and rebuild l10n cache (duration: 00m 41s)
[17:57:22] Logged the message, Master
[17:57:28] ...
[17:57:29] Blerg. l10n cache patch not quite right
[17:57:33] ah
[17:57:43] though you no-op'd yesterday?
[17:57:43] rm: cannot remove `/a/common/php-1.23wmf17/cache/l10n/l10n_cache-en.cdb': Permission denied
[17:57:46] oh man
[17:58:01] I think I can fix...
[18:03:09] greg-g: I made a patch. Hopefully ori is looking at it
[18:03:19] k
[18:04:43] ori: Do you have a second to force that to update on tin?
[18:04:53] It's not needed on the other hosts
[18:05:56] ori: Nevermind. It pulled for me
[18:06:35] !log bd808 Started scap: testwiki to php-1.23wmf17 and rebuild l10n cache (try #2)
[18:06:44] Logged the message, Master
[18:09:49] !log shut down sockpuppet - bye for good
[18:09:58] Logged the message, Master
[18:11:28] bd808|deploy: do you notice a difference without tampa ?
[18:11:46] Not sure yet. Just getting to that part
[18:11:54] ok:)
[18:12:59] mutante: (ok: 1; fail: 0; left: 235). That's ~200 fewer hosts. We'll see if it's really faster or not
[18:13:19] PROBLEM - Apache HTTP on mw40 is CRITICAL: HTTP CRITICAL: HTTP/1.0 500 Internal Server Error - 50631 bytes in 0.150 second response time
[18:13:26] (PS1) Jforrester: Enable VisualEditor by default on French Wikiveristy [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117238
[18:13:28] PROBLEM - Apache HTTP on srv270 is CRITICAL: HTTP CRITICAL: HTTP/1.0 500 Internal Server Error - 50631 bytes in 0.184 second response time
[18:13:38] hrmm
[18:13:46] what's up with those
[18:14:08] PHP Warning: require(/usr/local/apache/common-local/php-1.23wmf15/../wmf-config/db.php): failed to open stream: No such file or directory in /usr/local/apache/common-local/wmf-config/CommonSettings.php on line 235
[18:14:22] (CR) Jforrester: [C: 1] Enable VisualEditor by default on French Wikiveristy [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117238 (owner: Jforrester)
[18:16:44] mutante: That error I pasted seems to be coming from srv270, mw31, mw40 mostly
[18:17:05] scap-proxies:mw40.pmtpa.wmnet
[18:17:22] scap-proxies:srv270.pmtpa.wmnet
[18:17:31] mw31 i don't see anymore in dsh
[18:17:40] but the other 2 have that in common, they are scap-proxies
[18:17:46] what does that mean
[18:18:03] * bd808|deploy looks around for Reedy
[18:20:08] PROBLEM - Disk space on snapshot1004 is CRITICAL: DISK CRITICAL - free space: / 1055 MB (3% inode=68%):
[18:20:18] PROBLEM - Disk space on terbium is CRITICAL: DISK CRITICAL - free space: / 1086 MB (3% inode=75%):
[18:20:48] PROBLEM - Disk space on snapshot1003 is CRITICAL: DISK CRITICAL - free space: / 1055 MB (3% inode=69%):
[18:21:36] Reedy: bd808|deploy ..so, there is no db.php file there, but instead
[18:21:41] db-eqiad.php db-labs.php db-secondary.php
[18:21:44] mutante: wmf15 is going to be gone in about 30 minutes...
[18:21:58] so it's right about "no such file"
[18:21:58] !log bd808 Finished scap: testwiki to php-1.23wmf17 and rebuild l10n cache (try #2) (duration: 15m 22s)
[18:22:07] Logged the message, Master
[18:22:08] PROBLEM - Disk space on snapshot1001 is CRITICAL: DISK CRITICAL - free space: / 1034 MB (3% inode=69%):
[18:22:08] PROBLEM - Disk space on snapshot1002 is CRITICAL: DISK CRITICAL - free space: / 866 MB (3% inode=69%):
[18:22:35] * bd808|deploy thinks that's a change Reedy synced this morning...
[18:24:01] mutante: https://gerrit.wikimedia.org/r/#/c/116036/
[18:24:15] Sam deleted the db-pmtpa.php file
[18:24:34] what it is looking for was just db.php
[18:24:59] Sort of. Multiversion falls back to that when it can't see the realm specific file
[18:26:23] What's up?
[18:27:26] you broke it all Reedy :)
[18:27:45] (PS1) BryanDavis: Make an empty db-pmtpa.php file [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117239
[18:27:47] What's new?
[18:27:51] Reedy: ^
[18:27:58] What's trying to read it? :|
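(Editor's sketch, not part of the log.) The fallback described above, where a site-specific file such as db-pmtpa.php is preferred and plain db.php is used when it is missing, can be illustrated as follows. This is an assumption-laden illustration, not the actual multiversion code: the function name, the `exists` parameter, and the naming scheme are inferred from the file names mentioned in the conversation.

```python
import os


def realm_specific_filename(path, datacenter, exists=os.path.exists):
    """Return '<root>-<datacenter><ext>' if it exists, else the generic path.

    `exists` is injectable (hypothetical parameter) so the lookup can be
    checked against a fake file listing.
    """
    root, ext = os.path.splitext(path)
    candidate = f"{root}-{datacenter}{ext}"
    return candidate if exists(candidate) else path


# With only db-eqiad.php present (as in the directory listing above), an
# eqiad host resolves to its specific file, while a pmtpa host falls back to
# db.php, which is the file the PHP warning reports as missing.
present = {"wmf-config/db-eqiad.php"}
eqiad_file = realm_specific_filename("wmf-config/db.php", "eqiad", present.__contains__)
pmtpa_file = realm_specific_filename("wmf-config/db.php", "pmtpa", present.__contains__)
```

Under this reading, deleting db-pmtpa.php without providing a db.php leaves pmtpa hosts with nothing to load, which matches the patch above that adds an empty db-pmtpa.php.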
:| [18:28:19] db.php from the scap fanout servers [18:28:30] srv270, mw31, mw40 [18:28:40] stop scapping to them then :D [18:28:43] We're not sure why mw31 is in there [18:28:51] too late :) [18:28:52] Reedy: it wanted db.php on hosts like mw31 [18:29:11] (CR) jenkins-bot: [V: -1] Enable VisualEditor by default on French Wikiveristy [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117238 (owner: Jforrester) [18:29:17] and the ones marked as scap proxies have errors [18:29:49] Reedy: scap-proxies:srv270.pmtpa.wmnet scap-proxies:mw40.pmtpa.wmnet and they have 500 Internal Server Error since last scap [18:30:09] nothing should be targeting them [18:30:12] bar maybe scap stuff [18:30:18] (CR) Alexandros Kosiaris: [C: 1] "Puppet catalog differ reports no change (apart from the comment). LGTM" [operations/puppet] - https://gerrit.wikimedia.org/r/109869 (owner: Matanya) [18:30:27] And ganglia/icinga [18:30:47] Reedy: still in dsh, just those 2 though, not mw31 [18:31:01] wasn't sure what it means that they are scap-proxies [18:31:09] solution: add role::appserver::dead_and_buried [18:31:10] It's part of the fanout [18:31:16] It syncs to N servers first [18:31:16] scap uses slave rsync servers [18:31:27] Then the rest of the servers use those N to sync from, rather than just tin [18:31:33] so they should stay in there for now [18:31:38] even though they are tampa? [18:31:39] mutante: If I create a ticket... which Queue should I use? [18:31:42] Nope [18:31:47] about the terbium low disk thing [18:31:49] (CR) Alexandros Kosiaris: [C: 2] varnish: puppet 3 compatibility fix: correct variable [operations/puppet] - https://gerrit.wikimedia.org/r/109869 (owner: Matanya) [18:31:49] We should just stop the syncs to them in the scap scripts [18:32:08] hoo: just mail ops-requests@rt.wikimedia.org [18:32:14] Reedy: works for me.
Plus touch db.php on them [18:32:29] To keep icinga checks from whining [18:32:59] heh, yeah [18:33:18] Gotta go. Back in a bit... [18:33:26] mutante: DECOMMISSION ALL THE APP SERVERS [18:33:26] :D [18:34:21] mutante: Do you wanna make the patch to pull them from the dsh group or should I and you can review/merge? [18:34:23] Reedy: but but.. you left them in ..wasnt on purpose? [18:34:59] bd808|deploy: now that you ask ...:) would be nice if you can, didnt even get to make coffee yet [18:35:15] * bd808|deploy starts a patch [18:37:09] thx [18:37:39] mutante: I suppose merging https://gerrit.wikimedia.org/r/#/c/115688 can be done, right ? sockpuppet is all done, may he RIP [18:39:17] akosiaris: yes, i shut it down [18:39:26] and the world didnt explode [18:39:57] and also removed where it was mentioned in puppet-merge [18:40:30] (PS2) Jforrester: Enable VisualEditor by default on French Wikiversity [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117238 [18:40:48] mutante: ok merging then [18:40:55] (CR) Alexandros Kosiaris: [C: 2] remove sockpuppet, decom [operations/dns] - https://gerrit.wikimedia.org/r/115688 (owner: Matanya) [18:42:46] bd808|deploy: unlikely you need me for any deployment stuff, but i am back in half hour or so [18:42:50] * aude needs to go home [18:42:58] * bd808|deploy waves to aude [18:43:00] mutante akosiaris don't forget https://gerrit.wikimedia.org/r/#/c/116936/ [18:43:03] Go home and eat [18:43:06] if this is part of it [18:43:11] :) [18:43:16] If there's WD stuff, I'm also around [18:43:24] although I may also head off briefly [18:43:25] * aude tormented by colleagues who got food [18:43:39] (PS1) BryanDavis: Remove all pmtpa hosts from dsh scap-proxies group [operations/puppet] - https://gerrit.wikimedia.org/r/117244 [18:44:21] mutante: ^^ after your coffee [18:44:35] matanya: yes I am on it, thanks :-) [18:44:42] ori: i'm trying --recurse-submodules on git pull [18:45:07] i think it only works with
fetch [18:45:17] modules(5)). That [18:45:17] might be necessary to get the data needed for merging submodule commits, a feature Git learned in 1.7.3. Notice that the result of a [18:45:17] merge will not be checked out in the submodule, "git submodule update" has to be called afterwards [18:45:51] but, i can just --recurse-submodules on clone [18:45:52] greg-g: Other than the scap-proxies from pmtpa whining, the scap went well. (and fast) [18:45:57] hoo@terbium:~$ du -h /var/log/wikidata/ [18:45:57] 1.3G /var/log/wikidata/ [18:45:59] and then update --init on pull [18:46:26] greg-g: 15m including a brand new branch l10n update [18:46:32] bd808|deploy: yay [18:46:36] whoa, included l10n [18:47:05] And I think I finally squashed the new branch l10n initialization problem [18:47:27] https://test.wikipedia.org/wiki/Special:Version on 1.23wmf17 and en l10n looks right [18:48:03] ori: speaking l10n, mind reviewing https://gerrit.wikimedia.org/r/#/c/116718/ ? would love to see trends there (if they exist :) ). [18:49:18] <^d> bd808|deploy: How much of a difference did removing tampa make a difference? [18:49:29] <^d> Redundant. [18:49:34] ^d: 20 minutes ish? [18:49:38] <^d> Heh [18:49:42] At least 10 [18:50:05] Full scap with new branch l10n took 15 minutes [18:50:27] so much about it being no difference [18:51:44] akosiaris: don't forget: https://gerrit.wikimedia.org/r/#/c/115323/ :p [18:56:19] RECOVERY - Disk space on terbium is OK: DISK OK [18:56:55] (PS3) Hoo man: Simplify the AbuseFilter configuration a little [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/114656 [18:57:13] !log lvresize -L +40G terbium/root, resize2fs /dev/mapper/terbium-root.
See RT #6984 [18:57:19] (03CR) 10BryanDavis: [C: 032] Simplify the AbuseFilter configuration a little [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/114656 (owner: 10Hoo man) [18:57:21] Logged the message, Master [19:02:47] greg-g: Does deploy go on normal schedule during a metrics meeting or do I wait? [19:03:45] bd808|deploy: go for it, opsen don't watch the metrics meeting ;) [19:03:57] (03PS2) 10Ottomata: Adding submodule update support to git::clone [operations/puppet] - 10https://gerrit.wikimedia.org/r/117229 [19:04:09] (03CR) 10Dzahn: [C: 032] Remove all pmtpa hosts from dsh scap-proxies group [operations/puppet] - 10https://gerrit.wikimedia.org/r/117244 (owner: 10BryanDavis) [19:04:31] (03PS3) 10Ottomata: Adding submodule update support to git::clone [operations/puppet] - 10https://gerrit.wikimedia.org/r/117229 [19:05:01] (03CR) 10Dzahn: [V: 032] "manual verify, looks like bd808 would need to be added to trusted users otherwise and it's just dsh files" [operations/puppet] - 10https://gerrit.wikimedia.org/r/117244 (owner: 10BryanDavis) [19:05:08] * bd808|deploy pokes zuul [19:05:09] (03CR) 10Ottomata: "Ok, done. --recurse-submodules only works on the fetch part of pull though, not on the merge. git submodule update still has to be calle" [operations/puppet] - 10https://gerrit.wikimedia.org/r/117229 (owner: 10Ottomata) [19:05:20] (03CR) 10Alexandros Kosiaris: [C: 04-1] sockpuppet: remove leftovers (031 comment) [operations/puppet] - 10https://gerrit.wikimedia.org/r/116936 (owner: 10Matanya) [19:05:37] bd808|deploy: done, see comment above [19:05:58] mutante: Cool. [19:06:07] it's a thing where jenkins doesn't do +2 , but no worries in that case it doesnt matter [19:06:21] we can fix that later though if you submit actual puppet changes [19:06:35] fixing akosiaris [19:06:37] Ah. I've submitted dozens of puppet changes [19:06:43] there's a file with a trusted user regex [19:07:05] hm, and jenkins verified them? 
then it's something else [19:07:12] maybe just super slow [19:07:24] but there were like 20 minutes [19:07:40] mutante: https://gerrit.wikimedia.org/r/#/q/owner:%22BryanDavis+%253Cbdavis%2540wikimedia.org%253E%22+project:operations/puppet,n,z [19:08:21] bd808|deploy: i see, so some of them it did and some humans have just overridden it [19:08:27] zuul seems to be all backed up too. [19:08:31] must be another issue then, ignore for the moment [19:09:09] mutante: Do you want to touch the missing file on those 3 hosts or should I? [19:09:41] bd808|deploy: but it wanted "db.php"? [19:09:47] as opposed to db-pmtpa.php [19:10:02] that's what was in the error you pasted at least [19:10:12] That's a red herring. db-pmtpa.php will fix it [19:10:22] ok, one sec [19:10:30] It's the magic of multiversion/het deploy [19:11:31] The php code asks for `getRealmSpecificFilename( "$wmfConfigDir/db.php" )` [19:11:44] !log touch /usr/local/apache/common-local/wmf-config/db-ptmpa.php on srv270,mw40,mw31 to fix Apache errors, details in change 117244 [19:11:52] Logged the message, Master [19:12:09] getRealmSpecificFilename looks for db-pmtpa.php first, if not found on disk it defaults to db.php [19:12:36] manybubbles: how busy are you with deploy prep? wanna chat about archiva with me for a sec?
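The realm fallback described at 19:12:09 can be sketched roughly as below. This is only an illustration in Python of the behaviour as stated in the channel, not the actual PHP from the multiversion code (which may check more candidates); the function name realm_specific_filename and its parameters are made up for the example. It also shows that the lookup is an exact filename match, so a file with a slightly different spelling is not picked up.

```python
import os
import tempfile


def realm_specific_filename(path, realm):
    """Return '<base>-<realm><ext>' if that file exists, else the plain path.

    Hypothetical helper mirroring the fallback described in the chat:
    look for db-pmtpa.php first, fall back to db.php if it is not on disk.
    """
    base, ext = os.path.splitext(path)
    candidate = f"{base}-{realm}{ext}"
    return candidate if os.path.exists(candidate) else path


with tempfile.TemporaryDirectory() as conf_dir:
    wanted = os.path.join(conf_dir, "db.php")

    # A file whose name does not match the realm exactly is ignored,
    # so the lookup still falls back to db.php.
    open(os.path.join(conf_dir, "db-ptmpa.php"), "w").close()
    print(os.path.basename(realm_specific_filename(wanted, "pmtpa")))

    # Once the correctly named file exists, it wins.
    open(os.path.join(conf_dir, "db-pmtpa.php"), "w").close()
    print(os.path.basename(realm_specific_filename(wanted, "pmtpa")))
```

With this shape, an error that names db.php does not necessarily mean db.php is the file to create: it is just the last candidate tried.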
[19:12:44] (03PS4) 10Matanya: sockpuppet: remove leftovers [operations/puppet] - 10https://gerrit.wikimedia.org/r/116936 [19:12:46] * bd808|deploy didn't know all this stuff a month ago [19:12:48] ottomata: chatting would be great [19:12:53] noot busy but jumpy [19:12:57] aye k [19:12:59] bd808|deploy: makes sense [19:13:07] well, i've got stuff working in a local install, and i want to set up in labs now [19:13:11] but, i want to set up like we will have in production [19:13:16] waits for the RECOVERY though [19:13:17] and i'm not exactly sure how to set up the archiva repos/proxies [19:13:37] k [19:13:38] like, i know we want itt pretty locked down, so it won't be a transparent proxy [19:13:39] right? [19:13:41] The gate-and-submit queue on zuul has not moved in at least 15 minutes [19:13:41] but, also [19:13:45] there are a LOT of artifacts [19:13:51] so, what, do we just populate it ahead of time? [19:13:53] hmm, there are also 7 access requests [19:13:57] i have a populated repository on my VM [19:14:00] and it's Thursday,, just sayig [19:14:03] i've used it to build both camus and kraken [19:14:07] we could just start with that? [19:14:14] for now we'll just upload what we need I guess [19:14:15] and then anytime we need new artifacts we add them manually somehow? [19:14:29] pretty much [19:14:34] we can upload them, I imagine [19:14:41] yeah, we can [19:14:53] but there are multiple files that ship with artifacts [19:14:55] poms, shas, etc. [19:14:58] various things [19:15:08] dunno if uploading a .jar manually would get all those things [19:15:12] proxying just gets everything [19:15:20] greg-g: Who do I bug about zuul being shitty when hashar's not here? [19:15:45] ok then, seems this is priority, cya later [19:15:50] hoo|away: /dev/mapper/terbium-root 77G 34G 39G 47% / [19:15:50] * bd808|deploy can't deploy without zuul working  [19:16:06] what the [19:16:15] Krinkle|detached: but he's gone, too. [19:16:19] ^d: Do you have skills to fix zuul being stuck? 
[19:16:36] so proxying is ok I think if we're ok with just using the hash code [19:16:36] <^d> Lemme see. [19:16:38] sha [19:16:50] bd808|deploy: the yet to be hired Test Infra Eng :/ [19:16:58] hm [19:17:03] hm [19:17:09] greg-g: heh. I have a code review to do for that position [19:17:37] dunno, though, i think the point is to not let the internet deploy things willy nilly [19:17:51] i know we would specify a sha for what we want to deploy [19:17:52] via git deploy [19:18:02] but, those deployed .jars woudl be built using the artifacts in archiva [19:18:13] 13:51 logmsgbot: hoo synchronized wmf-config/InitialiseSettings.php 'I62b3288' [19:18:16] :-( [19:18:17] i think the idea is to freeze what we need to build our stuff [19:18:19] and then manually add things when we need to [19:18:35] rather than letting archiva just download whatever it thinks it needs [19:18:35] right? [19:18:37] we're not just deploying our things [19:18:55] we've got a few things we're going to want to pull from central and deploy without rebuilding [19:18:58] yes [19:19:13] but we'd manually add them to archiva, rather than transparently proxying, right? [19:19:28] (03Abandoned) 10BryanDavis: Make an empty db-pmtpa.php file [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117239 (owner: 10BryanDavis) [19:19:30] so if we want to add them then we'll just add everything [19:19:53] at worst we can write a script to suck a thing out central [19:19:56] 22:58 logmsgbot: ebernhardson synchronized php-1.23wmf15/extensions/Flow/ [19:20:05] we'd have to make sure the sha is right there too, but that would work [19:20:05] can people please write what exactly they are doing? 
[19:20:06] thanks [19:20:21] greg-g: ^ [19:20:28] (03CR) 10Dzahn: "19:11 mutante: touch /usr/local/apache/common-local/wmf-config/db-ptmpa.php on srv270,mw40,mw31 to fix Apache errors, details in change 11" [operations/puppet] - 10https://gerrit.wikimedia.org/r/117244 (owner: 10BryanDavis) [19:20:42] right, the sha is going to be correct, but, i think the idea is that we don't trust the maven networks [19:20:54] like, if someone got wind that we were transparently proxying and installing [19:20:58] and they had access to maven servers [19:21:11] then they could change a jar and a sha behind our backs [19:21:11] etc. [19:21:14] but if we freeze all deps [19:21:18] <^d> bd808|deploy: I don't think it's zuul, zuul's running fine. [19:21:20] but we have to freeze the sha [19:21:21] <^d> logs look sane. [19:21:22] that is the point [19:21:27] <^d> I think it's jenkins. [19:21:30] we declare the sha we need [19:21:38] that is only done via git-deploy [19:21:42] grr… I suppose that's more likely [19:21:43] for things we are deploying [19:21:49] but not for things we are building [19:21:53] when building our own jars [19:21:57] we'll point maven at our archiva [19:21:59] odder: SAL isn't the place for paragraphs, the commit message is. Go to gerrit and search for that commit hash (the 'I62b3288' part, without quotes) [19:22:00] and it will use whatever is there [19:22:07] so, the sha on our jar will be frozen [19:22:11] ^d: did I miss your announcement about enabling CirrusSearch for Wikiquote? [19:22:12] odder: but yeah, a sentence is usually good :) [19:22:15] but if we transparently proxy, when building a new jar, who knows what will be includd in it [19:22:25] greg-g: I didn't mean that commit, I meant [19:22:32] 22:58 logmsgbot: ebernhardson synchronized php-1.23wmf15/extensions/Flow/ [19:22:37] that's quite cryptic [19:22:41] greg-g: ^d says that it's jenkins rather than zuul. :/ [19:22:43] <^d> odder: I probably should've made an announcement, yes. 
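The "freeze the sha" idea from the archiva discussion above could look something like the following sketch: keep a reviewed list of artifact checksums and refuse anything that is missing from it or does not match. This is an assumption about how such a check might be written, not what archiva or git-deploy actually do; the artifact name, the PINNED mapping and the verify_artifact helper are all hypothetical.

```python
import hashlib


def sha1_of(data):
    """Hex SHA-1 digest of a bytes object."""
    return hashlib.sha1(data).hexdigest()


def verify_artifact(name, data, pinned):
    """Accept an artifact only if its checksum matches the pinned one.

    `pinned` maps artifact file names to the SHA-1 recorded when the
    dependency was reviewed and added by hand (hypothetical structure).
    """
    expected = pinned.get(name)
    if expected is None:
        raise KeyError(f"{name} is not in the pinned dependency list")
    actual = sha1_of(data)
    if actual != expected:
        raise ValueError(f"{name}: expected {expected}, got {actual}")
    return True


# Made-up artifact contents standing in for a reviewed jar.
reviewed_jar = b"bytes of the jar that was reviewed"
PINNED = {"example-lib-1.0.jar": sha1_of(reviewed_jar)}

print(verify_artifact("example-lib-1.0.jar", reviewed_jar, PINNED))

try:
    # Same name, different bytes: what a tampered upstream would look like.
    verify_artifact("example-lib-1.0.jar", b"tampered bytes", PINNED)
except ValueError as err:
    print("rejected:", err)
```

The point of the design is that the pinned list lives under review on our side, so a jar and its published checksum changing together upstream still fails the comparison.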
[19:23:00] odder: that's an auto message [19:23:03] ^d: No worries, I'll add it to the Monday's issue of Tech News [19:23:24] 00:17 logmsgbot: ori synchronized php-1.23wmf16/extensions/CentralNotice 'Update CentralNotice to tip of wmf_deploy for I7d8259fc4' [19:23:24] <^d> Oh cool. Yeah, it's available as a Beta Feature for all wikiquotes now. [19:23:28] it's logging, sometimes cryptic/not made for non-inductees :) [19:23:42] greg-g: so it's possible to actually write what the synchro is about [19:23:49] https://integration.wikimedia.org/ci/ is brain dead as far as I can tell [19:23:51] bd808|deploy: just that the Apaches don't recover yet... [19:24:02] mutante|away: I see that... [19:24:22] odder: I see, yeah, ideally people would, but somtimes it is redudant or just "doing the flow update" I hear ya, but tell me how to phrase that without getting the deployers annoyed :) [19:24:56] even a hash would be fine, give you direction for more info if you need it [19:25:00] gives* [19:26:01] mutante|away: Needs "db-pmtpa.php"; got "db-ptmpa.php". Probably my fault in the comment [19:26:03] * greg-g nods [19:26:17] odder: so making sure the hash is included inthe output, no matter what [19:26:41] I thought we had something like that in place, or tried, but there's also the issue of security patches that are applied on production before they're made public in gerrit... [19:27:00] every rule has an exception... [19:27:00] bd808|deploy: duuh, blind copy/paste :P fixing [19:27:24] read "pmtpa" one too many times [19:27:50] * aude back [19:30:24] odder: bz email/username? 
[19:30:30] (03Merged) 10jenkins-bot: Simplify the AbuseFilter configuration a little [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/114656 (owner: 10Hoo man) [19:30:36] * greg-g is reporting bug to have it not be forgotten [19:32:01] !log ms-be1005 down for update [19:32:04] * bd808|deploy kicks jenkins again and curses the beta-update-databases job that's hogging all the runners [19:32:10] Logged the message, Master [19:32:34] we need more runners! [19:32:36] and some day we should also fix this [19:32:37] /etc/init.d/apache2: 55: [: nice: unexpected operator [19:32:39] sigh:p [19:32:40] so zuul is broken? [19:33:01] bd808|deploy: that also explains what i said about jenkins verify, so completely ignore [19:33:04] aude: zuul's fine, jenkins is not being very fair [19:33:09] :( [19:33:16] i also fixed permissions on those db files, mwdeploy:mwdeploy [19:33:28] PROBLEM - Host ms-be1005 is DOWN: PING CRITICAL - Packet loss = 100% [19:33:37] and now i'm really off [19:33:50] * odder waves mutante|away good-off [19:34:02] (03CR) 10Jforrester: [C: 031] Enable VisualEditor by default on French Wikiversity [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117238 (owner: 10Jforrester) [19:34:09] !google mutante's time zone [19:34:20] 24/7 odder [19:34:33] zulu, then? :-)) [19:36:21] !log bd808 synchronized wmf-config/abusefilter.php 'Simplify the AbuseFilter configuration Ia472af8' [19:36:23] wow, took almost an hour for jenkins to merge my patch [19:36:29] Logged the message, Master [19:36:34] 55 min [19:36:58] aude: 30 minutes for one tiny file in https://gerrit.wikimedia.org/r/#/c/114656/ [19:37:05] aude: Yeah, jenkins is very broken. :-( [19:37:10] :( [19:37:28] At least it /is/ passing things now. [19:37:40] (03PS2) 10BryanDavis: Wikipedias to 1.23wmf16 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117222 [19:37:41] * aude doesn't know quite enough to fix it [19:38:15] bd808|deploy: Having fun? 
:-( [19:38:27] aude: It's just got a job that spawns more jobs running right now that's hogging many of the runners [19:38:34] James_F: Super fun! [19:38:48] RECOVERY - Host ms-be1005 is UP: PING WARNING - Packet loss = 64%, RTA = 0.24 ms [19:39:43] (03PS1) 10Nemo bis: Add cron job to run characterEditStats.php on multilingual wikis weekly [operations/puppet] - 10https://gerrit.wikimedia.org/r/117250 [19:40:07] (03CR) 10jenkins-bot: [V: 04-1] Add cron job to run characterEditStats.php on multilingual wikis weekly [operations/puppet] - 10https://gerrit.wikimedia.org/r/117250 (owner: 10Nemo bis) [19:40:33] It's Thursday at 11:40, surely the site is down by now? [19:40:48] * rdwrer knocks on wood [19:40:55] * greg-g kicks rdwrer  [19:41:13] James_F: Tonight is The Night [19:41:45] James_F: The Night you add VE items to Tech News :-)) [19:41:50] (03PS3) 10Chad: Elasticsearch upgrade starting [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117095 (owner: 10Manybubbles) [19:41:52] (03PS1) 10Chad: Elasticsearch upgrade ending [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117251 [19:42:10] odder: It's not remotely "night" here – it's not even noon. :-) [19:42:18] odder: But yes. Lots to write about this time. 
[19:42:23] (03CR) 10BryanDavis: [C: 032] Wikipedias to 1.23wmf16 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117222 (owner: 10BryanDavis) [19:42:58] James_F: doesn't matter -- you can add it during your night :-) [19:43:11] (03Merged) 10jenkins-bot: Wikipedias to 1.23wmf16 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117222 (owner: 10BryanDavis) [19:43:26] :-P [19:43:37] (03CR) 10Manybubbles: [C: 04-1] Elasticsearch upgrade ending (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117251 (owner: 10Chad) [19:43:58] !log bd808 rebuilt wikiversions.cdb and synchronized wikiversions files: wikipedias to 1.23wmf16 [19:44:05] Logged the message, Master [19:45:33] (03CR) 10Chad: Elasticsearch upgrade ending (031 comment) [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117251 (owner: 10Chad) [19:46:03] (03PS2) 10Chad: Elasticsearch upgrade ending [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117251 [19:47:25] (03CR) 10Manybubbles: [C: 031] Elasticsearch upgrade ending [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117251 (owner: 10Chad) [19:47:55] !log Touched wmf-config/PoolCounterSettings-pmtpa.php and jobqueue-pmtpa.php as more follow up to I332c0e8 [19:48:04] Logged the message, Master [19:48:26] !log on mw31, mw40 and srv270 [19:48:34] Logged the message, Master [19:52:46] greg-g: All wikis on 1.23wmf16 LGTM. Still some dumb problems with the 3 pmtpa hosts that got synced with the change that dropped pmtpa multiversion config. 
[19:53:50] huh, neat [19:53:51] :) [19:54:09] (03PS2) 10BryanDavis: Group0 wikis to 1.23wmf17 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117223 [19:57:33] (03PS4) 10Ottomata: [WIP] Adding archiva module [operations/puppet] - 10https://gerrit.wikimedia.org/r/117024 [19:58:41] (03CR) 10BryanDavis: [C: 032] Group0 wikis to 1.23wmf17 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117223 (owner: 10BryanDavis) [20:00:59] (03CR) 10BryanDavis: "poke" [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117223 (owner: 10BryanDavis) [20:01:48] (03Merged) 10jenkins-bot: Group0 wikis to 1.23wmf17 [operations/mediawiki-config] - 10https://gerrit.wikimedia.org/r/117223 (owner: 10BryanDavis) [20:03:29] * bd808|deploy wonders what zuul and jenkins are doing this time. [20:03:51] !log bd808 rebuilt wikiversions.cdb and synchronized wikiversions files: group0 wikis to 1.23wmf17 [20:04:00] Logged the message, Master [20:06:33] (03PS2) 10Nemo bis: Add cron job to run characterEditStats.php on multilingual wikis weekly [operations/puppet] - 10https://gerrit.wikimedia.org/r/117250 [20:11:34] bd808|deploy: greg-g hoo doesn't need to be right now, but we like https://gerrit.wikimedia.org/r/#/c/117306/ deployed [20:11:49] yep [20:11:52] we missed putting that in our branch [20:11:56] whenever there's a spot, we can jump in [20:11:58] * aude just back from holiday today [20:12:21] I cherry picked one thing but forgot that one :P [20:13:22] otherwise, test wikidata looks fine as does wikidata stuff on test2 [20:14:00] aude: I'm done so if greg-g is cool with it you can do it now [20:14:13] greg-g: also; {{done}} [20:14:56] bd808|deploy: change your name :) [20:15:04] nick* [20:15:19] There's still some crap from srv270, mw40 & mw31 that I'm going to fix by putting back the files that got removed [20:15:22] (03CR) 10Nemo bis: "Is Translate available on fenari?" 
[operations/puppet] - 10https://gerrit.wikimedia.org/r/117250 (owner: 10Nemo bis) [20:15:40] aude: yeppers, you're all clear if you want to do it now [20:15:49] ok :) [20:15:53] aude: Will do it [20:16:04] am just verifying stuff again as I didn't CR the wikidata change [20:16:38] hoo: ok, go ahead [20:16:39] ok the commits match up :) [20:21:28] RECOVERY - Apache HTTP on srv270 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.234 second response time [20:21:38] RECOVERY - Apache HTTP on mw31 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.436 second response time [20:22:19] RECOVERY - Apache HTTP on mw40 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 808 bytes in 0.278 second response time [20:23:02] !log Restored pre Ic56177a versions of wmf-config/*pmtpa* config files to srv270, mw31 and mw40 [20:23:10] Logged the message, Master [20:23:20] mh, what's up with jenkins? [20:23:52] hoo: jenkins and zuul have been kinda slow today. Big job backlog apparently [20:24:01] mh [20:24:10] ah, we're done [20:28:17] !log hoo synchronized php-1.23wmf17/extensions/Wikidata/ 'Update Wikidata to fix an uncaught exception in claim html formatting' [20:28:26] Logged the message, Master [20:28:42] yay [20:29:00] got a couple of rsync errors from snapshot hosts [20:29:15] bd808: ^ [20:29:28] snapshot1, snapshot2, snapshot3, snapshot4 [20:29:48] Hmmm… those should be gone now I thought [20:31:37] !log Syncing to snapshot[1-4] failed [20:31:46] Logged the message, Master [20:33:26] hoo: I see them in the mediawiki-installation dsh group but I'd swear that aper.gos pulled them. I'll check the git history. They may have accidentally put back [20:33:50] bd808: ok... git rebase at times does [20:34:06] we had something like that with wikibase recently [20:34:48] PROBLEM - Disk space on snapshot1003 is CRITICAL: DISK CRITICAL - free space: / 1054 MB (3% inode=69%): [20:35:00] hoo: Yeah. 
SUL show me complaining about those hosts 3 weeks ago [20:35:22] And I remember that aper.gos removed them right after [20:35:46] I'll dig up the patch and make a new one that gets rid of them again. [20:35:54] But now… lunch [20:58:08] PROBLEM - Disk space on snapshot1001 is CRITICAL: DISK CRITICAL - free space: / 1033 MB (3% inode=69%): [21:02:54] greg-g: ping [21:03:25] hi [21:03:31] Language cache problems or smth? [21:03:33] greg-g: Nemo_bis found an l10n bug in wmf17 -- https://www.mediawiki.org/wiki/Special:Preferences#mw-prefsection-betafeatures [21:03:39] [MatmaRex] [21:04:09] [21:58] Did I add i18n for sp-contributions-newonly correctly? :/ [21:04:09] [21:59] It looks ok here http://deployment.wikimedia.beta.wmflabs.org/wiki/Special:Contributions/108.32.45.153 [21:04:10] [21:59] but broken here https://www.mediawiki.org/wiki/Special:Contributions/PiRSquared17 [21:04:21] greg-g: Requesting permission to scap again to see if that fixes it [21:04:23] [21:04:45] bd808: oooh, samesies. [21:05:01] this might be the same thing as huh's contributions issue. [21:05:17] huh, MatmaRex: yeah. Looks like my l10n "fix" for new branches isn't fixed yet :/ [21:05:19] (or ULS is broken, again) [21:10:07] !log bd808 Started scap: php-1.23wmf17 l10n cache rebuild [21:10:15] Logged the message, Master [21:12:26] bd808: good, don't let me block on that [21:13:08] Gah. 
all 366 wmf17 l10n files touched [21:13:10] thanks for working on it [21:13:11] !log all imagemagick-related packages updated to latest on image/video scalers [21:13:18] Logged the message, Master [21:13:34] Nemo_bis, huh: the real bug here is https://bugzilla.wikimedia.org/show_bug.cgi?id=51174 [21:13:37] (03PS1) 10Tim Landscheidt: ldaplist: Switch to new servicegroups structure [operations/puppet] - 10https://gerrit.wikimedia.org/r/117313 [21:13:44] Which I've been swatting at for several weeks now [21:14:03] But it's a "once a week" problem so it's hard to iron out [21:14:35] ok [21:14:41] I did capture the bad json files this time so maybe that will help narrow it down [21:19:48] (03PS1) 10Tim Landscheidt: ldaplist: Fix typo [operations/puppet] - 10https://gerrit.wikimedia.org/r/117314 [21:21:27] !log bd808 Finished scap: php-1.23wmf17 l10n cache rebuild (duration: 11m 20s) [21:21:36] Logged the message, Master [21:22:04] Nemo_bis, huh: Better now? [21:22:24] * bd808 sees the one Nemo_bis pointed out looks better [21:22:56] Yes. [21:22:58] Thanks. [21:23:06] !log "No space left on device" errors from snapshot1004.eqiad.wmnet during scap [21:23:13] Logged the message, Master [21:23:23] huh: You're welcome. Thanks for pointing it out [21:23:46] greg-g: second scap seems to have fixed l10n again. [21:30:28] 13:20 <+icinga-wm> PROBLEM - Disk space on snapshot1004 is CRITICAL: DISK CRITICAL - free space: / 1055 MB (3% inode=68%): [21:30:31] 13:20 <+icinga-wm> PROBLEM - Disk space on terbium is CRITICAL: DISK CRITICAL - free space: / 1086 MB (3% inode=75%): [21:30:34] :/ [21:30:38] from a while ago [21:30:49] greg-g: terbium already fixed by akosiaris_away [21:30:53] * greg-g nods [21:30:56] cool [21:31:13] LVM and free extents, so it didnt take long, yea, was cool [21:31:24] was in RT [21:31:33] * greg-g nods [21:32:05] reported by hoo #6984. but .. 
snapshot probably not yet [21:32:57] mutante|away: I fixed the errors from the pmtpa hosts by copying the 4 *-pmtpa.php config files that Sam had removed back to them. [21:33:44] The touch-a-file fix got past that error to the next and kept going... [21:34:05] But they seem happy now. Icinga alerts cleared [21:35:18] bd808: great, thanks for the update [21:35:37] * bd808 logged to SAL too [21:36:10] :) nice (but no guarantees about the Twitter bridge, heh) [21:36:20] i think it's still down [21:55:02] (03PS1) 10BryanDavis: Re-remove snapshot[1234] from mediawiki-installation dsh group [operations/puppet] - 10https://gerrit.wikimedia.org/r/117326 [21:59:26] (03CR) 10Hoo man: [C: 031] Re-remove snapshot[1234] from mediawiki-installation dsh group [operations/puppet] - 10https://gerrit.wikimedia.org/r/117326 (owner: 10BryanDavis) [21:59:48] bd808: :) I'd approve it, if I could [22:00:11] hoo: It will get handled. ;) [22:01:28] sure :) [22:09:12] greg-g: What up? [22:10:09] not much, you? [22:10:12] :) [22:10:29] I think you might be referring to my ping about who to talk to when Zuul is misbehaving [22:10:35] ...and antoine's not around. [22:11:50] anyone know what would cause the issue would be on a labs (non tools) instance where you can connect to the instance, but it attempts to create your home directory and failes 'Creating directory '/home/jamesur'. Unable to create and initialize directory '/home/jamesur' ' (then sends the welcome message and closes the connection) [22:12:19] I'm not sure why it's even attempting to create the home directory... I've been on the instance before (though not for a while) and the webservices on it are running fine... [22:12:40] * greg-g points jamesofur over to #wikimedia-labs [22:12:41] :P [22:12:50] yeah... 
it was actually supposed to be typed there [22:12:51] :P [22:12:56] nto sure why this room was open instead ;) [22:13:02] jamesofur: hi anyways [22:13:02] * hoo whispers nfs [22:13:12] though here I'll say 'hmm legalteam.wikimedia.org appears to be created but it now redirects to wmfWiki ...' ;) [22:13:39] <^d> Zuul is fine. [22:13:52] <^d> I'm pretty sure it's jenkins. [22:13:57] <^d> gallium's pretty bogged down [22:14:00] yeah [22:14:01] <^d> greg-g: ^ [22:14:15] Krinkle: just FYI at this point ^ [22:14:47] greg-g: OK, do things appear back to normal now? [22:14:58] <^d> I doubt it. [22:15:15] well, all the runners are bogged down with... what job was it? [22:15:23] that spawned a ton of sub-jobs that are eating up all the time [22:15:45] <^d> 21252 jenkins 20 0 6935m 2.6g 10m S 705 33.8 1274:35 java [22:15:57] <^d> Some job's holding it up [22:15:59] THe job was beta-database- [22:17:11] * bd808 waits and waits and waits for the jenkins home screen to load to see if it's still there [22:18:20] <^d> I'm going to kick jenkins. [22:18:30] <^d> I can't think of anything else to do and this thing is totally hung up. [22:18:50] <^d> Yay? Nay? Jfdi? [22:18:55] jfdi [22:19:24] And bring up 4 new jenkins master to split the job catalog across [22:19:40] And a lot more job runners [22:19:54] <^d> "The jenkins init script can only be run as root" [22:19:56] <^d> Fannnntastic [22:20:25] I think I had more job runners for the ~75 jobs I had at $DAYJOB-1 [22:20:31] :( [22:20:33] <^d> Oh duh [22:20:55] <^d> !log restarting jenkins on gallium. It's totally hung and nothing's getting done. Jobs will probably need retriggering. 
[22:21:03] Logged the message, Master [22:21:07] /usr/bin/nodejs /srv/ssd/jenkins-slave/workspace/parsoidsvc-parsertests-run-harder/parsoid/tests/../tests/mockAPI.js [22:21:18] ^ in case it had something to do with hanging on parsoid tests [22:21:22] could also be random though [22:21:30] last thing it was doing though [22:24:09] <^d> mutante: Could you peek at apache's access log on gallium? Just wanna make sure nothing insane was happening. [22:26:00] ^d: it's empty :p [22:26:17] ah, looking closer [22:28:59] now it would help to know how this usually looks, but nothing really obvious to me yet [22:29:15] <^d> Meh, it's probably nothing. [22:29:15] integration_access.log that is [22:29:22] <^d> I *think* it's slowly catching up. [22:29:26] <^d> But it's still a mess. [22:29:31] we also have quit_access.log , doc_access.log ... [22:31:47] <^d> Ok, there it goes. [22:31:47] <^d> Finally getting unclogged methinks. [22:34:05] looked at wrong logs. gallium very slow though [22:35:16] <^d> It's finally churning through the jobs. [22:35:22] <^d> cpu usage looks like its going down [22:35:25] 423M /var/log/jenkins/access.log [22:35:41] a bit much for vi [22:36:22] <^d> heh [22:37:05] nah [22:37:08] vi can take it [22:38:03] yoyoyo manybubbles, whatcha think of merging those thangs? [22:38:14] if i merged them now I could run an errand and be back in an hour [22:40:48] <^d> Ok, seems mostly fixed. [22:48:18] <^d> CPU only at about 15-20% usage now on gallium. [22:59:36] (03CR) 10Chad: [C: 031] "Let's go ahead with this one now so it'll get out in time." [operations/puppet] - 10https://gerrit.wikimedia.org/r/117096 (owner: 10Manybubbles) [22:59:52] <^d> ottomata: ^ [23:00:20] ok we ready? [23:00:22] for that? [23:00:26] even though manybubbles might not be around? [23:00:43] <^d> He's finishing dinner with his fam, should be back shorlt.y [23:00:43] ^d? [23:00:46] ok cool [23:00:46] <^d> That one's fine. [23:00:48] k [23:00:50] <^d> The other let's wait for him. 
[23:00:53] (PS2) Manybubbles: Remove cirrus jobs from priority list [operations/puppet] - https://gerrit.wikimedia.org/r/117096
[23:01:00] (CR) Ottomata: [C: 2 V: 2] Remove cirrus jobs from priority list [operations/puppet] - https://gerrit.wikimedia.org/r/117096 (owner: Manybubbles)
[23:01:13] ok
[23:02:45] <^d> All that does is start deprioritizing our jobs a bit. Which is fine since we're going to stop giving it jobs in ~58m
[23:02:59] aye k
[23:05:28] greg-g: See my e-mail; I can fix up the BF copy if needed (and rdwrer offered to deploy if we need it), but interested in your thoughts.
[23:07:01] BF copy?
[23:07:11] Nemo_bis: Beta Features.
[23:07:16] I know that
[23:07:24] * Nemo_bis imagines James_F cloning boyfriends
[23:07:33] oh well, to bed
[23:07:37] Nemo_bis: That'd be great; they could help build a better VE. :-)
[23:07:43] Nemo_bis: It's incoherent non-English and needs fixing. :-)
[23:18:47] James_F: yep
[23:31:47] James_F: Re above, should I be looking at a patch or something?
[23:34:15] greg-g, rdwrer: Writing now.
[23:35:44] Kay
[23:39:19] manybubbles: HIiiiiiii?
[23:41:03] greg-g, rdwrer: Feature flag and language done in https://gerrit.wikimedia.org/r/#/c/117212/ and https://gerrit.wikimedia.org/r/#/c/117341/
[23:41:27] greg-g: I'm happy for this to go out, however; should I also do the config change to enable the Feature Flag for all wikis?
[23:42:41] <^d> ottomata: We do kind of need him :p
[23:44:10] yeah! ha, i was hoping to merge this thang and get you guys settled, and then go run an errand and be back
[23:44:11] hmm
[23:45:19] <^d> Lemme review it really carefully.
[23:45:19] (PS1) Jforrester: Enable ULS 'compact language links' Beta Feature on all normal wikis [operations/mediawiki-config] - https://gerrit.wikimedia.org/r/117344
[23:45:24] greg-g: ^^^ done.
[23:45:25] <^d> He said it was fine to merge.
[23:45:45] greg-g: Also, can I abuse rdwrer's doing a deploy to also push the JS breakage fix for VE, if he's content?
[23:46:52] (CR) Chad: [C: 1] "Let's merge." [operations/puppet] - https://gerrit.wikimedia.org/r/117037 (owner: Manybubbles)
[23:46:59] <^d> ottomata: Go ahead and merge that.
[23:47:06] <^d> Then you're free.
[23:47:23] ^d we are on lucene now?
[23:47:27] (PS2) Manybubbles: Update Elasticsearch monitoring for 1.0 [operations/puppet] - https://gerrit.wikimedia.org/r/117037
[23:47:31] (temporarily)
[23:47:34] (CR) Ottomata: [C: 2 V: 2] Update Elasticsearch monitoring for 1.0 [operations/puppet] - https://gerrit.wikimedia.org/r/117037 (owner: Manybubbles)
[23:47:36] <^d> aude: Not yet, soon.
[23:47:52] ok, ^d i should probably run puppet on es nodes, ja?
[23:48:14] ok
[23:48:16] James_F: yeah, that one should be done, too
[23:49:11] <^d> ottomata: Nah, Nik or I can handle that as needed. Long as it's through on puppet master we're good.
[23:49:33] greg-g: OK. rdwrer: You up for it?
[23:50:23] oh ha, am running :)
[23:50:26] should be ok, i mean we merged it
[23:50:28] and it's just monitoring
[23:50:41] ok cool, ^d, anything else right ?
[23:50:42] now?
[23:50:59] <^d> Nah, I think we're done for now.
[23:51:00] <^d> Thanks!
[23:51:16] back
[23:52:02] merged both puppet patches?
[23:52:08] i see
[23:52:11] <^d> Yep.
[23:52:13] perfect
[23:53:23] rdwrer: So… config: https://gerrit.wikimedia.org/r/#/c/117344/ ULS: [to wmf17] https://gerrit.wikimedia.org/r/#/c/117345/ and https://gerrit.wikimedia.org/r/#/c/117346/ VE: [to wmf16] https://gerrit.wikimedia.org/r/#/c/117230/ [to wmf17] https://gerrit.wikimedia.org/r/#/c/117228/
[23:53:36] rdwrer: Don'cha love me?
[23:55:18] <^d> manybubbles: Ok, I think we're all ready to go in 5m.
[23:55:46] (PS1) coren: Labs: Disable FSC on all NFS mounts [operations/puppet] - https://gerrit.wikimedia.org/r/117347
[23:56:08] James_F: Any chance you've done the submodules too?
:P
[23:57:02] ^d: I'm ready. We can start when greg-g gives us the conch.
[23:57:51] manybubbles: go forth, you need config changes right now
[23:57:52] ?
[23:58:18] basically, should I stop mw deploys while you do certain parts?
[23:58:31] manybubbles: i was going to run an errand now, you ok with me doing that?
[23:58:38] rdwrer: Sorry, I can't do that for you 'cos I don't have deploy rights, sorry.
[23:58:38] i will be back within an hour, probably les
[23:58:38] ottomata: cool
[23:58:39] less
[23:58:44] ok, cool
[23:58:45] greg-g: I'll send a small sync
[23:58:47] rdwrer: Maybe I should just get deploy access…
[23:58:48] back laters
[23:59:43] alright, the order: manybubbles does his small sync, then rdwrer does the VE bit, then jgonera and kaldari do their bit
[23:59:57] James_F: So that you can break the site yourself? Go ahead :D :)