[00:21:57] 10Cloud-VPS (Project-requests), 10Reading List Service, 10Reading-Infrastructure-Team-Backlog: Request creation of reading-lists VPS project - https://phabricator.wikimedia.org/T178110#3695567 (10Tgr) [00:27:23] 10Data-Services, 10cloud-services-team (FY2017-18), 10DBA, 10Goal: Migrate all users to new Wiki Replica cluster and decommission old hardware - https://phabricator.wikimedia.org/T142807#3695569 (10bd808) [00:33:34] 10Data-Services, 10cloud-services-team (FY2017-18), 10DBA, 10Goal: Migrate all users to new Wiki Replica cluster and decommission old hardware - https://phabricator.wikimedia.org/T142807#3695570 (10bd808) [00:39:11] bd808: There appears to be a disk problem on my instance. [00:39:26] what's the problem? [00:39:29] cyberbot-exec-iabot-01 should have 120GB of available disk space [00:39:50] When I ran GZIp on my DB dump, I got a "Out of disk space" error [00:40:11] But when I ran du on the entire VM, I see only 14.6GB of actual data. [00:40:26] Am I missing something [00:40:34] Cyberpower678: do you have the role applied that allocates the extra disk? Are you gzipping to the /srv partition where that disk is mounted? [00:40:55] No clue [00:41:03] I was gzipping to my home directory [00:41:21] https://wikitech.wikimedia.org/wiki/Help:Adding_Disk_Space [00:41:43] your $HOME is alwasy going to be on the default system disk [00:42:00] I see. [00:42:11] to get the quota allowed space beyond the system disk you need to use that Puppet role [00:44:10] bd808: I'm adding the roles. [00:44:36] bd808: mutante has helped a lot in understanding how puppet works. :D [00:44:49] he's a pretty awesome guy :) [00:46:41] :-) [00:49:03] 10Data-Services, 10cloud-services-team (FY2017-18), 10DBA, 10Goal: Migrate all users to new Wiki Replica cluster and decommission old hardware - https://phabricator.wikimedia.org/T142807#3695588 (10bd808) [00:49:04] 10Data-Services, 10cloud-services-team (Kanban), 10DBA, 10User-bd808: Create and announce timeline for shutting down labsdb100[13] - https://phabricator.wikimedia.org/T175086#3695586 (10bd808) 05Open>03Resolved Announced at https://lists.wikimedia.org/pipermail/cloud-announce/2017-October/000005.html [00:50:19] bd808: I added the role and ran puppet agent, but no new volume [00:50:28] At least none the size of 120 GB [00:50:38] what's the instance name: [00:50:40] ? [00:50:57] cyberbot-exec-iabot-01 [00:51:33] I only see a lost+found [00:51:40] /dev/mapper/vd-second--local--disk 140G 60M 132G 1% /srv [00:52:22] Oh oops. I queried with the wrong command. [00:52:26] So /srv is a 140G partition with 132G of usable space [00:52:52] is the partition call "lost+found"? [00:53:16] no, that is a unix directory that is used to track damaged files [00:53:24] the partition is /srv [00:53:29] Oh [00:53:37] Oh cool. [00:54:04] there is a /lost+found directory too. It's normal stuff [00:55:06] Okay [00:56:46] (03PS165) 10Ricordisamoa: Initial commit [labs/tools/wikidata-slicer] - 10https://gerrit.wikimedia.org/r/241296 [00:58:37] * Cyberpower678 tries gzip again [01:01:38] (03CR) 10Ricordisamoa: [C: 04-2] "PS165 uses the 'hidden' class to hide the alert div on the save page" [labs/tools/wikidata-slicer] - 10https://gerrit.wikimedia.org/r/241296 (owner: 10Ricordisamoa) [01:04:45] Cyberpower678: basically any root directory of an ext4 filesystem, be default, has a lost+found directory [01:05:13] zhuyifei1999_: so how much space do the VMs have without SRV [01:05:18] fsck can put files in it when it files lone inodes [01:05:35] by default / has around 20 gigabytes [01:05:38] when it what? [01:05:58] all other disk space are unallocated [01:06:08] https://wikitech.wikimedia.org/wiki/Help:Adding_Disk_Space [01:06:33] zhuyifei1999_: I saw that but "fsck can put files in it when it files lone inodes" [01:06:43] What's files lone inodes? [01:06:52] you can create new partitions on it and mount it (which that srv puppet role does automatically) or extend / [01:06:54] uh [01:07:07] basically files are three parts [01:07:11] 1. the data [01:07:55] 2. the inode, marking where the data are stored, the attributes (eg owner, access time, etc.) [01:08:03] 3. the directory listing [01:08:37] the file name is not saved in the inode, but in the directory listing [01:08:45] this is why you can have hard links [01:09:04] which are basically two directory listings pointing to the same inode [01:09:55] zhuyifei1999_: so how do I extend /? [01:10:03] Instead of mount srv? [01:10:07] if there is a file (an inode), but there is no directory listing pointing to the inode, no one knows what the inode's filename is [01:11:02] and the file will not be easily accessible. fsck can find such inodes and put it under lost+found directory of the corresponding filesystem [01:11:15] https://wikitech.wikimedia.org/wiki/Help:Resize_root_partition [01:11:21] ^ to extend / [01:11:45] but beware it may not be well-supported [01:12:19] there is no puppet role to do it automatically [01:12:36] uhhhh [01:12:43] I think I'll using symlinks [01:13:41] huh? [01:15:03] zhuyifei1999_: in regards to my new DB instance [01:15:13] I need it for the disk space and processor [01:16:12] I mean idk what you mean by using symlinks. do you mean moving the mariadb storage into /srv and symlink it? [01:16:23] But I would take it all of the DB stuff will safe to the root disk and not the srv mount [01:16:31] Yea [01:16:43] *save [01:16:52] yeah. that's the best thing to do [01:16:58] https://github.com/wikimedia/puppet/blob/production/modules/quarry/manifests/database.pp#L6 [01:17:12] oh nice [01:17:29] zhuyifei1999_: mutante setup my own roles, which gave me insights on how to write them in the first place. :D [01:18:09] k [01:18:10] zhuyifei1999_: Oh nice. Great idea. Thanks for the advice. [01:18:13] np [01:18:34] idk how well that 'mysql' module works with mariadb though [01:18:49] https://phabricator.wikimedia.org/T165625 [01:33:35] Cyberpower678: so, what path do you want to use as data dir? [01:33:44] /srv/mysql/data sounds ok>? [01:33:53] Yes please [01:34:23] and you want it to listen on all interfaces, right [01:34:27] Yes [01:38:42] Cyberpower678: what is the current data dir , do you know? [01:38:57] No I do not. We just installed it. [01:39:34] Documentation says it's /var/lib/mysql [01:39:50] ok, and it's owned by mysql:mysql , right [01:41:37] change coming [01:43:34] unfortunately we dont see that here [01:44:30] mutante: yes they all belong to mysql [01:44:40] Cyberpower678: https://gerrit.wikimedia.org/r/#/c/385115/2/modules/profile/manifests/cyberbot/db.pp [01:46:03] So how do these changes get applied to the already set up instance? [01:46:13] Reboot, reinstall, etc? [01:46:32] puppet agent [01:46:39] That's it? [01:46:45] yeah [01:46:50] Cool [01:47:08] the agent will apply all the changes [01:47:35] Cyberpower678: and now .. you can run it [01:48:12] bd808: since everyone has to update their URIs now anyway, how hard would it be to set up aliases like enwiki.web.db.toolforge? [01:48:21] Cyberpower678: it should create /srv/mysql/data first [01:48:31] enwiki.web.db.svc.eqiad.wmflabs is super cryptic [01:49:17] mutante: running puppet [01:49:27] Toolforge isn’t right. These dbs are for all of Cloud [01:50:39] are you going to even remove "labs" from DNS at some point in the future? [01:51:06] We talked about switching wmflabs for wmcloud but chase wasn’t ready to commit to a zone naming scheme [01:51:14] gotcha [01:52:39] We are waiting for the wildcard letsencrypt certs to introduce the Toolforge.org and wmcloud.org TLDs [01:53:31] oh, you bought those? aha [01:54:08] that makes sense for external domains [01:54:21] but the internals don't use TLS anyway, do they? [01:54:25] puppet seems to have hung up, mut [01:54:28] mutante: [01:55:16] bd808: ^ [01:55:31] Cyberpower678: what do you mean by "hung up" [01:55:48] i have never seen that happen ..like ever [01:55:49] It's not outputting a thing, but it's clearly running [01:55:54] either an error or finishing [01:56:03] Tgr: we want a sane service naming scheme. There are other things we will be using the .svc.eqiad.wmflabs domain for. [01:56:49] And there will “soon” be more zones that need differentiation [01:56:59] Cyberpower678: remind me of the instance name please [01:57:01] mutante: It's not doing anything [01:57:07] cyberbot-db-01 [01:57:17] Soon being relative to the heat death of the universe ;) [01:58:06] mutante: can I kill it? [01:58:09] Cyberpower678: cant confirm, it's running [01:58:12] and applying all the changes [01:58:18] works as expected :) [01:58:23] it just created your data dir [01:58:29] and changed your mysql config [01:58:42] Cyberpower678: no, don't kill it [01:58:43] My CLI is still blocked [01:58:53] please paste your commandline [01:58:54] I totally understand both of the why-not questions though. They are things I asked myself [01:59:20] Cyberpower678: if you were already running puppet, i wouldnt have been to able to do it :) [01:59:48] killed it for you, heh [02:00:11] Confirmed it's killed [02:00:22] Cyberpower678: see how you have /srv/mysql/data now? [02:00:44] Yes. :-) [02:00:54] Cyberpower678: and the datadir in /etc/mysql/my.cnf is set to that [02:01:05] nothing was manual [02:01:39] well, i think we have to restart it [02:01:42] and that fails :) [02:01:51] looks why [02:05:54] Cyberpower678: sigh, that existing mysql server class has an issue .. i am looking and cleaned up everyhting.. hold on [02:06:13] Depends: mariadb-server-5.5 (= 5.5.34+maria-1~precise) [02:06:15] precise :p [02:06:27] ... [02:06:36] That's ancient [02:06:53] It would probably explain ERROR 2003 (HY000): Can't connect to MySQL server on '10.68.18.167' (111) [02:07:03] Cyberpower678: that's because i removed it [02:07:07] give me a minute [02:07:10] Okay [02:10:58] damn, it's because that class installs APT sources for mariadb on precise [02:11:11] which of course conflict badly on stretch [02:11:55] maybe we should just copy the relevant parts and not use that mysql class [02:13:27] mutante: I don't get it. Why is the latest version of MariaDB relying on Precise? [02:14:01] Cyberpower678: because mysql::server::package says so [02:14:19] 6 if $package_name =~ /mariadb/ { [02:14:19] 7 apt::repository { 'wikimedia-mariadb': [02:14:19] 8 uri => 'http://apt.wikimedia.org/wikimedia', [02:14:19] 9 dist => 'precise-wikimedia', [02:14:42] hardcoded precise in there [02:14:55] that wont work anywhere nowadays [02:15:09] basically it means that the class cant be used [02:15:10] No kidding [02:15:21] or we start making that flexible of course [02:15:55] there is mariadb in normal debian repos now, we don't need to use our own [02:16:21] and then we get version 10.1 instead of 5.x [02:17:43] Cyberpower678: class mysql isn't used by anything then :) [02:17:48] in prod we use module mariadb [02:18:26] I se [02:18:28] *see [02:18:35] thinks about the easy fix for you [02:19:10] bd808: Any chance we can drop the disk limit on cyberbot-exec-iabot-01 from 120GB to maybe 40GB, and divert that to cyberbot-db-01 [02:21:59] Cyberpower678: not really, no. It would require making custom ‘flavors’ for both and the building new vms from scratch. [02:22:15] Oh [02:23:46] * bd808 is on vacation, drinking beer, and watching his Cubs. Responses may stop at any time. [02:24:54] Cyberpower678: https://gerrit.wikimedia.org/r/385119 [02:25:10] bd808: enjoy the beer. [02:26:59] mutante: awesome. I'm getting the DB dump onto that node. [02:27:22] Cyberpower678: it's not merged:) [02:27:43] Yea, but the upload will take another 30 minutes to finish [02:28:19] ;-) [02:30:51] i will change it to make it more fool-proof that it cant touch anything else [02:34:17] meh, quarry is the only thing using that to install mariadb [02:35:02] i wonder how it can work with precise sources [02:44:41] Hmmm... [02:44:55] The dump is uploaded and ready to go. [02:46:15] mutante: ^ [02:46:59] (03PS166) 10Ricordisamoa: Initial commit [labs/tools/wikidata-slicer] - 10https://gerrit.wikimedia.org/r/241296 [02:52:19] Cyberpower678: i wanna know what quarry runs on now .. [02:52:29] i really dont want to touch that in any way [02:52:39] it's the only thing using this [02:52:55] (03CR) 10Ricordisamoa: [C: 04-2] "PS166 changes save step to avoid page reload" [labs/tools/wikidata-slicer] - 10https://gerrit.wikimedia.org/r/241296 (owner: 10Ricordisamoa) [02:53:22] If it's running precise, then I'll break out Kali Linux. XD [02:53:35] o.O? [02:54:11] zhuyifei1999_: What? [02:54:34] mutante: what's the command to check mysql version? [02:54:53] Cyberpower678: how do you break precise with kali? [02:55:04] * zhuyifei1999_ is unaware of that [02:55:24] zhuyifei1999_: I was joking, since Kali Linux is primarily used for hacking. [02:55:38] zhuyifei1999_: mysql -V ? what i would really like to know is "lsb_release -a" though whereever quarry runs [02:55:57] zhuyifei1999@quarry-main-01:~$ lsb_release -a [02:55:57] No LSB modules are available. [02:55:57] Distributor ID: Ubuntu [02:55:57] Description: Ubuntu 14.04 LTS [02:55:57] Release: 14.04 [02:55:58] Codename: trusty [02:56:21] zhuyifei1999@quarry-main-01:~$ mysql -V [02:56:21] mysql Ver 15.1 Distrib 5.5.39-MariaDB, for debian-linux-gnu (x86_64) using readline 5.1 [02:56:55] interesting [02:56:58] I think the two workers are jessie but this main (db, redis, web, etc) is trusty [02:56:58] zhuyifei1999_: cat /etc/apt/sources.list.d/wikimedia-mariadb.list [02:57:03] ^ that pleaes [02:57:16] zhuyifei1999@quarry-main-01:~$ cat /etc/apt/sources.list.d/wikimedia-mariadb.list [02:57:16] deb http://apt.wikimedia.org/wikimedia precise-wikimedia mariadb [02:57:16] deb-src http://apt.wikimedia.org/wikimedia precise-wikimedia mariadb [02:57:21] how does that work :) [02:57:27] no idea [02:58:19] ok, ehm.. [02:58:39] Cyberpower678: I wan't aware of any extremely exploitable bugs in precise. that was why I asked. kali at least needs some sort of bugs to exploit [02:58:47] is torn between not using the mysql module at all ... or tryign to fix it [02:59:04] zhuyifei1999_: it was only a joke. :-) [02:59:07] k [02:59:38] * Cyberpower678 is using some of his free time to learn about ethical hacking. [03:00:15] And of course, making my own stuff secure [03:00:36] nice :) [03:04:06] mutante: https://phabricator.wikimedia.org/T165625 fyi [03:04:59] Cyberpower678: i'm fixing it by .. not using that module [03:05:10] Ok [03:05:24] zhuyifei1999_: thanks! i agree there is no future for it probably :) [03:05:25] So are we not using puppet to setup a DB? [03:05:37] zhuyifei1999_: it looks like that comes down to "make quarry stop using it" [03:05:55] Cyberpower678: oh, we are definitely using puppet, but we are not using mysql::server [03:06:02] Ah [03:06:07] well, I'm not familiar with all the puppet modules to change that [03:06:19] Because I am now getting used to Puppet [03:07:19] mutante: could you cc me to whichever patch you create to "make quarry stop using it"? [03:07:40] I do hope there's someway to test the patch before breaking everything though [03:09:42] zhuyifei1999_: so i just decided to not try and fix the mysql::server class [03:09:55] because i did _not_ want to touch quarry in any way [03:10:07] um ok [03:10:29] what i am doing now is going back to just putting the config file in place with puppet [03:10:31] well quarry still has to move to a new class, if possible [03:10:35] and installing the package [03:10:36] ok [03:10:42] because i want the fix for Cyberpower678 right now [03:10:51] ok [03:10:58] while i can guarantee i am not touching anythng else [03:11:07] k [03:13:10] * Cyberpower678 gives mutante a beer [03:13:34] * mutante cools it and is planning to open it in 5 min [03:13:36] uploads [03:13:48] * mutante is already missing the bots here [03:14:02] because with them i wouldnt have to keep saying that or manually link to gerrit [03:15:54] CC'ed you on the gerrit change [03:16:30] https://gerrit.wikimedia.org/r/#/c/385125/ [03:16:32] (03PS167) 10Ricordisamoa: Initial commit [labs/tools/wikidata-slicer] - 10https://gerrit.wikimedia.org/r/241296 [03:17:57] Cyberpower678: fixed, you can try it now [03:18:05] mariadb 10.1 (not 5) is installed now [03:18:09] puppet is happy [03:18:14] ERROR 2003 (HY000): Can't connect to MySQL server on '10.68.18.167' (111) [03:18:37] please add the commands you are running [03:18:38] From cyberbot-exec-iabot-01 [03:18:39] because: [03:18:39] mysql [03:18:39] Welcome to the MariaDB monitor. Commands end with ; or \g. [03:19:04] (03CR) 10Ricordisamoa: [C: 04-2] "PS167 clears the interval for the conflict handler upon starting the save step" [labs/tools/wikidata-slicer] - 10https://gerrit.wikimedia.org/r/241296 (owner: 10Ricordisamoa) [03:19:13] you can't expect a connection from another host to work yet [03:19:16] before adding GRANTs [03:19:35] I thought it would give me an error saying it won't allow me to connect. [03:19:48] didn't it? [03:19:57] It gave me ERROR 2003 (HY000): Can't connect to MySQL server on '10.68.18.167' (111) [03:20:13] The server apparently didn't respond at all. [03:20:22] what's the difference between "me an error saying it won't allow me to connect" and "ERROR 2003 (HY000): Can't connect to MySQL server " [03:21:16] The last time we tried to connect when it was working, it said this user isn't allowed to connect. It didn't say it can't connect. The former confirmed a response from the server [03:21:30] ERROR 2003 is a mysql error though [03:21:54] That latter implies it didn't. Unless the messages got vaguer in the newer versions [03:22:11] mutante: well wouldn't the mysql app return a mysql error. [03:22:15] "didnt respond at all" != "responds with error 2003" [03:22:41] Cyberpower678: do you still have your GRANTS line from earlier? [03:22:49] or just give me user/password :) [03:22:58] or i set one and give it to you [03:23:20] also, weren't you going to import it .. from localhost [03:23:31] and that was the whole point of uploading the .sql file [03:27:14] running the import of the dump is a step before granting access to it [03:27:30] so that a DB exists that you can give GRANTs to [03:28:39] so the name of the database is "s51059__cyberbot" with 2 underscores? [03:29:16] !log cyberbot copying mysqldump file to /srv/; gunzip'ing it; creating database s51059__cyberbot ; importing mysqldump [03:29:20] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Cyberbot/SAL [03:31:38] Cyberpower678: it's only 12G unzipped [03:31:46] didnt you say even more? [03:38:05] bd808: quick question. Isn't a large instance granted 80GB of disk space. Mine only got 60GB in the SRV [03:39:45] ok, i purged all the things , removed that precise file, let puppet install it all again .. [03:42:41] Cyberpower678: import ongoing .. sice in /srv/mysql is increasing [03:42:44] size [03:42:57] root@cyberbot-db-01:/srv/mysql# watch du -s . [03:43:00] ^ try that [03:43:10] mutante: looks like it's writing to the correct folder now. [03:43:32] Cyberpower678: last attempt i hadn't removed the remnants of the old version and restarted [03:43:39] now i did [03:43:47] That'll do it. ;p [03:44:11] Cyberpower678: try this: become root, and then run "screen -x" [03:44:29] do you see my command then ?:) [03:44:50] apparently yes :) [03:44:54] since you are hitting keys [03:44:57] dont:) [03:45:01] let it run [03:45:28] Yes, I do, but that was me scrolling on the trackpad. I was trying to scroll the terminal up. [03:46:02] How do I get out. [03:46:10] i am trying to clean that up but it adds more [03:46:15] trackpad is not gonna work [03:46:18] I don't want to accidentally kill the import [03:46:39] Cyberpower678: Ctrl+A +d [03:46:43] mutante: sorry, I'm not even try to add more. I switched between windows. [03:46:57] you hit Crtl+A, then you let go of both keys, and you hit "d" [03:47:12] Ooops [03:47:15] it should say "detached" and bring you back [03:47:22] did it? [03:47:30] I hit Crtl+A then Crtl+D [03:47:35] Yes [03:47:53] it's only a "d" for detach [03:48:03] all screen commands start with Ctrl+A [03:48:06] Apparently both work. :-) [03:48:09] and then some letter [03:48:23] so now you know how to get out of it [03:48:53] with "screen -x" you can see it again [03:49:10] this means i can now disconnect and stop working .. [03:49:17] while the import keeps running [03:49:20] and you can watch it :) heh [03:49:21] Cool [03:49:30] Well it's 23:49 here [03:49:52] you are on the US East Coast? [03:49:57] Yep [03:50:21] well, here is what you do.. just disconnect and sleep.. when you come back ssh to it and "screen -x [03:50:29] and see it finished :) [03:50:45] Or see a new error. :p [03:51:03] yea, but unlikely at this point [03:51:50] you might want to screen -d -r if nobody is attacked to that screen session [03:51:57] mutante: BTW, the previous attempt indeed wasn't working anyway. [03:51:59] ERROR 1130 (HY000): Host 'cyberbot-exec-iabot-01.cyberbot.eqiad.wmflabs' is not allowed to connect to this MariaDB server [03:52:20] That definitely confirms the server is responding to my connection attempt [03:53:11] Cyberpower678: right, it's a different error [03:53:17] but both are mysql errors [03:53:56] one is the _client_ saying i cant even connect [03:54:04] ann one is the server saying you arent allowed [03:54:08] mutante: Yes, but the other one was a complete failure to even connect, or get a response from the server. I get that error when my tunnels to the DB aren't open. [03:54:29] it was because the new package was installed but there was no restart [03:54:35] mutante: ah. [03:54:39] so it didn't have the "bind-address 0.0.0.0" part [03:54:45] and wasnt listen besides localhost [03:54:49] Yea, that was my point. :p [03:54:53] you are right then [03:55:35] * Cyberpower678 goes to bed. His eyelids weigh as much as anvils [03:56:19] mutante: good night and thanks for your help. :D [03:58:27] Cyberpower678: good night :) leaving too [07:09:49] (03CR) 10jenkins-bot: Localisation updates from https://translatewiki.net. [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/385139 (owner: 10L10n-bot) [07:17:17] 10Tool-wikiloves: Generalize wikiloves tool beyond the "Wiki Loves X" format - https://phabricator.wikimedia.org/T173005#3695693 (10JeanFred) [08:23:54] (03PS1) 10Lokal Profil: Add WLM favicon [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/385151 [08:28:39] (03CR) 10Jean-Frédéric: [C: 032] Add WLM favicon [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/385151 (owner: 10Lokal Profil) [08:30:55] (03Merged) 10jenkins-bot: Add WLM favicon [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/385151 (owner: 10Lokal Profil) [08:32:30] (03CR) 10jenkins-bot: Add WLM favicon [labs/tools/heritage] - 10https://gerrit.wikimedia.org/r/385151 (owner: 10Lokal Profil) [10:57:57] 10Cloud-VPS, 10Graphite: grafana-labs often fails to generate graphs with c.datapoints is undefined - https://phabricator.wikimedia.org/T177747#3696296 (10fgiunchedi) >>! In T177747#3675523, @hashar wrote: > Hurrah it works fine on https://grafana-labs.wikimedia.org/dashboard/db/labs-project-board?orgId=1&var-... [11:39:03] 10PAWS: Server error (500) while trying to download files from Commons from PAWS - https://phabricator.wikimedia.org/T178567#3696395 (10Chicocvenancio) [14:23:44] 10Cloud-VPS (Quota-requests): Request increased quota for mwstake Cloud VPS project - https://phabricator.wikimedia.org/T178012#3696666 (10MarkAHershberger) We've moved to WMFLabs... I'll turn off the old AWS instance tonight [15:04:40] chasemp: I need your assistance [15:09:04] Cyberpower678: chase is out for a long weekend [15:10:42] andrewbogott: I'm trying to mysql into cyberbot-db-01.cyberbot.eqiad.wmflabs from tools-login.wmflabs.org [15:10:47] It won't connect [15:12:06] * andrewbogott tries it [15:12:58] do you get an error or does it just hang? [15:14:10] It eventually times out and says it can't connect. [15:14:41] ok [15:14:55] I assume this isn't by design — will dig around a bit [15:15:29] andrewbogott: I can connect just fine from within instances in my project [15:15:46] so this is the database you made yourself, right? [15:16:01] Have you already changed your security group to allow access to things outside the project? [15:16:39] andrewbogott: mutante suggested I needed to. [15:16:46] But I have no idea what to change [15:17:03] I have a webservice on tools that needs access to this DB. [15:17:19] so by default pretty much all ports are closed to access outside of a project [15:17:40] so you'll need to open up port 3306 (probably) to 10.0.0.0 (which means everything in the cloud) [15:17:51] have you modified a security group before? Or applied one? [15:17:56] probably only one or 2 ips [15:18:09] if he has dedicated clients, too [15:18:18] andrewbogott: I have not, but I have the interface open on Horizon [15:18:32] jynus: since he's trying to connect from tools-login, I assume that he's going to access it from the grid at some point? [15:18:36] oh [15:18:38] is he? [15:18:52] Cyberpower678: I thought you had dedicated clients? [15:18:57] Cyberpower678: can you tell me what your long-term plans are for where/what is going to need to access this? [15:18:59] other VMs [15:19:27] https://tools.wmflabs.org/iabot needs access. [15:19:27] i was hoping it was possible without the "all of 10.0.0.0" part [15:19:38] Also the web workers on the grid need access too. [15:19:40] and we'd just need the right IP [15:20:03] if it's something running on the grid I don't think we can be more selective [15:20:24] for some reason i first assumed it's like this: tool -> exec node -> db [15:20:32] but then i learned he needs both [15:20:42] connections to db from exec node AND from a tool [15:20:54] Cyberpower678: is that really true? ^ [15:21:16] mutante: I don't understand what you mean by 'exec node'. Typically when we say that we mean 'the things that tools run on' [15:21:18] cyberbot-exec-iabot-01 and cyberbot-exec-01 access the DB on cyberbot-db-01 [15:21:24] Directly [15:21:29] andrewbogott: "cyberbot exec node" [15:21:37] oh, well, that's confusing :) [15:21:45] Workers on the grid access the DB directly too [15:21:50] andrewbogott: within the project "cyberbot" he has 2 types of instances, "exec" and "db" [15:21:53] As well as the webservice on tools [15:22:10] It's been like this since tools-db was pretty much accessible everywhere [15:22:32] So I never tunneled through anything to access tools.labsdb [15:22:57] mutante: so, what's the problem with opening it to 10.0.0.0/8? [15:23:04] andrewbogott: nothing [15:23:14] Just need some guidance doing that [15:23:27] I have the security groups interface open on horizon [15:23:47] andrewbogott: my admin instincts saying "limit it to the correct IP". that's all [15:23:59] ok [15:24:02] nothing, if it's just like toolsdb [15:24:14] Cyberpower678: so, create a new group called 'mysql' [15:24:19] and then add a rule... [15:24:35] ok, or 'DB Access' :) [15:24:38] I created a group called DB Access [15:25:07] Quick dumb question. MYSQL uses TCP.? [15:25:13] * Cyberpower678 assumes so [15:25:20] then add a rule for port tcp/ip, port 3306 (which is mysql usually) and the ip range 10.0.0.0/8 (which is basically 'everything in the cloud') [15:25:23] yeah [15:25:58] Rule added [15:25:59] yep, looks right to me [15:26:06] ok, now go back to the page with the list of VMs [15:26:25] And 'Edit Security Groups' for the db server [15:26:31] and add that group to that VM [15:28:03] Done [15:28:44] ok, that's it for your firewall [15:28:57] there might be grants and whatnot left to do, but that would be internal to your mysql setup [15:29:56] andrewbogott: yep, another question. Everything in tools, like tools-login, the grid, and the webservices all connection from *.tools.eqiad.wmflabs? [15:30:19] I'd expect so [15:33:21] Hmmm... I added the grant *.tools.eqiad.wmflabs [15:33:27] Still getting ERROR 1130 (HY000): Host 'tools-bastion-03.tools.eqiad.wmflabs' is not allowed to connect to this MariaDB server [15:34:25] mutante: andrewbogott ^ [15:34:47] That's definitely the right track but I don't know specifically what the syntax is for this (using a wildcard). Probably paste the grants you have here and then mutante or jynus might have an idea [15:37:02] Cyberpower678: that seems like a mysql error, so it seems to be able to conenct [15:37:19] but need an account with proper grants on mysql [15:38:11] he used a grant with "TO 's51059'@'*.tools.eqiad.wmflabs' [15:38:21] that doesn't look good [15:39:12] ON s51059__cyberbot.* TO 's51059'@'*.tools.eqiad.wmflabs' [15:39:29] the eildcard for usernames is % [15:39:37] * is for databases and tables [15:39:53] pluse there are some restrictions on how widcards can be used [15:40:06] and how they are resolved [15:40:28] How do I get lighttpd to output ; charset=utf-8 [15:40:38] jynus: Thank you. I'm in. [15:40:50] also the name should be escaped [15:41:16] s51059__cyberbot must be expressed as `s51059\_\_cyberbot` [15:55:39] Dispenser: ? [15:55:51] for a static file? [15:56:34] https://tools.wmflabs.org/dispenser/tmp/logs/wrongext-commonswiki.html Mojibake at second last line. Firefox has selectable encoding, but chrome doesn't [15:56:57] On my own server I've got a modified version of https://redmine.lighttpd.net/attachments/1241/create-mime.assign.pl [15:58:29] Dispenser: mimetype.assign = ( ".html" => "text/html; charset=utf-8" ) ? [15:59:17] IIRC that broke lighttpd (duplicate keys) [15:59:39] uh :( [15:59:49] And it would be nice if we could fix it for everyone [16:00:20] * zhuyifei1999_ isn't lighttpd exprt [16:01:41] andrewbogott: btw, cyberbot-db-01 is a large instance, with 80GB allocated. My SRV partition only has barely 55GB. [16:02:00] andrewbogott: the DB will eventually run out, sooner than later. [16:02:42] Any chance we can bump that up a bit. [16:02:57] Cyberpower678: it looks right to me… 20G for /, 60 for /srv, total = 80 [16:03:07] Grunt. [16:03:21] The 20GB is in the root, while the DB lives on SRV [16:04:25] Since these are logical volumes, can't you resize the cyberbot-exec-iabot-01 volume down maybe 60 GB and add that to the logical volume on cyberbot-db-01 [16:06:15] andrewbogott: ^ [16:06:50] Cyberpower678: If you need more disk space then you should probably start with a larger instance type. [16:07:00] I'm just saying that you asked for an 80G instance and got one. [16:07:17] which wouldn't be a bad thing because then we can use stretch [16:07:18] I was only given enough for a large instance. I primarily need the disk space not the CPUs [16:07:21] and also kill the trusty [16:08:15] andrewbogott: I didn't ask for 80GB, it was just given to me. I asked for an instance large enough to house a growing DB. [16:08:44] Nor was I aware of the fact that 20GB of it practically unusable since it's located elsewhere/ [16:08:53] If you know your specific needs I'd recommend opening a quota increase task specifying what you need. [16:08:54] Physiclaly [16:09:35] So you can't just raise the logical volume size of the SRV mount of an instance? [16:10:03] I thought that was the beauty of logical volumes. [16:10:06] I don't know — maybe? I certainly wouldn't stop you. [16:10:28] I think a sysadmin would need to do that. [16:10:43] https://wikitech.wikimedia.org/wiki/Help:Resize_root_partition [16:10:56] zhuyifei1999_: I'm not resizing the root partition [16:10:56] Cyberpower678: sorry, I'm confused [16:11:11] you have root on that box don't you? Isn't it in a private project that you control? [16:11:19] Yes. [16:11:27] But the root disk space is 20GB [16:11:44] so what do you mean by 'a sysadmin'? [16:12:06] I was advised to have puppet allocate actual disk space by applying the SRV to my instances. [16:12:24] Behold a new logical volume appears with the added space to the instance. [16:12:51] /srv already used up all the space available to you. if you want to extend /srv you need to make / smaller (i.e. resize it) [16:13:23] i wouldnt go much lower than maybe 15 for / [16:13:32] or you will run into issues too fast [16:14:36] Definitely on my 'top 5 reasons I have to intervene to rescue users' is them filling up /var/log [16:14:42] zhuyifei1999_: I thought SRV is a different disk that is just partitioned and mounted. [16:14:43] which will happen faster with a smaller / [16:15:10] andrewbogott: logrotate :P [16:15:46] zhuyifei1999_: new cron spam from logrotate :) [16:15:47] Cyberpower678: no, the entire disk is 80GB, 20GB is partitioned for /, and the rest: do whatever you want with it [16:15:49] What exactly is SRV aside from it being a LVM [16:15:58] Oh great [16:16:09] I did not know that. [16:16:38] I would have asked for an xtra large instance in that case. [16:17:00] zhuyifei1999_: I certainly encourage users to properly manage their VMs but I don't have time to check the logrotate policies on 700 servers that I didn't set up [16:17:10] lol [16:18:00] Cyberpower678: remember it's not as much work to re-create it anymore since you have the role now.. also it's possible to add the GRANT part to puppet so that also becomes automatic [16:18:42] andrewbogott: my servers are fine. I have an extra large instance that needs the CPU and RAM, but not the diskspace, and I have a large instance that needs the diskspace, but likely not as much RAM. :/ [16:19:55] CPU & RAM => m1.gigantic; disk => m1.bigdisk [16:19:57] mutante: I know that. [16:20:09] disk space is only used if you use it, so the xlarge instance isn't much of an issue [16:20:25] for the server I think there's a 'bigdisk' flavor that has more disk space without more RAM [16:20:50] andrewbogott: I'd like that. [16:21:08] 300GB incoming [16:21:34] andrewbogott: so if my volume remains empty, it's just used elsewhere? [16:22:09] So I don't need to feel guilty about a 130GB empty SRV mount on my xlarge instance? [16:22:45] correct. If you don't apply labs::lvm::srv then the disk space is never allocated in the first place. [16:22:55] I did apply it. [16:22:58] why? [16:23:13] I applied it to the wrong instance. [16:23:19] I meant to apply it to DB. [16:23:26] Not my exec node [16:23:28] welp [16:23:32] anyway, to apply for a quota increase, https://phabricator.wikimedia.org/project/profile/2880/ [16:23:39] i've put phabricator's one onto /srv as it can get large if you use repo's. [16:23:43] :) [16:24:11] andrewbogott: https://phabricator.wikimedia.org/T178332 [16:24:43] andrewbogott: I can unapply it right? I can also unallocate the disk once again right? [16:25:07] I think if you remove the class and then remove the /srv mount from /etc/fstab and reboot you should be good. [16:25:25] Great. I'll give you back your disk. [16:25:52] But when I created my large instance, I don't recall seeing a bigdisk option. [16:26:22] yeah, I'll have to assign that new flavor to your project. [16:26:44] Can my current instance be modified, or do I have to wipe and rebuild? [16:27:00] * Cyberpower678 prefers modified [16:27:24] you can't change the flavor of an existing instance [16:27:30] Derp. [16:27:35] also I believe mutante was hoping you'd rebuild with a new OS anyway? [16:27:37] * Cyberpower678 is thankful he has puppet. [16:27:51] Well the DB host is new. [16:27:54] * zhuyifei1999_ don't believe space can be freed on the labvirts just by not mounting the volumes, unless there is some magic going on [16:28:01] My exec nodes aren't [16:28:36] zhuyifei1999_: disk space is only consumed on the virt hosts when it's actually filled. If the volume isn't mounted it definitely won't be filled. [16:28:36] yea, the DB host is fine, nevermind [16:28:43] zhuyifei1999_: don't worry. The exec nodes are getting trashed soon anyways. That should free up the disks regardless. [16:28:48] but the exec nodes.. would be nice to rebuild them [16:29:00] ok :) [16:29:01] andrewbogott: ok [16:29:26] Let's first worry about the disk space problems for the DB host [16:30:48] andrewbogott: I'm going to rebuild the DB host, with new instance. [16:31:12] Can you add a large instance option with bigdisk flavor to Cyberbot [16:31:45] 10cloud-services-team (FY2017-18), 10Operations, 10Puppet, 10User-Joe: Upgrade to puppet 4 (4.8 or newer) - https://phabricator.wikimedia.org/T177254#3697047 (10herron) [16:32:06] You will need to file a request for increased quota, or edit the existing request. Please be specific about what you think you actually need [16:32:14] (since not being specific got you… not enough, apparently?) [16:32:43] Cyberpower678: if you want to add the GRANTs stuff to puppet.. there is an example here: modules/wikistats/manifests/db.pp [16:32:51] Well I don't need a quota increase? [16:33:17] andrewbogott: ^ I'm going to trash the current instance, and create a new one. I just need that flavor available? [16:33:40] Which is what that request was supposed to address. :p [16:34:56] Quotas describe how much resources a project uses. [16:35:02] You are proposing to use more resources (disk space) [16:35:06] which is effectively a quota increase. [16:35:14] Please specify how much disk space you expect to need. [16:35:32] There doesn't appear to be a disk limit quota though. [16:35:37] In a ticket (that one or another one) so we can consider it during our weekly resource-allocation issue. [16:35:53] Just Security Groups, CPUs, and RAM, as well as floating IPs [16:35:56] And yet disk space is a finite resource which we consider during our resource-allocation meeting. [16:36:25] * Cyberpower678 drives his head through a desk [16:36:34] Cyberpower678: just write it down, it will make it all easier [16:37:35] mutante: aside from learning puppet, I feel like a lot of time was wasted. [16:37:59] Cyberpower678: the point is that creating tickets actually saves time [16:38:18] for example you don't have to repeat things to multiple people [16:38:24] you can just link to it [16:38:28] and look it up later [16:38:38] Now I have a DB running out of space soon. [16:39:52] yea, so open that phab ticket, and add a sentence that describes what you need [16:40:07] Yea. [16:40:20] Sorry, just a tad frustrated. [16:40:36] andrewbogott: sorry, I didn't mean to blow up on you [16:40:46] If it seemed that way. [16:42:53] It's ok. Just, think ahead about what you need and be specific. [16:43:03] We can't magically predict what you'll need [16:44:43] Ok [16:44:53] me reopens the ticket. [16:45:07] :) [16:52:51] 10Cloud-VPS (Quota-requests): Request increased quota for cyberbot Cloud VPS project - https://phabricator.wikimedia.org/T178332#3697127 (10Cyberpower678) 05Resolved>03Open Unfortunately with the DB at 45GB, I need the bigdisk flavor. [16:54:53] andrewbogott: chasemp: That ticket has been reopened [16:57:12] 10Cloud-VPS (Quota-requests): Request increased quota for cyberbot Cloud VPS project - https://phabricator.wikimedia.org/T178332#3688709 (10Andrew) @Cyberpower678 Can you provide any kind of long-term estimate about how big you expect this database to grow? [17:00:34] 10Cloud-VPS (Quota-requests): Request increased quota for cyberbot Cloud VPS project - https://phabricator.wikimedia.org/T178332#3697168 (10Cyberpower678) The DB is shared across every wiki IABot runs on. As it expands to more wikis the DB will grow a little more. The DB also contains log entries for https://t... [17:07:46] andrewbogott: ^ [17:08:37] thanks, looks good [17:52:05] 10Toolforge: Add charset=utf-8 by default to lighttpd - https://phabricator.wikimedia.org/T178601#3697366 (10Dispenser) [17:52:46] I filed a bug since I can't see how we're enumerating extensions https://phabricator.wikimedia.org/T178601 [18:07:25] 10Cloud-Services, 10Outreachy (Round-15): Proposal: Improvements for the Toolforge 'webservice' command - https://phabricator.wikimedia.org/T177603#3697442 (10Sowjanyavemuri) @srishakatux Thanks for the reaching out to the organizers and updating me on that. [18:10:38] 10Toolforge: Add charset=utf-8 by default to lighttpd - https://phabricator.wikimedia.org/T178601#3697366 (10zhuyifei1999) The mimetype is assigned via: [[https://github.com/wikimedia/operations-software-tools-webservice/blob/7d7df1bbdd2c7637fc70e30f398859ca7131d3c0/toollabs/webservice/services/lighttpdwebservic... [18:18:55] 10Toolforge: Add charset=utf-8 by default to lighttpd - https://phabricator.wikimedia.org/T178601#3697490 (10zhuyifei1999) https://github.com/lighttpd/lighttpd2/blob/ca446911ef08e97417a74651ecd8528f9c7c27cf/contrib/create-mimetypes.conf.pl contains a more feature-rich `create-mime.assign.pl` that sets utf-8 in s... [18:25:53] 10Cloud-Services, 10Outreachy (Round-15): Proposal: Improvements for the Toolforge 'webservice' command - https://phabricator.wikimedia.org/T177603#3697515 (10srishakatux) @Sowjanyavemuri Hi! You have filled out January 18th as the final date of enrollment for the semester that overlaps with Outreachy. So, it... [18:39:48] !log git deploying https://gerrit.wikimedia.org/r/#/c/363734/ upgrading gerrit to 2.14.6-pre on gerrit-test3 and installing lfs plugin [18:39:51] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Git/SAL [18:40:11] 10PAWS, 10Pywikibot-Commons, 10Pywikibot-General: Server error (500) while trying to download files from Commons from PAWS - https://phabricator.wikimedia.org/T178567#3697578 (10Chicocvenancio) [18:41:22] 10Cloud-Services, 10Outreachy (Round-15): Proposal: Improvements for the Toolforge 'webservice' command - https://phabricator.wikimedia.org/T177603#3697581 (10Sowjanyavemuri) @srishakatux Hi! Thanks for letting me know. But, I have some concerns regarding the same. Considering Jan 18, 2018(Thursday) as my las... [18:50:39] 10PAWS, 10Pywikibot-Commons, 10Pywikibot-General: Server error (500) while trying to download files from Commons from PAWS - https://phabricator.wikimedia.org/T178567#3697611 (10zhuyifei1999) Are the list of 500-ed images the same every time you execute the script? Or is the list random? [18:54:13] 10PAWS, 10Pywikibot-Commons: Server error (500) while trying to download files from Commons from PAWS - https://phabricator.wikimedia.org/T178567#3697615 (10zhuyifei1999) [19:02:24] 10Cloud-Services, 10Outreachy (Round-15): Proposal: Improvements for the Toolforge 'webservice' command - https://phabricator.wikimedia.org/T177603#3697677 (10srishakatux) @Sowjanyavemuri Yes, that's right! You are only falling short of 5 days (Jan 18th - Jan 23rd ), but that's almost a week. [19:54:24] 10Cloud-Services, 10Outreachy (Round-15): Proposal: Improvements for the Toolforge 'webservice' command - https://phabricator.wikimedia.org/T177603#3697905 (10Sowjanyavemuri) Hi @srishakatux , I have considered the week days(i.e Monday to Friday) as the working days for an intern. In my case, number of working... [20:27:07] 10Cloud-Services, 10Outreachy (Round-15): Proposal: Improvements for the Toolforge 'webservice' command - https://phabricator.wikimedia.org/T177603#3697954 (10srishakatux) Hello @Sowjanyavemuri! I am not the person who can make any decisions on this anyway :) I am just following the rules of the program. I wou... [20:37:52] 10Cloud-Services, 10Outreachy (Round-15): Proposal: Improvements for the Toolforge 'webservice' command - https://phabricator.wikimedia.org/T177603#3697975 (10Sowjanyavemuri) Hi @Andrew @bd808 @madhuvishy I have some queries regarding the eligibility for Outreachy. I have considered the week days(i.e Monday... [21:06:02] bd808: is there something like https://www.mediawiki.org/wiki/Best_practices_for_extensions for Toolforge tools? [21:06:38] I want to point Outreachy applicants to a document where they can get an idea of the "unknown unknowns" [21:15:39] the only 'required'/'should' are in the policies afaik [21:28:17] 10Data-Services, 10DBA, 10Dumps-Generation, 10Blocked-on-schema-change, 10MediaWiki-Platform-Team (MWPT-Q2-Oct-Dec-2017): Schema change for refactored comment storage - https://phabricator.wikimedia.org/T174569#3698106 (10CCicalese_WMF) [22:40:54] Hi ! I was working with the database schema and found that one of the field 'page_counter' from page table was deprecated from version 1.25 but our database replicas on Toolforge still retain this field. Is it that the database replicas are not up to date with current version ?? [22:44:02] yasha: still exists in the prod dbs as well. While i cant say 100%, generally removing an now unused field is more trouble than its worth in many cases when you have large databaes [22:45:14] ebernhardson: But this may be a problem for new developers since they end up using such fields with doesn't exit on master db. [22:45:38] *being deprecated, sorry [23:33:56] tgr: there is not afaik but that’s a great idea. File a task? [23:40:27] bd808: in case you didn't notice, I reopened the quota ticket. How's your vacation so far? [23:41:57] Cyberpower678: I saw. Travel is done so vacation should get better soon [23:42:34] * Cyberpower678 hands mutante and bd808 a beer [23:59:00] (03PS168) 10Ricordisamoa: Initial commit [labs/tools/wikidata-slicer] - 10https://gerrit.wikimedia.org/r/241296