[02:21:03] can I get an unblock
[02:21:06] please
[02:21:17] MdupontDataBot
[02:21:36] mdupont: looking
[02:22:12] I am developing a new script, this was its first run
[02:23:13] I have patched it to skip all namespaced articles and will be watching it
[02:23:26] To git@github.com:h4ck3rm1k3/open-everything-library.git 3e42d46..e0c8220 extractor -> extractor is the new patch
[02:23:57] mdupont: did you put up a request on your talk page?
[02:24:30] i just https://www.wikidata.org/w/index.php?title=User_talk:MdupontDataBot&oldid=303057140 responded to this thread
[02:24:43] about the request for bots I have one yes
[02:25:01] but I am developing new methods all the time
[02:26:23] you're going to have to work it out with the blocking admin
[02:26:40] k
[09:09:13] hoo: hey, Can you check the patch again? I'm not sure I'm using waitforReplication correctly
[09:10:49] Sure, I'll have a look
[09:11:48] thanks :)
[10:12:24] intgr, sjoerddebruin you guys there?
[10:12:30] mdupont: hi
[10:12:43] Ok, I am sorry for leaving this bot running when I left for work
[10:13:03] I have adjusted the code to remove all namespaced articles
[10:13:17] so anything with a : in the name will be skipped
[10:13:39] hoo: please check again
[10:13:39] the pywikipedia bot is not able to manage these things because it gets a stack overflow
[10:13:45] if you have some time
[10:13:49] mdupont: Ah, okay.
[10:13:54] it cannot handle deep recursion
[10:13:57] mdupont: I'm here
[10:14:06] I am sorry for causing problems.
[10:14:37] can you please unblock my bot and let me continue with the import of software projects
[10:14:51] Okay, be sure to keep an eye on the creations. ;)
[10:15:02] Amir1: Will do in a sec
[10:15:02] I will not leave my children unattended
[10:15:14] parents must be with children at all times
[10:15:15] :D
[10:15:19] awesome
[10:15:39] Great, unblocked.
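Note on the 10:13 fix ("anything with a : in the name will be skipped"): a minimal Python sketch of that kind of filter follows. The function names and sample titles are hypothetical and are not taken from the bot's repository. The colon test is only a heuristic, so it also drops main-namespace titles that happen to contain a colon.

```python
from typing import Iterable, List


def is_namespaced(title: str) -> bool:
    """Heuristic from the log: treat any title containing ':' as namespaced."""
    return ":" in title


def filter_main_namespace(titles: Iterable[str]) -> List[str]:
    """Keep only titles that do not look namespaced (hypothetical helper)."""
    return [title for title in titles if not is_namespaced(title)]


if __name__ == "__main__":
    # Hypothetical sample titles, for illustration only.
    sample = ["GNU Emacs", "Category:Free software", "Template:Infobox software"]
    print(filter_main_namespace(sample))
```

A stricter version would compare the prefix before the colon against the wiki's actual namespace list instead of rejecting every colon.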
[10:16:48] thx
[10:17:54] DanielK_WMDE: https://gerrit.wikimedia.org/r/270255
[10:18:11] Should we maybe just throw away all of that, now that SULF is done?
[10:37:07] hoo: when are you coming to the office?
[10:37:39] aude: Will be there in another 2:30h :)
[10:37:44] ok :)
[10:38:06] * aude will not be there for the daily, but will come around 12pm or so
[10:38:19] Thiemo_WMDE: Jonas_WMDE ^ (not sure who is there)
[10:39:31] aude: no daily today. it's just me and Thiemo.
[10:39:47] aude: Only daniel and me in the office. We just did some kind of daily. Do you have something to share right now?
[10:40:12] I'm currently working on DataModel 5 and a possible EntityDocument::copy, which currently does not exist.
[10:42:27] ok
[10:43:01] i'm just figuring out how to score labels in search results
[10:43:29] i have it where we stop putting the giant text blob in and instead put in a structured list of labels and descriptions :)
[10:44:12] but now how to adjust the scoring more nicely :)
[10:44:42] * aude runs away :)
[10:44:58] @aude I am lying in bed :/
[10:45:03] :(
[10:45:30] scoring is quite a complex problem, I'm afraid :/
[10:45:35] i also want to check things in the office for our mapping event tomorrow
[10:45:43] @aude we could arrange a meeting next week for the scoring
[10:45:44] e.g. make sure i know how to set up the projector
[10:58:35] GerardM, hoi
[10:58:54] sjoerddebruin, it looks like all the items were already processed
[10:59:16] so the next question of notability
[11:00:07] https://en.wikipedia.org/wiki/Free_Software_Directory what about importing the projects from the free software directory
[11:00:19] it is a semantic media wiki
[11:00:53] are we going to get into notability issues on smaller projects?
[11:03:28] so I will start to first enrich the existing articles with information found
[11:03:41] before creating new project pages.
[11:14:13] hoo: another PS to review ;)
[11:14:56] Amir1: Will have a look in a bit
[11:15:01] :)
[11:16:06] great :)
[11:30:16] Amir1: I can't test the script myself, really (without live hacks or setting up the actual service), so you will have to do manual testing yourself
[11:30:50] I do the testing for each patchset
[11:31:30] hoo: regarding moving the cache and score to constructor, strangely when I want to run it, it returns not found error
[11:32:11] but it works in execute function
[11:32:35] That's strange... I think doMaintenance includes the autoloader before doing that
[11:33:13] but could be that it constructs first, hm
[11:34:01] well, ok then… the maintenance stuff is pretty awry... for other things we have factory method support
[11:34:03] but not for this
[11:34:11] so no way to properly inject anything
[11:35:54] so what do you suggest?
[11:36:19] should be there, or load autoload classes in construct?
[11:36:42] Implement get methods that lazy init the fields or just make them function scope variables
[11:36:59] I think that's the cleanest option we have
[11:37:02] * options
[11:48:18] hoo: Done
[12:19:57] I'll arrive in a few, cu in the office o/
[12:20:42] hoo: added
[12:20:52] will look after lunch
[12:20:53] thanks
[12:21:10] sure :)
[12:21:12] thanks
[12:36:59] Wondering if my internet is slow or Wikidata...
[13:42:31] Hellooo everybody. Lovin IRC
[13:45:04] Now we know why Einstein always quoted, "No Problem". The C was a Si.
[14:26:54] hoo: can you have a look at https://gerrit.wikimedia.org/r/#/c/270283/ ?
[14:27:07] I wonder if that is actually correct, but it would be cool to get rid of that setting
[14:27:36] benestar: Will do
[14:53:13] hoo: when you are already doing reviews, would you mind looking at some more of my patches?
[14:53:21] all about EntityDocument stuff
[14:54:41] benestar: Already doing ;)
[14:55:14] thanks!
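Note on the 11:36 suggestion ("get methods that lazy init the fields"): the patch under review is a PHP maintenance script, so the Python below is only a language-neutral sketch of the pattern. The class name, the `cache_factory` parameter and the method names are all hypothetical. The idea is that the constructor stores nothing that needs the autoloader, and the dependency is built on first use.

```python
from typing import Callable, Optional


class ScoreMaintenanceScript:
    """Hypothetical stand-in for a maintenance script; not MediaWiki code."""

    def __init__(self, cache_factory: Callable[[], dict]) -> None:
        # Only remember how to build the cache; do not build it yet, since
        # the classes it needs may not be loadable at construction time.
        self._cache_factory = cache_factory
        self._cache: Optional[dict] = None

    def get_cache(self) -> dict:
        """Create the cache on first access and reuse it afterwards."""
        if self._cache is None:
            self._cache = self._cache_factory()
        return self._cache

    def execute(self) -> dict:
        # By the time execute() runs, everything needed is available.
        return self.get_cache()
```

The alternative mentioned in the same message, function-scope variables, would simply build the cache inside `execute()` and keep no field at all.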
[15:10:16] hoo: done :)
[15:10:25] Will have a look, thanks :)
[15:23:55] Amir1: I think we're good now :)
[15:24:03] Only have some style stuff left
[15:24:18] awesome
[15:24:22] I'm fixing them
[15:24:29] I test it every time. don't worry
[15:29:11] hoo: I fixed
[15:33:26] here you go :)
[15:33:52] yay
[15:33:53] thanks
[16:16:54] DanielK_WMDE + everyone, reminder https://lists.wikimedia.org/pipermail/wikitech-l/2016-February/084770.html
[16:17:03] no more jenkins for a while
[16:25:07] DanielK_WMDE: https://lists.wikimedia.org/pipermail/wikitech-l/2016-February/thread.html
[19:39:14] a quick question: Does anyone here know what proportion of edits in Wikidata are made by bots?
[19:39:15] overall
[19:39:21] not this month
[19:39:57] aude, Lydia_WMDE ^
[19:40:03] I need the number urgently :(
[19:41:06] Amir1: bot is recorded in recent changes table but not in revision
[19:41:06] Amir1: 8:40pm on a friday is a bad time for a question like that
[19:41:30] Based on https://grafana.wikimedia.org/dashboard/db/wikidata-edits I would say half of the edits
[19:41:32] i could tell you from the recent changes table, but that's not overall
[19:41:48] Amir1: by "overall" you mean what percentage of revisions was created by a bot, ever?
[19:42:22] we can only guess by looking at which users now have the bot flag. but that doesn't mean they did back then, and it doesn't mean that edit was a bot edit.
[19:42:29] DanielK_WMDE: I have a research paper deadline in several hours
[19:42:40] yes
[19:42:46] Amir1: then use this month's number
[19:42:56] okay
[19:43:10] can you run the query?
[19:43:16] I think it's easy
[19:43:16] https://phabricator.wikimedia.org/P2611
[19:43:16] rc_bot = 1
[19:43:16] thanks
[19:43:22] yes
[19:43:23] from the recentchanges table
[19:43:31] yes
[19:49:04] 13091793 has been made last month
[19:49:08] *edits
[19:49:43] 54%
[19:49:51] I was close. ;)
[19:51:10] :P
[19:51:11] Amir1: try weighting by abs( rc_new_len - rc_old_len )
[19:51:44] i.e. what part of the content (in bogo-bytes) did bots create
[19:52:07] hmm, let me give it a try
[19:52:11] I will let you know soon
[19:54:14] DanielK_WMDE: do you have the query?
[19:54:48] not handy, no. and it's not that important to me. I thought it might be interesting to you.
[19:55:20] it is
[19:55:32] well, actually, can't you just do SELECT SUM( abs( rc_new_len - rc_old_len ) ) FROM recentchanges?
[19:55:47] sigh https://www.wikidata.org/w/index.php?title=Q282275&action=history
[19:55:52] I get it
[20:05:54] DanielK_WMDE: sql wikidata "SELECT SUM( abs( rc_new_len - rc_old_len ) ) FROM recentchanges;": 3939503666
[20:05:54] same for bots: 2337842671
[20:06:24] 59%
[20:07:27] same ballpark. sounds about right
[21:21:51] Does anybody remember - there was a project for making command-line tool to run queries against wikidata sparql endpoint?
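Note on the 59% figure quoted at 20:06: it can be checked from the two sums posted at 20:05. The Python sketch below only redoes that arithmetic. The function name is hypothetical, and the two constants are copied from the log, with the bot total presumably coming from the same query restricted to rc_bot = 1.

```python
def bot_share(bot_total: int, overall_total: int) -> float:
    """Return the fraction of the overall total that is attributed to bots."""
    if overall_total <= 0:
        raise ValueError("overall_total must be positive")
    return bot_total / overall_total


# Sums of abs(rc_new_len - rc_old_len) as posted in the channel.
ALL_CHANGED_BYTES = 3939503666
BOT_CHANGED_BYTES = 2337842671

if __name__ == "__main__":
    share = bot_share(BOT_CHANGED_BYTES, ALL_CHANGED_BYTES)
    print(f"{share:.0%}")  # prints 59%
```

Both numbers cover only what the recentchanges table still holds, so this is a recent-activity share and not an all-time figure, as noted at 19:41.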