[07:36:08] Good morning. :)
[07:37:29] multichill: to answer my question last night, yes mix-n-match resolves redirects.
[07:38:41] Good to know
[07:45:11] hello!
[07:45:29] Harmonia_Job: hi! :D
[07:45:33] Another 10k bytes, I saw.
[07:45:45] yep
[07:45:59] the constraint violation page doesn't fully update every day
[07:46:09] Yeah, I know.
[07:46:18] so some errors corrected days ago still show up on the page
[07:46:26] but still, I'm progressing
[07:46:29] Hello!
[07:46:51] * Harmonia_Job want to come under the thousand violation
[07:46:53] s
[07:47:29] I'm going to work on adding more, to legit items. :)
[07:48:09] sjoerddebruin: I wanted to do a query to see all unused correct family name items
[07:48:18] those with only 1 P31
[07:48:25] Or low-used.
[07:48:30] so that all items are used at least once
[07:48:41] P734, you mean?
[07:48:54] no no, P31
[07:49:10] so we don't query all those thousands of "family name/disambiguation page"
[07:49:25] items with a P31 → family name and not used as a value of P734 elsewhere.
[07:49:26] but yes, these items should be used as value for P734
[07:49:29] Hm
[07:49:55] I want all legit and clean family name items to be used as value for P734 somewhere
[07:50:16] so that we have a chance to catch on when someone does wrong merges
[07:50:31] unused items are harder to spot
[07:50:55] At least https://www.wikidata.org/wiki/Q26903397 is used :)
[07:51:05] (broken label though)
[07:52:10] ^^
[07:53:12] Harmonia_Job: http://tinyurl.com/gtbjusm ?
[07:53:38] hi, where do I add a new catalog to mixnmatch?
[07:53:58] I can't find the link anywhere in the tool (https://tools.wmflabs.org/mix-n-match/)
[07:54:06] or in manuals :/
[07:54:11] Ash_Crow: http://tinyurl.com/hdwd5nk
[07:54:23] https://tools.wmflabs.org/mix-n-match/import.php
[07:54:26] Wesalius: ^
[07:54:34] Ash_Crow: you didn't filter by number of P31
[07:54:39] oh
[07:54:46] sjoerddebruin: thank you
[07:55:29] Hm, autoedit doesn't work here....
[07:55:32] Ash_Crow: we still have thousands of family name/disambig
[07:57:23] Ash_Crow: hmm, my query is too heavy :'(
[07:57:31] Does autoedit work for you? https://www.wikidata.org/wiki/Q27252782
[07:58:10] Harmonia_Job: with a subquery it doesn't time out: http://tinyurl.com/gn2fv9r
[07:58:26] ah good!
[07:58:38] I don't think of subqueries
[07:58:49] Ash_Crow: good SundayQuery, you think?
[07:59:25] yes, it is an interesting case of why subqueries can be useful
[08:00:03] * sjoerddebruin feels ignored again. :(
[08:00:13] Ash_Crow: can you tweet me the query by private message?
[08:00:40] sjoerddebruin: can't work
[08:00:44] duplicate of https://www.wikidata.org/wiki/Q23777456
[08:00:49] I'll merge
[08:00:50] Ahhhhh
[08:01:00] So it doesn't report conflicts and stays on spinner
[08:01:19] yes, i already begged jitrixis to add an error message
[08:01:25] he says he'll add it
[08:01:41] no idea when
[08:01:52] Will add it to the Phabricator project too then.
[08:02:02] but when it doesn't work, it's always a conflict somewhere
[08:02:33] Harmonia_Job: done
[08:02:44] Ash_Crow: thank you!
[08:04:29] [10:02:46] yes, i already begged jitrixis to add an error message → (this reminds me that I never announced it on international lists/channels, but Jitrixis is currently an intern at Wikimédia France, until the end of the month)
[08:04:57] Stab him for all the bugs. :P
[08:05:24] :D
[08:05:33] he wants to redo all his old tools
[08:05:37] datadrainer too
[08:05:47] Hopefully in OOjs.
[08:05:55] maybe nameguzzler? (but this one works without bugs, for me)
[08:06:02] sjoerddebruin: yep, 'xactly
[08:06:09] sjoerddebruin: that what he plans, yes
[08:06:24] that's*
[08:06:42] .+nom Puller
[08:06:53] wrong chan :p
[08:06:55] oups, wrong chan
[08:07:29] Nameguzzler could be obsolete if we can add terms for a new language for every item...
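Note on the 07:48 to 07:59 exchange: the queries behind the tinyurl links are not reproduced in this log, so what follows is only a guess at the general shape of a subquery-based version, not the query that was actually shared. It assumes Q101352 is the "family name" class, uses the P31 and P734 properties named in the chat, and relies on the wd:/wdt: prefixes that the Wikidata Query Service predefines. The helper name unused_family_names_query is made up for illustration.

```python
# Hypothetical sketch: build a SPARQL query for family-name items that have
# exactly one P31 statement and are never used as a P734 value.
FAMILY_NAME_CLASS = "Q101352"  # assumed to be the "family name" class


def unused_family_names_query(limit: int = 100) -> str:
    """Return the query text; running it against the endpoint is left out."""
    return f"""
SELECT ?name WHERE {{
  {{
    # Subquery: narrow down to items whose only P31 is "family name",
    # so the "family name/disambiguation page" mixes are excluded early.
    SELECT ?name WHERE {{
      ?name wdt:P31 wd:{FAMILY_NAME_CLASS} .
      ?name wdt:P31 ?class .
    }}
    GROUP BY ?name
    HAVING (COUNT(DISTINCT ?class) = 1)
  }}
  # Outer filter: keep only items no statement uses as a P734 value.
  FILTER NOT EXISTS {{ ?person wdt:P734 ?name . }}
}}
LIMIT {limit}
""".strip()


if __name__ == "__main__":
    print(unused_family_names_query(limit=10))
```

The point of the inner SELECT is that the expensive "not used anywhere" check only runs on the already reduced set of clean items. Whether this particular form avoids the timeout is not something the log confirms.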
[08:07:32] (for those who don't know, on #wikidata-fr, you can ask a bot to create given name and surname items for you)
[08:07:32] Ehm no wait
[08:07:41] I'm confusing it with the labellister
[08:08:40] not the same :p
[08:08:53] Need caffeine.
[08:16:14] * Harmonia_Job offers tea to sjoerddebruin
[08:16:32] :)
[13:10:33] Hm, Petscan seems to have an issue with regex and diacritics. I've added a surname, but it didn't include a bunch of items.
[13:12:33] Oh, it's my regex.
[13:15:33] *fixed*
[13:21:29] Ehm, why is "Van Den Bossche" the same as "Van den Bossche"?
[13:22:55] because Wikidata doesn't distinguish upper and lower case?
[13:23:57] :/
[13:23:58] same problem with "Mcdonald" and "McDonald"
[13:24:05] etc.
[13:24:41] Ugh
[13:24:50] right now, I have enough errors to correct that I don't care about distinguishing those correctly
[13:25:27] (like Chinese people using a Korean surname because the transliteration is the same)
[13:25:31] (urggh)
[13:27:37] sjoerddebruin: we should make a subpage where we list all those upper/lower case surnames
[13:27:45] and not care right now?
[13:27:58] I dislike postponing. :(
[13:28:00] I mean, we still have thousands of disambig/family names
[13:28:09] of family used as surnames
[13:28:12] etc.
[13:28:16] Lydia_WMDE: why are the term constraints so strict?
[13:29:02] (and of course, we should verify all names with an "imported from:Italian Wikipedia" as reference)
[13:29:18] (because itwiki is a total mess on this subject)
[13:29:29] I'll have the same issue when we decide to use items for the "tussenvoegsels" + surname solution.
[13:29:54] (I think a new property for them is better regarding sorting, but will discuss this on the 29th)
[13:29:57] sjoerddebruin: it's an issue which has existed since 2013 for disambiguation pages
[13:30:22] disambig pages are frequently distinguished only by upper/lower case...
[13:30:39] and Wikidata doesn't know what to do with that
[13:31:29] Those term constraints are just too strict. Why is "tHIS" the same as "this"?
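Note on the 13:27:37 suggestion of a subpage listing upper/lower case surname variants: a minimal sketch of how such a list could be drafted from a set of labels that has already been exported. The function name case_variant_groups and the sample labels are made up for illustration, and nothing here reflects how Wikidata itself compares terms.

```python
from collections import defaultdict


def case_variant_groups(labels):
    """Group labels that differ only by letter case.

    Returns a list of sorted groups, keeping only groups with more than one
    distinct spelling (those are the candidates for the subpage).
    """
    groups = defaultdict(set)
    for label in labels:
        # casefold() is a more aggressive lower() meant for caseless matching
        groups[label.casefold()].add(label)
    return [sorted(variants) for variants in groups.values() if len(variants) > 1]


if __name__ == "__main__":
    sample = ["Van Den Bossche", "Van den Bossche", "McDonald", "Mcdonald", "Smith"]
    for group in case_variant_groups(sample):
        print(" / ".join(group))
```

Labels with a single spelling are dropped, so the output only contains the pairs that need a human decision.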
[13:32:20] no idea
[13:32:50] but as I still have months of work to correct current family name errors
[13:32:57] I don't really care *right now*
[13:33:01] Months? :O
[13:34:04] yes
[13:34:28] sjoerddebruin: the same wrong item is only used 7 times at the most
[13:34:49] so now, I correct link by link...
[13:38:36] and every time, i need to verify if it's a disambig page, if it's a family name, if the interwikis mix the two and need to be separated, if I need to create a new item, then link disambig/surname with P1889 and correct the wrong uses
[13:38:45] it takes time...
[16:52:34] hi, can somebody merge https://www.wikidata.org/wiki/Q13681#sitelinks-wikipedia with https://www.wikidata.org/wiki/Q660757#sitelinks-wikipedia please?
[16:53:04] * DennisRoczek|afk has no knowledge of how to use wikidata. I was quitting wikipedia at the time when wikidata was introduced
[17:09:12] DennisRoczek|afk: they're not the same thing, they must not be merged :/
[17:12:34] it's not clear to me what the difference is
[17:12:57] or rather, it's not clear to me what's going on there
[17:13:11] they have the same commons link, the pictures on the german and ukrainian pages are similar, the french page has a picture of something completely different
[17:14:04] then the italian page on one of the items is a redirect to the page on the other item
[17:14:05] the french page has a picture of something defined as "not it", because reasons
[17:14:30] in french, there's a difference between pied-de-biche and pince-monseigneur, albeit a small one
[17:14:49] the notions can overlap, but they don't have to
[17:15:02] a pied-de-biche is short, and has a split business end
[17:15:27] a pince-monseigneur is long, and has a slightly curved business end
[17:16:28] so a short bar with a split and curved business end is both, but a long bar without split business end is only a pince-monseigneur, whereas a short one, very curved, without a split end is only a pied-de-biche
[17:17:03] and yes, 
even when it comes to thieving, we're pedantic :p
[17:37:03] sjoerddebruin: At what number are we going to mass create the missing items for P&P? ;-)
[17:39:14] Alphos: thx
[19:27:37] multichill: not soon please :(
[19:40:25] sjoerddebruin: I think with MEP we did it for the last 200 or something like that
[19:44:19] multichill: maybe if all
[19:44:25] tweede kamer etc
[20:38:32] Asking random channel inhabitants: what are your thoughts on Wikidata schema? The idea being that if something is an instance of something else, it *must* have properties/values X, Y, Z, it *can* have properties/values A, B, C, and it *must not* have properties/values D, E, F.
[20:39:25] And by encoding this as a schema, we can have automated compliance tests.
[20:40:24] This would allow us to have more sophisticated compliance requirements than what current constraint violation reports allow, particularly for complex data models like we have to deal with in Source Metadata land.
[20:52:28] Did you already discover the complex constraints, hare?
[20:53:44] I have not
[21:15:53] hare: See for example https://www.wikidata.org/wiki/Wikidata:Database_reports/Complex_constraint_violations/P650 . The talk page of the property contains the setup
[21:55:48] hare: tbh there are already some absurdities in wikidata, most notably with part of / has part
[21:57:00] for instance has some subclasses or instances as object of a statement, which makes no sense whatsoever
[22:26:22] sjoerddebruin