[08:12:56] @meanwhile the first place where we were going through some sort of thing I want [08:15:34] maybe I should get out [08:15:56] ! [12:13:16] French Wikisource starts using Wikidata for the header template on Author pages :) E.g. https://fr.wikisource.org/wiki/Auteur:Henri_Mouhot [12:15:28] Pyb: this is great!+ [12:21:29] well done Pyb [12:22:52] Do we have the number of main snaks that are wikibase-items? [12:23:06] addshore: ^ [12:23:34] Added to https://fr.wikisource.org/wiki/Cat%C3%A9gorie:Mod%C3%A8les_utilisant_les_donn%C3%A9es_wikidata [12:23:51] hoo: not that I know of [12:24:16] addshore: mh... do you have a rough estimate at hands? [12:24:18] oh hoo, a new one https://grafana.wikimedia.org/dashboard/db/wikidata-edits [12:24:28] hoo: I dont even have an estimate :/ [12:25:12] hoo: I could work it out from finding all properties that have datatype item and then counting all items that have claims with that prop [12:25:22] hm [12:25:44] I'm just trying to figure memory requirements for something [12:26:00] I'll calculate with 100M then to play it safe [12:26:02] hmmm [12:26:04] it'll grow anyway [12:26:17] that'll probably give me a hear attack, though [12:26:34] :D [12:27:01] 670 properties are of item type [12:29:52] hoo: how if your sparql foo? [12:30:10] I could try that [12:30:20] this lists all properties that are items SELECT ?s WHERE {?s wikibase:propertyType wikibase:WikibaseItem} [12:30:30] this lists all statements that use a given proeprty SELECT (count(distinct(?s)) AS ?scount) WHERE {?s wdt:P373 ?o} [12:30:54] just got to combine the two right ;) in SQL I would do a WHERE IN ( stuff ), just gotta see if thats possible in sparql [12:31:04] I have a feeling it may not work though ;) [12:31:45] I'm just going to guess [12:31:49] :D [12:31:59] tarrow: how is your sparql foo coming along? [12:32:02] it's probably not going to be nice, anyway [12:32:28] What do you need? [12:32:35] read up ^^ ;p [12:33:02] and IN or something doesnt work I wonder if there is any nice way to run a looped query and union :P or if this should be in a script rather than a single sparql query... [12:33:47] um... [12:34:36] so you want to find all properties that have a datatype item and then find all of those items with that type of property? [12:34:53] yup ;) [12:35:04] addshore: If I don't grossly miscalculate my stuff, 100M such snaks should be "fine" [12:35:40] let me have a look [12:35:58] right now a script seems nost realistic, but would be cool if it could be done in a single query [12:36:51] I think it can be [12:37:02] is it any data type? [12:37:10] or just wikibaseItem? [12:37:15] SELECT ?s WHERE {?s wikibase:propertyType wikibase:WikibaseItem} [12:37:18] just Wikibase Item [12:37:22] for now ;) [12:37:25] sure [12:38:18] and any property? [12:39:14] well, where the property is a WikibaseItem / has the datatype wikibaseitem :P [12:40:10] im guessing a filter? or something [12:58:43] DanielK_WMDE: https://phabricator.wikimedia.org/T119707 [13:01:45] benestar|cloud: https://phabricator.wikimedia.org/T119707 [13:02:42] jzerebecki: cant view it [13:04:47] ty [13:54:53] I think there could be something broken here: https://www.wikidata.org/wiki/User:Joshbaumgartner/property_available_summary/2400-2499 [14:29:13] Pyb: \o/ [15:37:11] DanielK_WMDE: #wikimedia-releng [17:37:19] Friday is the last day: (╯°□°)╯︵ ┻━┻ https://www.wikidata.org/wiki/Wikidata:Wikimania_2016 [17:45:46] SMalyshev: Is it just me or is WDQS dead? [17:45:59] shouldn't be... let me see [17:46:07] I might have killed it myself, though (by doing some awful queries) [17:48:48] don't kill it, there is a workshop about wikidata & sparql in 1 hour ;) [17:48:56] :P [17:49:07] ah, it just responded [17:49:19] yeah definitely appears to be slow but let me see why [17:49:28] It's back to normal now [17:49:32] Running query count=29 [17:49:42] somebody is having fun there... [17:49:48] Might have been me [17:49:59] well, in case my queries kept running after I got timeouts from nginx [17:50:40] the query should time out after 30 secs... but that doesn't prevent one to run 100 of them in parallel... [17:50:51] so far we don't have real concurrency controls there [17:51:07] ok [17:52:55] Running query count=32 [17:53:05] hoo: are you sending new ones? [17:53:28] Yeah, but not more than one at a time [17:53:32] always wait for them to time out [17:53:47] or error out, fwiw [17:55:01] ok, let me restart it maybe... it looks like there are so many of them that the engine can't time them out in time... will need to investigate that [17:55:10] Ok [17:55:18] I'll be more careful after [18:01:57] hoo: I'm still nseeing a ton of parallel queries [18:02:05] PREFIX wikibase: PREFIX wd: PREFIX wdt: PREFIX rdfs: PREFIX p: PREFIX v: prefix schema: Select ?jeu ?jeuLabel ?jeuarticleen ?jeuarticlefr { [18:02:05] ?jeuarticleen schema:about ?jeu . ?jeuarticleen schema:inLanguage "en" . ?jeuarticleen wikibase:badge wd:Q17437796 . ?jeu wdt:P279*/wdt:P31 wd:Q7889 . filter not exists { ?jeuarticlefr schema:about ?jeu . ?jeuarticlefr schema:inLanguage "fr" . ?jeuarticlefr wikibase:badge wd:Q17437796 . } SERVICE wikibase:label { bd:serviceParam wikibase:language "fr" . } } [18:02:11] is this yours? [18:02:15] Nope [18:02:35] ok, then somebody else is DOSing it [18:03:54] Accepted query count=157 [18:03:56] wow [18:04:19] Running query count=15 [18:05:45] hmm... somebody really needs some data NOW [18:09:19] :P [18:17:11] SMalyshev: how do you get those numbers? ;) [18:17:48] addshore: it's internal GUi for blazegraph [18:18:06] *wonders why that isnt one of the stats it spits out internally too* :P [18:18:19] addshore: yes, it's one of the counters [18:18:26] is it O_o [18:19:15] SMalyshev: Do you think my queries were related to the troubles (I was only running a small amount of them, as said earlier) [18:19:30] Can non-parallel queries actually be a problem [18:19:32] ? [18:19:40] hoo: no it looks like somebody else is doing it [18:19:49] hoo: because it keeps coming [18:19:51] Ok [18:20:29] Do we have any instance in labs still running that can be used for testing? [18:24:09] hoo: there's http://wdqs-test.wmflabs.org/ but it is out of date and being used for debugging some other issue right now... so it may bounce up and down [19:13:06] hi [19:13:06] is the sparql endpoint down ? [19:13:08] we are at a Wikidata workshop about sparql, if we cannot use it it's not good [19:13:08] Looks like it, yes [20:04:27] Friday is the last day: (╯°□°)╯︵ ┻━┻ https://www.wikidata.org/wiki/Wikidata:Wikimania_2016 [20:21:46] Wow, wikidata has a lot of circles :P [20:21:47] https://tools.wmflabs.org/hoo/Wikidata-circle-finder.php [20:30:05] moved to https://tools.wmflabs.org/hoo/Wikidata-cycle-finder.php [20:32:15] hoo: would be nice if the results were links :) [20:33:05] nikki: Yeah, indeed [20:35:12] hey hoo and nikki, do you know if dates are supposed to be localised? I'm getting "1660s" even when I set my language to Dutch [20:35:56] multichill: Where? [20:36:07] Wikidata on an item [20:36:24] they are, but it's obscure how it works [20:36:29] so could be, that that is expected [20:36:31] doubt it, though [20:36:32] I remember being annoyed by the weird way the dates were displayed for japanese [20:37:15] nikki: Done :) [20:37:19] Now off to have some food [20:37:36] so in my experience they're partly localised, but not properly [20:37:52] thanks hoo :) [20:43:50] heh, yeah, for japanese I get things like "dd mm月 yyyy" instead of "yyyy年mm月dd日", it seems like it just translates the month name and puts it where the english word goes [20:46:42] and even for english it still displays things like "20th century" as "20. century", which looks like denglisch [20:47:10] Hmm [20:47:19] I feel an i18n coming up [20:49:14] ah, I think it's https://phabricator.wikimedia.org/T63958 [20:50:32] Looks like it. [20:51:01] Bug should be blocked by the task where MwDateFormatParser is implemented [20:51:33] Can't find that one [21:08:10] I'd like to hear thoughts/opinions/about consensus on what to do in cases like: [21:08:10] If we have an object for a wikipedia article for something like "is a website and the associated software it runs on", [21:08:10] do we create an object for the site and one for the system that runs it? [21:09:07] wspecially: why should i not do that? [21:11:17] each item should have one specific nature [21:11:25] I would create two items [21:11:25] if wikidata is really primarily about storing interwiki links, then i should rather not - right? [21:11:33] rather not split [21:11:35] one P31 software / the other P31 website [21:11:44] Who told you such a lie? [21:11:49] :-) [21:12:09] "Wikidata is a free linked database that can be read and edited by both humans and machines." [21:12:22] We're not responsible for the homonimies of Wikipedia projects. [21:12:59] i think i remember something from a page like "project goals" or the like. [21:13:19] Yes, one of the main goal is to store interwiki links. [21:13:38] but not at the cost of a lack of precision of the items [21:13:44] it was talking about "phases" of the project and its use or something. [21:14:03] so - where do i put the wikilinks then? [21:14:13] The relevant question in your case is "should we anchor the interwiki to the website item or the software item?" [21:14:16] it refuses to put them with both items [21:14:28] indeed. :-) [21:14:31] okay we concur about the real question :) [21:14:51] dices?.. [21:15:14] For stuff like Reddit, website seems currently to be prefered. [21:15:30] well, in the one case at hand right now the article is mainly about one aspect - but i've seen other cases. [21:16:01] You could for each Wikipedia article evaluate if th earticle is mainly about website or software [21:16:08] and link accordingly [21:16:15] this can also change. [21:16:26] Yes, you can split the interwiki links to disambig. [21:16:39] What's the website here? [21:17:04] i'm currently thinking about WikiWikiWeb [21:17:18] given that wikidata items are the basis of the interwiki links, we can't just ignore the fact that lots of wikipedia articles don't split things up like we want to :/ [21:18:02] en.wikipedia infobox chosen is the website one [21:18:19] fr. doesn't speak about the software [21:18:49] but as a template for the wiki software ... [21:18:51] how coherent [21:19:09] pt. speaks about the software [21:19:23] ru. website [21:19:27] i mean, if i'm on an article on a website and a software and i want to jump to the equivalent in another language - and in that other language they have seperate articles... :-/ [21:19:42] split the links according the major topic of each article [21:19:50] and I bet people on ptwiki want interwiki links to frwiki too [21:20:13] there's cases when you have a company named after their main product where it's much more ambiguous than with Wiki. [21:20:50] tricky. [21:20:51] If you really really really really really want to keep these interwikis together, link them all to the website [21:21:00] If you want purity of data first, split them [21:21:06] In both case, create two items [21:21:37] i'm grateful for the insight, people! :-) [21:22:14] I sympathize, this kind of question is though [21:32:23] don't we also have this thing that notability of an object is usually proven through interwiki links? too bad if they have to be with the other object on the main topic of that article... [21:37:04] there only needs to be one sitelink, and sitelinks aren't actually required, there are other ways for something to be notable [21:38:35] so it's not really a problem for us if we link one page to one item and another page to another item, but it might be a problem for the wikipedias (or other projects) if they want those pages to be interwiki links [22:06:10] Friday is the last day: (╯°□°)╯︵ ┻━┻ https://www.wikidata.org/wiki/Wikidata:Wikimania_2016