[10:36:02] Hi all, is there a guide on how one should create/maintain data on Wikidata? For example, what things need special attention, or how data quality can be assured, etc.
[12:17:39] arizcraf: as far as i know there is no global policy. WikiProjects https://www.wikidata.org/wiki/Wikidata:WikiProjects create their own strategies.
[12:17:39] What kind of data are you going to add?
[12:24:04] The question is more about general data maintenance. But in my example I'd be working with GLAM data
[12:36:48] https://m.wikidata.org/wiki/Category:GLAM_WikiProjects
[12:37:16] There are some projects out there.
[12:39:25] To be honest your question is a bit too broad.
[12:41:26] BTW are you going to create a bot or import data manually?
[12:47:44] the question is meant to be broad
[12:48:15] Trying to figure out how data import can be managed in general
[13:23:32] I'm also new here and i hope i'm wrong but afaik you need to make many decisions yourself.
[13:23:32] You may look at the source code of bots used in previous imports, look into the discussions around them...
[14:12:42] Would "according" and "accordingly" be different lexemes?
[14:13:17] What about "abandon" and "abandonee"?
[14:52:38] smitop: they would have different lexical categories, so yes
[22:32:38] PROBLEM - High lag on wdqs1003 is CRITICAL: 3619 ge 3600 https://grafana.wikimedia.org/dashboard/db/wikidata-query-service?orgId=1&panelId=8&fullscreen
[22:39:15] hmm something is happening on Wikidata... huge spike of edits?
[22:40:10] oh yes http://wikidata.wikiscan.org/hours/6/pages
[22:40:17] http://wikidata.wikiscan.org/utilisateur/XabatuBot
[22:40:23] or https://www.wikidata.org/wiki/Special:Contributions/XabatuBot
[22:40:33] doing 474 edits per minute
[22:40:44] huh
[22:41:31] this seems to be not the only one
[22:41:48] I see in recent changes a bunch of other stuff going on
[22:43:11] I wish people would space out such things... wdqs is right now completely overloaded
[22:43:23] maybe https://www.wikidata.org/w/index.php?title=Special:Contributions/CyclingInitBot&offset=&limit=500&target=CyclingInitBot
[22:43:45] mmm nope
[22:44:04] i would still say XabatuBot
[22:44:26] 135k edits in 12 hours is still nuts
[22:44:53] Hogü-456 seems to be doing some huge upload
[22:46:04] it should be one user according to https://grafana.wikimedia.org/d/000000170/wikidata-edits?refresh=1m&panelId=9&fullscreen&orgId=1 i guess?
[22:46:17] mostly labels and descs
[22:46:31] no quickstatements
[22:46:47] wow that's a big peak
[22:47:06] i could try blocking that bot
[22:47:43] I'd prefer whoever is doing this to space it out...
[22:48:05] is that edit rate per minute?
[22:48:25] 474 edits per minute over 6 hours
[22:49:20] yeah and way over the average load I see. Is that all the XabatuBot?
[22:49:41] based on wbsetlabel on https://grafana.wikimedia.org/d/000000170/wikidata-edits?refresh=1m&panelId=4&fullscreen&orgId=1, i would assume
[22:50:30] other users do set labels/descs too... like https://www.wikidata.org/wiki/Special:Contributions/Hog%C3%BC-456
[22:50:45] but I am not sure if I can see rate anywhere
[22:50:45] But quickstatements should be ratelimited
[22:51:08] is it? I see a lot of incoming per minute
[22:52:19] sjoerddebruin: could we temporarily block the bot and see how it changes?
[22:52:26] I've already done that
[22:52:33] oh ok :)
[22:52:46] if https://grafana.wikimedia.org/d/000000170/wikidata-edits?refresh=1m&panelId=4&fullscreen&orgId=1&from=now-3h&to=now isn't lagged it should have had some impact
[22:52:58] yeah edits seem to be dropping
[22:53:06] let's see where they stop
[22:57:00] the problem seems to have started about 2.5 hrs ago, before it was ok...
[22:57:20] same pattern can be seen on http://wikidata.wikiscan.org/gimg.php?type=edits&date=6&size=big
[23:08:55] yep edits are back to normal
[23:10:33] RECOVERY - puppet last run on wdqs1003 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures
[23:11:44] SMalyshev: good to hear!
[23:12:06] https://grafana.wikimedia.org/d/000000489/wikidata-query-service?orgId=1&from=now-6h&to=now&panelId=8&fullscreen seems still rising tho :/
[23:12:56] sjoerddebruin: it still needs to go through the 2.5 hour backlog...
[23:13:04] *shrugs*
[23:33:19] PROBLEM - Check systemd state on wdqs1010 is CRITICAL: CRITICAL - degraded: The system is operational but one or more units failed.
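
The wish in the log that bulk editors "space out" their edits can be illustrated with a fixed-interval pacer. This is only a minimal sketch under stated assumptions: `paced`, `min_interval` and the `submit_edit` callable are hypothetical names made up for the illustration, and nothing here reflects how XabatuBot, QuickStatements or any other tool mentioned above actually works.

```python
import time
from typing import Callable, Iterable, TypeVar

T = TypeVar("T")


def min_interval(edits_per_minute: float) -> float:
    """Seconds to wait between edits to stay at or below the given rate."""
    if edits_per_minute <= 0:
        raise ValueError("edits_per_minute must be positive")
    return 60.0 / edits_per_minute


def paced(
    items: Iterable[T],
    edits_per_minute: float,
    submit_edit: Callable[[T], None],  # hypothetical: performs one edit
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> int:
    """Call submit_edit for each item, never faster than the target rate.

    Returns the number of edits submitted.
    """
    interval = min_interval(edits_per_minute)
    next_slot = clock()  # earliest time the next edit may be sent
    count = 0
    for item in items:
        delay = next_slot - clock()
        if delay > 0:
            sleep(delay)
        submit_edit(item)
        count += 1
        # If an edit ran long, do not "catch up" with a burst afterwards.
        next_slot = max(next_slot + interval, clock())
    return count
```

For example, `paced(batch, 60, submit_edit)` would send at most one edit per second. A fixed interval is the simplest choice; a bot talking to the MediaWiki action API could additionally pass the API's `maxlag` parameter so that requests are refused while the servers are lagged, which is not shown here.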