[07:55:40] [[Tech]]; ArchiverBot; Bot: Archiving 1 thread (older than 30 days) to [[Tech/Archives/2016]].; https://meta.wikimedia.org/w/index.php?diff=16202367&oldid=16199648&rcid=8763350
[11:14:17] Nemo_bis I want to exclude edit count in the userspace and user talk space
[11:14:40] I'll fork that though
[11:25:43] ToAruShiroiNeko: for small wikis a subquery is quite fast, but if you want the English Wikipedia you'll need some optimisation. Something like this: https://quarry.wmflabs.org/query/15186
[11:26:47] hmm
[11:27:24] I don't care about the filenames mind you
[11:27:28] I just need the usernames
[11:39:43] Nemo_bis distinct is expensive right?
[11:39:52] dunno
[11:40:09] I limited it to one user
[11:40:10] https://quarry.wmflabs.org/query/15190
[11:40:32] That times out
[11:40:43] But we'll see
[11:40:46] hmm?
[11:40:48] why?
[11:41:05] maybe there is a better way for this query?
[11:41:34] I don't care about users with no uploads or more than say 10
[11:42:11] NOT IN ( ... a list of 867 million numbers ... ) is certainly unlikely to be the most efficient system
[11:42:46] but everyone should have at least one mainspace edit
[11:42:53] DISTINCT isn't expensive if you're ordering by the field you want to make distinct
[11:42:56] actually let's start with that
[11:44:28] what would be a better query for identifying users without mainspace edits?
[11:46:28] vast majority of vandalism only ones can be ignored
[11:46:34] hmm
[11:46:50] file namespace is 8 ?
[11:48:14] 6
[12:00:31] I can't figure this one out :(
[12:08:42] Nemo_bis is this sound? https://quarry.wmflabs.org/query/15191
[12:08:54] I probably want a where restriction to edit count
[18:17:25] can someone generate for me a list of all articles which contain the string ? I'd really appreciate it.
[18:21:11] Dragonfly6-7: on en.wp ? is it possible there is just one?
try this https://en.wikipedia.org/w/index.php?title=Special:Search&profile=default&fulltext=Search&search=%22%3Cref+name%3D\%22%3A0\%22%3E%22&searchToken=ewqjxx7w2qqt76bgn8x2ihvnp
[18:21:32] i used "" in search
[18:23:00] mutante - no, it's widespread.
[18:23:24] that's the only occurrence of that string, *misformatted*
[18:23:37] so it's a search bug that it finds not 0 but also not 100, but exactly 1 random one?
[18:23:43] aaah
[18:24:14] but the general issue is just that you could not search for literal " characters?
[18:24:21] and escaping them as
[18:24:30] and escaping them as \" is still the fix?
[18:24:43] .... I don't know. I don't have the requisite skillset.
[18:25:01] if you could write me some instructions on how to find such pages myself, that'd be good.
[18:25:19] Dragonfly6-7: so if you want to search for a literal " replace it with \" and then put normal " " around the whole thing
[18:26:22] if you want to search for: foo "bar" baz
[18:26:32] enter: "foo \"bar\" baz"
[18:26:43] try that with your strings
[18:27:06] google: " " site:en.wikipedia.org
[18:27:08] would that do?
[18:27:38] * Dragonfly6-7 tries
[18:27:48] that's another thing to try but first try normal wiki search
[18:27:59] the escaping thing would be the same for both, yea
[18:29:20] https://en.wikipedia.org/w/index.php?search=%22%3Cref+name%3D%5C%22%3A0%5C%22%3E&title=Special:Search&go=Go&searchToken=chdvo3qjlzfzeskirnv0jxjas
[18:29:22] doesn't work
[18:29:32] the problem is that the string in question *is not displayed*
[18:29:36] it's in the wikicode
[18:29:42] but not displayed in the article text
[18:31:43] have you tried insource:
[18:33:01] .... I don't know how?
[18:35:13] please expand?
[18:41:50] * Dragonfly6-7 waves arms
[18:58:18] Dragonfly6-7: see https://www.mediawiki.org/wiki/Help:CirrusSearch
[18:59:59] Thank you, that's what I needed.
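The escaping advice given at 18:25 (replace each literal " with \" and then wrap the whole phrase in ordinary quotes) can be sketched in a few lines of Python. This is only an illustration: the helper names `quote_phrase`, `insource_query` and `search_url` are made up for this sketch, the URL parameters mirror the ones visible in the links pasted in this log, and whether the search backend honours \" inside an `insource:` phrase is an assumption worth checking against Help:CirrusSearch.

```python
from urllib.parse import urlencode

# Hypothetical helpers for illustration; none of these names come from MediaWiki.

def quote_phrase(text: str) -> str:
    # Escape each literal double quote with a backslash,
    # then wrap the whole phrase in ordinary double quotes.
    return '"' + text.replace('"', '\\"') + '"'

def insource_query(text: str) -> str:
    # Prefix the quoted phrase with the insource: keyword so the
    # wikitext is searched rather than the rendered article text.
    return "insource:" + quote_phrase(text)

def search_url(query: str, base: str = "https://en.wikipedia.org/w/index.php") -> str:
    # Build a Special:Search URL; urlencode percent-encodes the quotes
    # and backslashes and keeps the "?" that separates path and query.
    params = {"title": "Special:Search", "fulltext": "1", "search": query}
    return base + "?" + urlencode(params)

phrase = quote_phrase('foo "bar" baz')
print(phrase)  # "foo \"bar\" baz"
print(search_url(insource_query('foo "bar" baz')))
```

Building the URL with `urlencode` instead of typing it by hand also avoids dropping the "?" after `index.php`.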
[19:01:25] greg-g: roughly, https://en.wikipedia.org/w/index.phpfulltext=1&search=insource%3A"%2F%2F"
[19:01:33] s/greg-g/Dragonfly6-7/
[19:02:03] first I got a 404
[19:02:09] when I followed the suggested alternate URL
[19:02:15] The requested page title contains invalid characters: "<".
[19:03:40] Dragonfly6-7: doh, because i deleted a ? that needed to be there, https://en.wikipedia.org/w/index.php?fulltext=1&search=insource%3A"%2F%2F"
[19:03:53] An error has occurred while searching: Regular expression syntax error at 14: interval syntax error
[19:04:19] works here, is the search string: insource:"//"
[19:04:40] ... ?
[19:05:26] no results matching the query
[19:05:45] Dragonfly6-7: odd, i get 35k results, and the first page of highlights all look pretty sane
[19:12:50] ah, there we go
[19:13:09] thank you
[21:20:26] Nemo_bis sorry I got badly distracted
[21:20:29] still trying the same issue
[21:25:58] ok
[22:50:03] Nemo_bis I welcome any suggestions :/
[22:50:16] or maybe a query may be run on the dev level since it's too complicated
[23:31:39] who do I talk to about problems with VE?
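On the question left open since 11:44 (a better query for identifying users without mainspace edits), one common alternative to a large NOT IN list is a correlated NOT EXISTS anti-join. The sketch below runs against an in-memory SQLite database with a deliberately simplified toy schema. The table and column names loosely imitate MediaWiki's `user`, `page` and `revision` tables but are assumptions for illustration, the sample rows are invented, and this is not the content of any of the Quarry queries linked above.

```python
import sqlite3

def users_without_mainspace_edits(conn: sqlite3.Connection) -> list[str]:
    # Anti-join: keep a user only if no revision by that user
    # belongs to a page in namespace 0 (mainspace).
    rows = conn.execute(
        """
        SELECT u.user_name
        FROM user AS u
        WHERE NOT EXISTS (
            SELECT 1
            FROM revision AS r
            JOIN page AS p ON p.page_id = r.rev_page
            WHERE r.rev_user = u.user_id
              AND p.page_namespace = 0
        )
        ORDER BY u.user_name
        """
    ).fetchall()
    return [name for (name,) in rows]

# Toy schema and invented sample data (assumptions, not the real replica schema).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE user (user_id INTEGER PRIMARY KEY, user_name TEXT);
    CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_namespace INTEGER);
    CREATE TABLE revision (rev_id INTEGER PRIMARY KEY, rev_page INTEGER, rev_user INTEGER);
    CREATE INDEX rev_user_idx ON revision (rev_user);

    INSERT INTO user VALUES (1, 'Alice'), (2, 'Bob'), (3, 'Carol'), (4, 'Dave');
    -- namespace 0 = mainspace, 2 = User, 6 = File
    INSERT INTO page VALUES (10, 0), (11, 6), (12, 2);
    INSERT INTO revision VALUES
        (1, 10, 1),  -- Alice edits a mainspace page
        (2, 11, 2),  -- Bob edits only a File page
        (3, 12, 3),  -- Carol edits only a User page
        (4, 10, 4),  -- Dave edits mainspace
        (5, 11, 4);  -- and a File page
    """
)

print(users_without_mainspace_edits(conn))  # ['Bob', 'Carol']
```

The subquery is evaluated per user and can stop at the first matching revision, so with an index on the revision's user column it usually avoids materialising a huge ID list. How it behaves on a wiki the size of the English Wikipedia would still need to be measured.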