[03:01:56] zhuyifei1999_: McJohn was asking about a quarry newsletter or mailing list [03:05:14] ik [03:05:19] ok [03:05:37] framawiki answered [18:37:49] Hey guys, could you give me some pointers about how I can access patrol information from api.php ? Eg. I looked at https://www.mediawiki.org/wiki/Special:MyLanguage/API:Revisions but it doesn't seem to mention if it can tell which revisions have been patrolled [18:37:58] Maybe there's a separate way to query the patrol log or something [18:38:24] hm I think it was RC entries that get patrolled rather than revisions b_jonas ? [18:38:50] I don't know how it works [18:39:47] I haven't really been paying attention to this patrol thing [18:43:31] though I'm not sure how you'd look up an RC entry given a rev_id in the PAI [18:43:32] API [18:45:21] hmm, I'll look in https://www.mediawiki.org/wiki/Special:MyLanguage/API:Recentchanges [18:45:33] wait what? [18:45:50] "patrolled: Whether the change is patrolled. Only available to users with the patrol right. [18:45:53] " [18:47:20] Why do I need to be a patroller to _read_ that? [18:49:05] api.php says that too. "error": { "code": "permissiondenied", "info": "You need the \"patrol\" or \"patrolmarks\" right to request the patrolled flag." } [18:51:01] sounds like somebody wanted to err on the side of caution and restricted everything related to patrolling [18:58:53] mutante: but they didn't restrict everything. ordinary users can see if revisions are patrolled in the default HTML interface like https://hu.wikipedia.org/wiki/K%C3%A1rolyi_Amy and https://hu.wikipedia.org/w/index.php?title=K%C3%A1rolyi_Amy&action=history shows that I just made two edits pending review to this page [18:59:23] and Special:RecentChanges lets me filter to changes that haven't been reviewed yet [19:00:48] b_jonas: that sounds odd and like a bug. wanna make a ticket about that? [19:01:12] also https://hu.wikipedia.org/wiki/Speci%C3%A1lis:Elavult_ellen%C5%91rz%C3%B6tt_lapok lists pages that have pending edits, but the other special page that lists pages that have never been patrolled is restricted to editors [19:01:15] it's strange [19:01:44] not yet, I want to poke around first, I don't even know how this whole review thing is supposed to work [19:19:58] b_jonas: huwiki does not use patrolling [19:22:26] the patrol flag is hidden on Special:RC, btw [19:22:41] but Special:Log/patrol isn't so not super consistent [19:23:50] tgr: it wants to use patrolling. pending changes are currently not hidden, because of problems with patrolling, but (at least if you're logged in), the default HTML view puts a banner on the top of every mainspace page if it has pending changes, and the history view shows which revisions are patrolled [19:24:40] tgr: a patroller (an editor with the "editor" group that is, one who can mark revisions as patrolled) mentioned that they can't list pages that have pending revisions starting from the oldest pending revisions, [19:24:57] so they can't catch up with patrolling, and people keep complaining about pages that haven't been patrolled for months or yeras [19:25:27] that's entirely separate from MediaWiki's patrolling feature which is disabled on huwiki [19:25:28] I didn't believe this, and I'm trying to look at the SpecialPages: and api and data dump in case I can get such a list somehow (even if it requires downloading the ten gigs of compressed dumpos) [19:26:09] tgr: oh? I thought there used to be a patrol extension, but then it became a mediawiki feature [19:26:25] so there's two different features for this? [19:26:33] how does the one used on huwiki work then? [19:27:28] huwiki uses FlaggedRevs which was written later than patrolling in core [19:27:43] the API docs are at https://www.mediawiki.org/wiki/Extension:FlaggedRevs#API [19:28:41] I think it also extends the revision API which does not seem documented but Special:ApiSandbox is your friend [19:28:52] thanks [19:29:57] oh! it must be rvprop=flagged then [19:30:30] the oldreviewedpages API has a direction parameter [19:30:49] thanks [19:31:14] unreviewedpages does not seem to, it would probably be trivial to add if someone needs it [19:32:38] oh, right, oldreviewedpages sorts by timestamp and unreviewedpages sorts by title [19:32:43] unreviewdpages seems to have an urdir= too [19:32:47] https://hu.wikipedia.org/w/api.php?action=help&modules=query%2Bunreviewedpages [19:33:05] this seems better, I'll experiment with this [19:33:17] thank you for the pointers [19:34:25] note that the list of pages with pending revisions is oldreviewedpages [19:34:47] unreviewedpages is pages which do not have any revision that has ever been reviewed [19:35:02] so I guess it makes sense that it cannot be sorted by time [19:37:20] but https://hu.wikipedia.org/w/api.php?action=query&list=oldreviewedpages&ordir=newer seems to give the same list as https://hu.wikipedia.org/wiki/Speci%C3%A1lis:Elavult_ellen%C5%91rz%C3%B6tt_lapok , both of which are limited to a week or something [19:37:51] tgr: yes, I want both of them, sorted by the date of the oldest revision that haven't been reviewed yet [19:38:05] but I'd like the ones older than a week too [19:42:30] b_jonas: see https://www.mediawiki.org/wiki/API:Query#Continuing_queries about paging [19:43:03] for unreviewed, you probably can't sort by time with the API [19:43:12] yes, I know [19:43:16] you'll have to use Quarry or something like that [19:43:18] that's not the problem [19:43:32] oh, ok [19:43:42] but how about oldreviewdpages? [19:43:47] can I list ones older than a week? [19:45:29] wait, let me look at the extension docs first [19:45:44] at a glance it queries the live DB, shouldn't have any limitation on how far you can go back [19:46:14] so just use orstart or continuation [19:48:08] I shouldn't have to. it has an ordir parameter [19:48:15] ordir=newer says to list from oldest [19:48:29] !google is http://quarry.wmflabs.org/ [19:48:29] Unable to modify the database, access denied, link to the database isn't valid [19:48:53] can an op add quarry link? !quarry is http://quarry.wmflabs.org/ [19:50:21] this darned thing doesn't seem to be in the dumps, at least not the part with the revisions. [19:52:19] Hello! Is here anybody who can help me with maps.wikimedia.org ? [20:00:01] Tohaomg: try #wikimedia-infrastructure [20:00:42] b_jonas: there's a dump of the flaggedrevs table, but using Quarry tends to be easier [20:03:01] tgr: is that dump in the main database dumps? [20:03:53] tgr: huwiki isn't that big, the whole dump seems to be only like 10 gigabytes 7zipped or something, so the dump may be the easiest [20:05:22] I might go to sleep and continue looking at this tomorrow or later though [20:05:25] not sure what you mean by main dump, it's on the dump server [20:05:51] https://dumps.wikimedia.org/huwiki/20181101/ and search for flaggedrevs [20:06:12] I doubt it's easier than using Quarry though [20:06:43] ah [20:06:58] yeah, it's in a separate file then [20:07:03] that still works [20:07:13] Quarry... [20:07:19] I'll have to look at that too then [20:08:23] oh, there's a flaggedpages table dump too [20:08:26] that's even easier [20:09:20] tgr: I'm getting tired for this today, but thank you for the pointers [20:11:40] I mean, the compressed file expands hundredfold, but I can look at a section first for how it works, then decompress the whole thing streaming and process in one or two passes so that's not a big problem [20:12:03] I at least already know how to work with xml dumps, but Quarry is a new tool to learn [20:13:20] well, realistically it's more complicated than that, because then I might want to join with wikidata or something, so in practice it will be uglier, but at that point I can filter down the pages to a shorter list [20:23:00] in fact, the current dump and the stub-meta-history dump may be enough [20:23:26] those are much smaller than all the revisions