gharchive.org

PullRequestEvent doesn't contain review_requested action entry

Open tisonkun opened this issue 3 years ago • 3 comments

According to https://docs.github.com/en/developers/webhooks-and-events/events/github-event-types, the PullRequestEvent action can be one of: opened, edited, closed, reopened, assigned, unassigned, review_requested, review_request_removed, labeled, unlabeled, and synchronize.

However, analyzing the GHArchive data gives:

SELECT
    action,
    COUNT(1)
FROM github_events
WHERE event_type = 'PullRequestEvent'
GROUP BY action;

┌─action──────┬───count()─┐
│ opened      │ 226861706 │
│ closed      │ 172356866 │
│ reopened    │   1924348 │
│ labeled     │        24 │
│ synchronize │    453125 │
│ merged      │         4 │
└─────────────┴───────────┘

There are no review_requested or review_request_removed entries.

I don't know whether the API doesn't emit such events or the crawler doesn't handle them.

tisonkun avatar Aug 05 '22 03:08 tisonkun

Also, labeled and merged events were caught at some point but don't occur later.

tisonkun avatar Aug 05 '22 03:08 tisonkun

Actually, after 2015 there are only events with the actions 'opened', 'closed', and 'reopened'. The others occurred before 2012, and labeled occurred on some days in 2017.
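
A per-year breakdown like this can also be cross-checked against raw event records. The following is only a minimal sketch: it assumes each record is one JSON object per line with `type`, `created_at` (ISO 8601 string), and `payload.action` fields. The function name `count_pr_actions_by_year` and the sample records are made up for illustration and are not real GHArchive data.

```python
import json
from collections import Counter


def count_pr_actions_by_year(lines):
    """Tally PullRequestEvent actions per (year, action) from JSON-lines records.

    Assumes each record has "type", "created_at" and "payload.action" fields;
    records of any other event type are skipped.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        event = json.loads(line)
        if event.get("type") != "PullRequestEvent":
            continue
        # created_at is assumed to look like "2017-03-01T12:00:00Z",
        # so the first four characters are the year.
        year = int(event["created_at"][:4])
        action = event.get("payload", {}).get("action", "unknown")
        counts[(year, action)] += 1
    return counts


# Hypothetical sample records, for illustration only.
sample = [
    '{"type": "PullRequestEvent", "created_at": "2011-05-01T00:00:00Z", "payload": {"action": "synchronize"}}',
    '{"type": "PullRequestEvent", "created_at": "2017-03-01T12:00:00Z", "payload": {"action": "labeled"}}',
    '{"type": "PullRequestEvent", "created_at": "2022-08-05T03:00:00Z", "payload": {"action": "opened"}}',
    '{"type": "PullRequestEvent", "created_at": "2022-08-05T04:00:00Z", "payload": {"action": "opened"}}',
    '{"type": "PushEvent", "created_at": "2022-08-05T05:00:00Z", "payload": {}}',
]

if __name__ == "__main__":
    for (year, action), n in sorted(count_pr_actions_by_year(sample).items()):
        print(year, action, n)
```

Grouping by (year, action) rather than by action alone makes it easy to see in which years a given action appears at all.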

tisonkun avatar Aug 05 '22 03:08 tisonkun

@tisonkun This is not an issue with GHArchive. GitHub webhook events and timeline events are different, although they share the same data schema. I think, perhaps for cost reasons, labeled, assigned, and other actions are not included in timeline events, so we will not get that data in the GitHub events log.

frank-zsy avatar Jan 29 '23 09:01 frank-zsy