colrev icon indicating copy to clipboard operation
colrev copied to clipboard

feat: GitHub SearchSource

Open geritwagner opened this issue 2 years ago • 10 comments
trafficstars

Description

Integrate GitHub as a SearchSource within the CoLRev environment, enabling the search and prep operations to utilize GitHub's vast repository of code and documentation. This feature will allow users to search GitHub repositories by title and README content, enhancing the scope of CoLRev's research and analysis capabilities.

Preferred Solution

  • Develop a New SearchSource for GitHub: This SearchSource should leverage the GitHub REST API to perform searches based on repository titles and README contents.
  • Implement a Prep Package for GitHub: Similar to the approach taken with the crossref-prep-metadata package, this functionality should retrieve relevant metadata from GitHub repositories, such as the repository's description, topics, license, and possibly citation information, when a github.com URL is provided.

Links for Reference and Development

User Story

  1. A user initializes CoLRev with colrev init.
  2. To perform a search, the user runs colrev search -a colrev.github, specifying search parameters that include repository titles and keywords found in README files.
  3. CoLRev utilizes the GitHub SearchSource to query the GitHub API, retrieving a list of repositories that match the search criteria.
  4. Search results, including repository metadata and links to README files, are then stored in a designated file or database.
  5. When a user has a list of GitHub repository URLs, they can use the prep operation to enrich the collected data with additional metadata from each repository, enhancing the quality and usefulness of the search results.

Expected Effort

  • Duration: 2 months
  • Team Requirement: 3-4 individuals

geritwagner avatar Sep 18 '23 15:09 geritwagner

i'd like to contribute to this issue

edensarrival avatar Apr 18 '24 08:04 edensarrival

I would like to contribute to this issue

koljarinne avatar Apr 18 '24 08:04 koljarinne

I would like to contribute to this issue.

U1TIM4T3 avatar Apr 18 '24 09:04 U1TIM4T3

I would like to contribute to this issue.

JohannesDiel avatar Apr 26 '24 11:04 JohannesDiel

I want to contribute to this issue.

k-schnickmann avatar Apr 28 '24 18:04 k-schnickmann

We currently have 5 people interested in a topic for 3-4. @JohannesDiel : can I ask you to join #360 with @pmao0907 and @MingxinJiang ?

geritwagner avatar Apr 29 '24 10:04 geritwagner

Yes, no problem. Should I just comment in #360 and delete my comment here?

JohannesDiel avatar Apr 29 '24 10:04 JohannesDiel

Thank you @JohannesDiel for joining the other group. No need to delete comments.

geritwagner avatar Apr 29 '24 10:04 geritwagner

This means we have a group of 4: @edensarrival , @koljarinne , @U1TIM4T3 , and @k-schnickmann :+1: Please go ahead, select a group lead, fork the repository and link your repository in this feed.

geritwagner avatar Apr 29 '24 10:04 geritwagner

https://github.com/edensarrival/colrev_SS24

edensarrival avatar May 02 '24 14:05 edensarrival