
Proposal to add Chan Zuckerberg Initiative Disease Research State Model dataset

Open GullyBurns opened this issue 2 years ago • 1 comment

Adding a Dataset

  • Name: Disease Research State Model (DRSM) Corpus
  • Description: A corpus of primary research articles on rare diseases, each tagged with a single label denoting the main purpose of the paper. The task is multiclass classification of research papers into: (A) clinical characterization + pathology; (B) therapeutics in the clinic; (C) disease mechanism; (D) patient-based therapeutics; (E) other.
  • Task: Text Classification
  • Paper: not available
  • Data: https://github.com/chanzuckerberg/DRSM-corpus
  • License: CC0 1.0 Universal
  • Motivation: Large corpus of annotated papers (8,926), similar to LitCovid but focussed on papers describing over 50 rare diseases, an understudied field.
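
To make the five-way labeling task concrete, here is a minimal sketch of how the classes in the description might be mapped to integer ids for a text classifier. The names `DRSM_LABELS` and `label_to_id` are hypothetical, and the letter codes are taken from the description above; the actual corpus files may encode the classes differently.

```python
# Hypothetical label scheme: letter codes follow the issue description,
# not a confirmed field layout of the DRSM corpus files.
DRSM_LABELS = {
    "A": "clinical characterization + pathology",
    "B": "therapeutics in the clinic",
    "C": "disease mechanism",
    "D": "patient-based therapeutics",
    "E": "other",
}


def label_to_id(code: str) -> int:
    """Map a letter code (case-insensitive) to a class index in 0..4.

    Raises ValueError if the code is not one of the five known letters.
    """
    codes = sorted(DRSM_LABELS)  # ["A", "B", "C", "D", "E"]
    return codes.index(code.strip().upper())


def id_to_name(index: int) -> str:
    """Return the human-readable class name for a class index."""
    codes = sorted(DRSM_LABELS)
    return DRSM_LABELS[codes[index]]
```

Because each paper carries exactly one tag, a single integer per document is enough; a multi-label setup would need a different encoding.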

GullyBurns • Apr 21 '22 05:04

#self-assign

GullyBurns • May 04 '22 23:05