biomedical
biomedical copied to clipboard
Tools for curating biomedical training data for large-scale language modeling
Dear Hackathon Participants, To receive authorship credit on our forthcoming dataset manuscript, we need all contributors to complete the following - Add your preferred name, contact information, and affiliation to...
When I count the number of chemical and disease entities in the different splits of bc5cdr, I get different numbers than what is reported in the paper and in BLURB...
## Adding a Dataset - **Name:** *DDXPlus* - **Description:** *A new Dataset for Medical Automatic Diagnosis* - **Task:** *NA* - **Paper:** *[link to the dataset paper if available](https://arxiv.org/abs/2205.09148)* - **Data:**...
https://github.com/bigscience-workshop/biomedical/blob/master/bigbio/biodatasets/pcr/pcr.py#L54
## Adding a Dataset - **Name:** BioASQ Task Synergy - **Description:** *None provided* - **Task:** QA - **Paper:** http://ceur-ws.org/Vol-2936/paper-10.pdf - **Data:** http://participants-area.bioasq.org/general_information/Task9b/ - **License:** NLM License Code: 8283NLM123
- **Name:** Conflate Dataset - **Description:** Conflation of word pairs from Medline abstracts - **Task:** Semantic Similarity - **Paper:** https://aclanthology.org/P08-3009/ - **Data:** https://nlp.cs.vcu.edu/data.html#conflate - **License:** ?
## Adding a Dataset - **Name:** *Mimic-III* - **Description:** *Predicting patient mortality from admission and discharge notes,* - **Task:** Text classification - **Paper:** *https://arxiv.org/abs/2102.04110* - **Data:** *link to the Github...
## Adding a Dataset - **Name:** BioNLP-ST 2019 RDoc - **Description:** *None provided* - **Task:** SENT_CLASS - **Paper:** https://aclanthology.org/D19-5729/ - **Data:** https://zenodo.org/record/3596942#.YiHwUhPML9B - **License:** CC BY 4.0
## Adding a Dataset - **Name:** BioNLP-ST 2019 AGAC - **Description:** *None provided* - **Task:** NER|RE - **Paper:** https://aclanthology.org/D19-5710/ - **Data:** https://sites.google.com/view/bionlp-ost19-agac-track/home - **License:** ?
## Adding a Dataset - **Name:** BioNLP-ST 2019 CRAFT-CA - **Description:** *None provided* - **Task:** NER|COREF - **Paper:** https://aclanthology.org/D19-5725/ - **Data:** https://github.com/UCDenver-ccp/CRAFT/releases/tag/v3.1.3 - **License:** CC BY 3.0