biomedical icon indicating copy to clipboard operation
biomedical copied to clipboard

Tools for curating biomedical training data for large-scale language modeling

Results 180 biomedical issues
Sort by recently updated
recently updated
newest added

Dear Hackathon Participants, To receive authorship credit on our forthcoming dataset manuscript, we need all contributors to complete the following - Add your preferred name, contact information, and affiliation to...

When I count the number of chemical and disease entities in the different splits of bc5cdr, I get different numbers than what is reported in the paper and in BLURB...

good first issue

## Adding a Dataset - **Name:** *DDXPlus* - **Description:** *A new Dataset for Medical Automatic Diagnosis* - **Task:** *NA* - **Paper:** *[link to the dataset paper if available](https://arxiv.org/abs/2205.09148)* - **Data:**...

New Dataset

https://github.com/bigscience-workshop/biomedical/blob/master/bigbio/biodatasets/pcr/pcr.py#L54

## Adding a Dataset - **Name:** BioASQ Task Synergy - **Description:** *None provided* - **Task:** QA - **Paper:** http://ceur-ws.org/Vol-2936/paper-10.pdf - **Data:** http://participants-area.bioasq.org/general_information/Task9b/ - **License:** NLM License Code: 8283NLM123

English
QA
JSON

- **Name:** Conflate Dataset - **Description:** Conflation of word pairs from Medline abstracts - **Task:** Semantic Similarity - **Paper:** https://aclanthology.org/P08-3009/ - **Data:** https://nlp.cs.vcu.edu/data.html#conflate - **License:** ?

English
Semantic Similarity
VCU
New Dataset

## Adding a Dataset - **Name:** *Mimic-III* - **Description:** *Predicting patient mortality from admission and discharge notes,* - **Task:** Text classification - **Paper:** *https://arxiv.org/abs/2102.04110* - **Data:** *link to the Github...

New Dataset

## Adding a Dataset - **Name:** BioNLP-ST 2019 RDoc - **Description:** *None provided* - **Task:** SENT_CLASS - **Paper:** https://aclanthology.org/D19-5729/ - **Data:** https://zenodo.org/record/3596942#.YiHwUhPML9B - **License:** CC BY 4.0

CC BY 4.0
English
New Dataset

## Adding a Dataset - **Name:** BioNLP-ST 2019 AGAC - **Description:** *None provided* - **Task:** NER|RE - **Paper:** https://aclanthology.org/D19-5710/ - **Data:** https://sites.google.com/view/bionlp-ost19-agac-track/home - **License:** ?

English
NER
RE
JSON

## Adding a Dataset - **Name:** BioNLP-ST 2019 CRAFT-CA - **Description:** *None provided* - **Task:** NER|COREF - **Paper:** https://aclanthology.org/D19-5725/ - **Data:** https://github.com/UCDenver-ccp/CRAFT/releases/tag/v3.1.3 - **License:** CC BY 3.0

XML
CC BY 3.0
English
NER