kg-covid-19 icon indicating copy to clipboard operation
kg-covid-19 copied to clipboard

ingest PubChem COVID-19 Disease Map

Open realmarcin opened this issue 5 years ago • 3 comments

Name of the dataset COVID-19 Disease Map

https://pubchem.ncbi.nlm.nih.gov/#query=COVID-19%20Disease%20Map&tab=pathway&source=COVID-19%20Disease%20Map

Mapping or relevant fields A clear and concise description of what which fields you would want to be ingested.

If possible, highlight which fields map to nodes and which fields map to edges. Refer to Data Preparation for guidelines on how the final transformed data should be represented.

Additional context Add any other context, requests, concerns.

realmarcin avatar Aug 19 '20 17:08 realmarcin

This is missing "A clear and concise description of what which fields you would want to be ingested. If possible, highlight which fields map to nodes and which fields map to edges." Also, needs info about how ingesting this data source would benefit kg-covid-19. (That can go under "Additional context.")

nlharris avatar Sep 17 '20 19:09 nlharris

what is the status of this ticket? is it actionable?

cmungall avatar Mar 29 '21 15:03 cmungall

Looks like a valuable dataset. I think though we are missing a clear idea of what data exactly we want to ingest here, and what this buys us.

Looks like by PubChem's count there are 537 compounds, 177 genes, 179 proteins, 606 pathways, 19 bioassays, and 196 papers available for ingest. Which of these data would we like to ingest, and what might be redundant with other ingests (e.g. seems likely that genes, proteins, and literature are captured already by other ingests)?

justaddcoffee avatar Mar 29 '21 15:03 justaddcoffee