kg-covid-19 icon indicating copy to clipboard operation
kg-covid-19 copied to clipboard

DNA sequences and protein sequences for corona virus

Open pnrobinson opened this issue 4 years ago • 2 comments

Collection of all available sequences for pathogenic and non pathogenic HCovs

pnrobinson avatar Mar 19 '20 19:03 pnrobinson

~~Probably could ingest from here~~ <- (edit: This is just SARS-CoV-2 data, probably not the right data for this ticket.)

@pnrobinson shall we ingest the actual genome/protein sequences, or just IDs for each strain? Reasonable to do ingest all sequences now, but might get unweildy as the sequencing firehose is turned on

justaddcoffee avatar Mar 23 '20 19:03 justaddcoffee

@justaddcoffee I guess this will depend on the use case, but for a KG probably we will not want to use the raw sequence but some processed list of properties. Can we make this an optional flag?

pnrobinson avatar Mar 23 '20 19:03 pnrobinson