kg-covid-19
DNA sequences and protein sequences for coronaviruses
Collection of all available sequences for pathogenic and non-pathogenic HCoVs
~~Probably could ingest from here~~ <- (edit: This is just SARS-CoV-2 data, probably not the right data for this ticket.)
@pnrobinson shall we ingest the actual genome/protein sequences, or just IDs for each strain? It is reasonable to ingest all sequences now, but it might get unwieldy once the sequencing firehose is turned on.
@justaddcoffee I guess this will depend on the use case, but for a KG we probably will not want to use the raw sequence, but rather some processed list of properties. Can we make this an optional flag?
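
For illustration, here is a minimal sketch of what such an optional flag could look like, assuming a Python ingest script. The flag name `--include-sequences` and the functions `build_parser` and `make_node` are hypothetical and are not part of kg-covid-19; the node layout is likewise only an assumption.

```python
import argparse
from typing import Dict, Optional


def build_parser() -> argparse.ArgumentParser:
    """Build a CLI parser with a hypothetical opt-in flag for raw sequences."""
    parser = argparse.ArgumentParser(description="Hypothetical strain ingest")
    # Off by default, so a plain run only ingests strain IDs.
    parser.add_argument(
        "--include-sequences",
        action="store_true",
        help="also store raw genome/protein sequences on each strain node",
    )
    return parser


def make_node(
    strain_id: str, sequence: Optional[str], include_sequences: bool
) -> Dict[str, str]:
    """Return a node record; the sequence is attached only when requested."""
    node = {"id": strain_id}
    if include_sequences and sequence is not None:
        node["sequence"] = sequence
    return node


if __name__ == "__main__":
    args = build_parser().parse_args()
    # Placeholder strain ID and sequence, for demonstration only.
    print(make_node("strain:example", "ATGC", args.include_sequences))
```

With the default off, the graph only carries strain IDs, and the raw sequences can be switched on per run if a use case needs them.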