deepbiologic
Haystack Heuristic
https://www.nature.com/articles/s41598-017-04439-5
A cool paper showing that a simple heuristic can find disease motifs in a dataset of B-cell receptor sequences drawn from roughly 50 healthy and 50 unhealthy patients.
A simple deep-learning version of such a model might use a 1D conv-net to predict a patient's healthy/unhealthy state from their receptor sequences.
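To make the 1D conv-net idea concrete, here is a rough forward-pass sketch in plain Python. It is not the paper's method and not an existing model in this repo. Everything in it is an assumption for illustration: the function names, the filter count and width, the random untrained weights, and especially the choice to score a patient by the maximum over their per-sequence scores (one simple way to go from many sequences to a single patient label).

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot(seq):
    """Encode a sequence as a list of 20-dimensional one-hot rows."""
    rows = []
    for aa in seq:
        row = [0.0] * len(AMINO_ACIDS)
        row[AA_INDEX[aa]] = 1.0
        rows.append(row)
    return rows


def init_params(n_filters=4, width=3, seed=0):
    """Random, untrained weights (hypothetical sizes, illustration only)."""
    rng = random.Random(seed)
    filters = [
        [[rng.gauss(0, 0.1) for _ in AMINO_ACIDS] for _ in range(width)]
        for _ in range(n_filters)
    ]
    out_w = [rng.gauss(0, 0.1) for _ in range(n_filters)]
    return {"filters": filters, "out_w": out_w, "out_b": 0.0}


def sequence_logit(seq, params):
    """1D convolution -> ReLU -> global max pool -> linear layer."""
    x = one_hot(seq)
    width = len(params["filters"][0])
    pooled = []
    for filt in params["filters"]:
        # Starting at 0.0 applies the ReLU floor and also handles
        # sequences shorter than the filter width.
        best = 0.0
        for start in range(len(x) - width + 1):
            act = sum(
                filt[k][c] * x[start + k][c]
                for k in range(width)
                for c in range(len(AMINO_ACIDS))
            )
            best = max(best, act)
        pooled.append(best)
    return sum(w * p for w, p in zip(params["out_w"], pooled)) + params["out_b"]


def patient_probability(sequences, params):
    """Assumed aggregation: the patient logit is the max over sequences."""
    logit = max(sequence_logit(s, params) for s in sequences)
    return 1.0 / (1.0 + math.exp(-logit))


params = init_params()
# Made-up sequences, not taken from the dataset.
p = patient_probability(["CARDYW", "CASSLGF"], params)
```

A real version would train the weights with a framework that supports autodiff and would likely need a more careful pooling scheme across sequences, since each patient contributes many sequences and only a few of them may carry a disease motif.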
Would it be possible to get access to the data as a public resource?
The data is publicly available at http://www.immport.org/immport-open/public/study/study/displayStudyDetail/SDY1043
Should try testing DeepMHC on these datasets.