gnomix icon indicating copy to clipboard operation
gnomix copied to clipboard

Can pre-trained models be used on GRCh38 data?

Open mojoman666 opened this issue 2 years ago • 1 comments

I am interested in using GNomix for local ancestry inference on my phased WES data. I understand that the pre-trained models used GRCh37 data for training. My data is in GRCh38. What I'm trying to understand is whether GNomix uses any positional information for the predictions in a way that would make it dependent on a genome build? Is it necessary to retrain the model from scratch using GRCh38 reference in order to use it on my data, or can I use the pre-trained models for my GRCh38 data out of the box? Any advice is highly appreciated.

mojoman666 avatar Sep 15 '23 14:09 mojoman666

I've used gnomix on hg38 aligned data before and have not had a problem. I constructed a reference panel and train the models based on the reference panel. Then I use that for local ancestry inference on the query. I hope that helps!

dralhindi avatar Apr 17 '24 18:04 dralhindi