
Inclusion criteria for MAGs when setting up custom database

Open adityabandla opened this issue 5 years ago • 0 comments

Hi Stephen, thanks for the great tool. I have a set of species-level MAGs that range in completeness from 50-100% and in redundancy/contamination from 0-10%. These are species representatives chosen using dRep with a genome average ANI cutoff of 95%.

Since these are environmental MAGs, I would like to construct my own database. Given the above numbers, can I include all MAGs, or is it better to include only those that are substantially complete, say >70%? Also, to what extent does redundancy affect downstream steps such as SNP calling?
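To make the question concrete, here is a minimal sketch of the kind of filter I have in mind. The `Mag` record, the `filter_mags` helper, and the example values are hypothetical illustrations and not part of MIDAS or dRep; the thresholds are just the numbers mentioned above.

```python
from typing import List, NamedTuple


class Mag(NamedTuple):
    """Hypothetical per-genome quality record (values in percent)."""
    name: str
    completeness: float
    contamination: float


def filter_mags(
    mags: List[Mag],
    min_completeness: float = 70.0,
    max_contamination: float = 10.0,
) -> List[Mag]:
    """Keep MAGs with completeness > min_completeness and
    contamination <= max_contamination."""
    return [
        m for m in mags
        if m.completeness > min_completeness
        and m.contamination <= max_contamination
    ]


# Made-up example values, only to show how the thresholds behave.
example_mags = [
    Mag("mag_a", 95.0, 2.0),
    Mag("mag_b", 55.0, 1.0),   # dropped: completeness too low
    Mag("mag_c", 70.0, 0.0),   # dropped: not strictly above 70%
    Mag("mag_d", 80.0, 10.0),
]
kept = filter_mags(example_mags)
print([m.name for m in kept])
```

Setting `min_completeness=0` would correspond to the "include all MAGs" option.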

Best, Adi

adityabandla, Nov 06 '19 15:11