nextclade icon indicating copy to clipboard operation
nextclade copied to clipboard

ENH: List instead of merely counting private mutations

Open proychou opened this issue 5 years ago • 4 comments

Nextclade lets users know how many private mutations are in a sequence but it would be great if it could generate a list of these private mutations. This could be either flagging them in the list of changes with a * or (P), or listing them separately a new column of the csv report.

Why is this useful? Number of private mutations and mutation clusters are both really helpful QC metrics that tell us about sequence quality and flag potential errors. Private mutations and clusters can sometimes be due to issues with trimming or quality issues at the ends of reads combined with low coverage at these sites leading to them being called in consensus. Having a list of where these private mutations (and maybe also mutation clusters) are will help us to go back to the bams or vcfs, review read support for each mutation, determine whether they are real and clean up the sequence.

proychou avatar Jan 15 '21 02:01 proychou

Thanks, Pavitra! I like this suggestion and it should be pretty straightforward to implement.

rneher avatar Jan 15 '21 08:01 rneher

Hey Richard! Sorry to jump in, but is this feature implemented in the web version? It would be great being able to get the list of private mutations.

Thanks for a super helpful tool!

Gonzalo

GonzaloYebra avatar Aug 19 '21 16:08 GonzaloYebra

@GonzaloYebra !!!!!! 😁 It is lovely to see you, even if it's on a github issue thread! 🙃

emmahodcroft avatar Aug 19 '21 16:08 emmahodcroft

Haha likewise Emma! Any little corner would do in this day and age :P

GonzaloYebra avatar Aug 19 '21 16:08 GonzaloYebra

@proychou We've implemented this for a while now - both in web as well as in the TSV export

image image

Thanks for the idea!

corneliusroemer avatar Sep 13 '22 10:09 corneliusroemer