GTCtoVCF icon indicating copy to clipboard operation
GTCtoVCF copied to clipboard

Illumina Generated Manifest has no RefStrand column

Open tbrunetti opened this issue 5 years ago • 5 comments

I have no problems running this using BPM, however, when using an Illumina provided manifest (.csv) it is missing the RefStrand header. These are the headers that were provided, can you tell me which one maps to RefStrand (I want to say it is likely SourceStrand, but I would like confirmation before changing a header name; if is is SourceStrand, should I update all SourceStrand embedded headers to be RefStrand)?:

IlmnID, Name, IlmnStrand, SNP, AddressA_ID, AlleleA_ProbeSeq, AddressB_ID, AlleleB_ProbeSeq, GenomeBuild, Chr, MapInfo, Ploidy, Species, Source, SourceVersion, SourceStrand, SourceSeq, TopGenomicSeq, BeadSetID

tbrunetti avatar Aug 07 '19 19:08 tbrunetti

Which CSV manifest are you using? It should have a RefStrand column if it was created relatively recently. It is not the same as the SourceStrand.

jjzieve avatar Aug 08 '19 22:08 jjzieve

@jjzieve

Hi, I have exactly the same problem. The manifest is old (dated 2014), though I have to use it now to process old genotyping data. Is there a way (algorithmically) to assign RefStrand values based on whatever sequences can be found in SourceSeq/TopGenomicSeq/etc?

danilovkiri avatar Aug 04 '20 13:08 danilovkiri

@danilovkiri You could try using the SourceSeq and comparing to the ProbeSeq values if you want, but I wouldn't recommend it. The way its done internally validates against the reference genome in case the SourceSeq isn't accurate (this can occur on custom designs). I would email [email protected] and ask to get the RefStrand mapped on your manifest.

jjzieve avatar Aug 04 '20 22:08 jjzieve

Thank you very much @jjzieve

danilovkiri avatar Aug 05 '20 08:08 danilovkiri

@danilovkiri No problem! Let me know if that works out for you. If so, I'll close this issue.

jjzieve avatar Aug 05 '20 17:08 jjzieve