SeqArray icon indicating copy to clipboard operation
SeqArray copied to clipboard

seqMerge error

Open davidroberson opened this issue 1 year ago • 3 comments

I split a large bcf (bcftools view) and converted each shard to a gds. I am now trying to merge the shards back together but not all of the smaller gds will merge. Occasionally I get this error

Error in .append_node_variant(gfile, "position", "int32", storage.option, : Invalid number of variants in 'position'. Calls: seqMerge -> .append_node_variant

I have been careful to not split near variant positions. Any ideas? Thanks.

davidroberson avatar May 14 '24 13:05 davidroberson

If the original file had multi-alleles on separate records would this cause the error shown above?

davidroberson avatar May 15 '24 02:05 davidroberson

just a kind ping..please help :)

davidroberson avatar May 23 '24 14:05 davidroberson

Not sure what the issue is. Did you split the bcf file by variants or samples? If the variants are split, then seqMerge() could just simply concatenate the data.

Also, show me the session information: sessionInfo().

zhengxwen avatar May 25 '24 02:05 zhengxwen

SeqArray_1.44.1 (https://github.com/zhengxwen/SeqArray/commit/702488b6b98156998e07b34b49cfb3585d14e370) should fix this question.

zhengxwen avatar Jun 22 '24 08:06 zhengxwen

thank you. I will confirm today.

davidroberson avatar Jun 22 '24 09:06 davidroberson

Thank you @zhengxwen I can confirm that this fix worked. It allowed me to create a split, apply, combine (seqMerge) workflow to speed up a conversion pipeline we have.

davidroberson avatar Jun 23 '24 22:06 davidroberson