gfatools icon indicating copy to clipboard operation
gfatools copied to clipboard

conversion of GFA to rGFA

Open gsc74 opened this issue 2 years ago • 2 comments

@lh3, I'm curious to know , whether we can convert GFA to rGFA? . As per rGFA format, we need to maintain 3 additional information along with each segment lines (S). So, Assuming we have GFA with segments links;

S	s1	AAT
S	s2	T

can't we just add ;

S	s1	AAT	LN:i:3	SN:Z:chr1	SO:i:90374744	SR:i:0
S	s2	T	LN:i:1	SN:Z:chr1	SO:i:176753158	SR:i:0

Will it be valid rGFA ?

gsc74 avatar Apr 27 '22 13:04 gsc74

A GFA to rGFA tool would be extremely helpful for my use case at the moment. At a minimum, I could use more information about how the SO value is calculated; it's not 100% clear to me, and trying to determine what it should be while parsing the reference human pangenome freeze 1 (which only has intermittent SO tags) has proven to be unexpectedly complex!

JosephLalli avatar Aug 14 '22 05:08 JosephLalli

I think the reverse is true, a rGFA to GFA conversion tool is (also) severely lacking.

While minigraph is apparently the most efficient and promising pangenome tool at present, the use of rGFA prohibits downstream analysis using the odgi and vg toolkits (at least to my knowledge). Therefore, no SNP calling, odgi pavs calling etc is possible.

colindaven avatar May 08 '23 11:05 colindaven