MTBseq_source icon indicating copy to clipboard operation
MTBseq_source copied to clipboard

Annotation format and column meaning

Open SchwarzMarek opened this issue 1 year ago • 1 comments

Hello, thanks for the pipeline I' want to use different reference and needed to convert annotations as suggested in documentation. To get the feel about the annotation format, I've looked at the M.abscessus annotations; I' not sure what does the following columns mean and if/how this information is used in the pipeline and what are acceptable values:

What I've seen in the file:

status_region             [status 3]
status_function          [annotated]
region_number                    [5]
function_number                  [6]

Also, since genome can have multiple sequences (chromosomes or incomplete assembly with multiple contigs/scaffolds), how would this be specified in this pipeline, as it appears that no chromosome identification is present in the *.txt annotation file.?

SchwarzMarek avatar Feb 14 '24 13:02 SchwarzMarek