vg icon indicating copy to clipboard operation
vg copied to clipboard

Program stuck at [IndexRegistry]: Chunking VCF(s) for days

Open Jokendo-collab opened this issue 10 months ago • 2 comments

I am having a problem running vg mpmap. It has stuck on the [IndexRegistry]: Chunking VCF(s) for days. Is there a solution for this problem?

[IndexRegistry]: Checking for phasing in VCF(s).
[IndexRegistry]: Provided: VCF w/ Phasing
[IndexRegistry]: Chunking inputs for parallelism.
[IndexRegistry]: Chunking FASTA(s).
[IndexRegistry]: Chunking VCF(s).

Jokendo-collab avatar Apr 25 '24 05:04 Jokendo-collab

Did you provide the variants as a single VCF file for the entire genome? If so, one thing you can do to speed it up is to separate the single VCF into chromosome-level VCFs. That will let vg autoindex use more parallelism during this step. I would also recommend using approximately as many threads as chromosomes if you have them available.

jeizenga avatar Apr 25 '24 17:04 jeizenga

Hi,

My genome assembly was highly fragmented. I only extracted chromosomal scale contigs and it now running.

On Thu, Apr 25, 2024 at 8:51 PM Jordan Eizenga @.***> wrote:

Did you provide the variants as a single VCF file for the entire genome? If so, one thing you can do to speed it up is to separate the single VCF into chromosome-level VCFs. That will let vg autoindex use more parallelism during this step. I would also recommend using approximately as many threads as chromosomes if you have them available.

— Reply to this email directly, view it on GitHub https://github.com/vgteam/vg/issues/4274#issuecomment-2077837750, or unsubscribe https://github.com/notifications/unsubscribe-auth/AGJ34O64GFEHRJKZ6ROU4Q3Y7E7CFAVCNFSM6AAAAABGYFFKXGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANZXHAZTONZVGA . You are receiving this because you authored the thread.Message ID: @.***>

Jokendo-collab avatar Apr 25 '24 18:04 Jokendo-collab