Jue Ruan comments

Results 87 comments of


                                            Jue Ruan

参数设置调整

It should be a complicate genome, try tune parameters `-R`, `--aln-dovetail 0` and others referred in issues.

参数设置调整

The minimal count of reads to support that two genomic regions are directly connected in order. On graph, the minimal count of reads to support a confident edge.

Consensus step : genome size lower than pre-consensus stage

The first step is to check whether the polished contigs are more accurate using WGS short reads.

Consensus step : genome size lower than pre-consensus stage

So, the reason should be many repeats were collapsed in assembling, not the problem of wtpoa-cns. One option is to add `-R` to `wtdbg2`, which will be 2X slower. Another...

Consensus step : genome size lower than pre-consensus stage

In wtdbg2 step, the assembly size was stated by uncorrected seqeunce length, usually will become smaller after wtpoa-cns.

Consensus step : genome size lower than pre-consensus stage

If the genome size was correctly estimated and the genome was complicated, maybe there is no way. However, please find out some contigs that differed much in size between before...

Consensus step : genome size lower than pre-consensus stage

wtdbg2 tends to collapse similar regions. For your case, please try increase '-s 0.5' to '-s 0.8' or others.

Enquiry about the parameter "-m"

1, min_match: yes, you are right. 2, `--dp-penalty-gap` and `--dp-penalty-var` are not calculated in the same way with SW, they are just used to stop the tracing in sparse matrix...

Long runtime - choosing correct parameters

Have a look at the `Quatiles` log message, like this ```txt Quatiles: 10% 20% 30% 40% 50% 60% 70% 80% 90% 95% 16 21 29 72 268 972 3329 11697...

Long runtime - choosing correct parameters

```txt [Fri Feb 5 16:57:44 2021] - high frequency kmer depth is set to 16534 ``` Try to set `-K 2000` to speed it up.