Christian Schudoma
Christian Schudoma
Hi, I got the same issue with nucleotide sequences (version 15.6f452). `cluster --threads 8 --split-memory-limit 128G --min-seq-id 0.95 -c 0.90 --cov-mode 0`
Hi @Cantalapiedra, Yes, e.g. ``` GMGC10.033_471_658.UNKNOWN full: 1120971.AUCA01000049_gene802 1.09e-07 49.7 1 33 3 35 63.6 100.0 40.2 batch: 1163671.JAGI01000002_gene3538 2.18e-07 50.8 1 33 1 33 69.7 100.0 18.1 ``` In...
Hi @Cantalapiedra, Thanks for checking this. Is there any plan to address this issue? If the diamond results are not reliable in terms of reproducibility this directly translates to the...
Hi @Cantalapiedra , @fullama has pointed out that they had a similar issue when developing `gunc` (https://github.com/grp-bork/gunc) and it is likely to do with the `--algo` option, which is by...
Hi @Cantalapiedra , > If I understand correctly, the difference in results between your runs is due to the size of the data set making diamond choose a different --algo...
Is there a patch available for this (i.e. "process only the first file of a concatenated bzip2")? In my case I can see in the log that `fastqc` stops after...