Christian Schudoma comments

Results 6 comments of


                                            Christian Schudoma

there must be an error: xx deleted from xx that now is empty, but not assigned to a cluster

Hi, I got the same issue with nucleotide sequences (version 15.6f452). `cluster --threads 8 --split-memory-limit 128G --min-seq-id 0.95 -c 0.90 --cov-mode 0`

Different results when annotating a large protein set at once vs splitting the input into multiple files

Hi @Cantalapiedra, Yes, e.g. ``` GMGC10.033_471_658.UNKNOWN full: 1120971.AUCA01000049_gene802 1.09e-07 49.7 1 33 3 35 63.6 100.0 40.2 batch: 1163671.JAGI01000002_gene3538 2.18e-07 50.8 1 33 1 33 69.7 100.0 18.1 ``` In...

Different results when annotating a large protein set at once vs splitting the input into multiple files

Hi @Cantalapiedra, Thanks for checking this. Is there any plan to address this issue? If the diamond results are not reliable in terms of reproducibility this directly translates to the...

Different results when annotating a large protein set at once vs splitting the input into multiple files

Hi @Cantalapiedra , @fullama has pointed out that they had a similar issue when developing `gunc` (https://github.com/grp-bork/gunc) and it is likely to do with the `--algo` option, which is by...

Different results when annotating a large protein set at once vs splitting the input into multiple files

Hi @Cantalapiedra , > If I understand correctly, the difference in results between your runs is due to the size of the data set making diamond choose a different --algo...

error when opening valid .fastq.bz2 (Ran out of data in the middle of a fastq entry. Your file is probably truncated)

Is there a patch available for this (i.e. "process only the first file of a concatenated bzip2")? In my case I can see in the log that `fastqc` stops after...