Benjamin Buchfink comments

Results 445 comments of


                                            Benjamin Buchfink

"Tabulator character in sequence title" warning

It doesn't affect the function, you just need to be aware that they will be escaped as `\t` in the output.

Optimising the speed of DIAMOND based on the size of input queries (and other)

> In your opinion, would such an approach lead to noticeable speed improvements. Depends on the size of your query files, I suggest testing it. > Is there a description...

Diamond blastp two sequences with themselves, there is 0 pairwise alignments reported

This is due to the repeat masking, you need to use `--masking 0`.

Diamond Clustering - Failed to allocate sufficient memory. Please refer to the manual for instructions on memory usage.

There are some issues causing increased memory use that will be fixed in the next release. For now one thing you could try is using `--bin 256` (or possibly higher).

Diamond Clustering - Failed to allocate sufficient memory. Please refer to the manual for instructions on memory usage.

Another option would be `--cluster-steps faster_lin fast_lin`, that should be sufficient for 80% id cutoff.

Diamond Clustering - Failed to allocate sufficient memory. Please refer to the manual for instructions on memory usage.

Please try again with the latest release, memory use has been reduced.

Using blast can find results, but diamond has no results

DIAMOND is not configured to find very short hits by default. I shared some tips how to do this here: https://github.com/bbuchfink/diamond/issues/832

Documentation for realign is not updated

At the moment, you need to specify `qseqid`, not `cseqid`, on the command line. It is inconsistent and should be changed in a future version.

segmentation fault

Please provide the command line you used to run diamond and your version.

segmentation fault

These are not the files you provided. I ran diamond blastx of your `ncor_cdhit.fasta` against your `ncor_cdhit.fasta.transdecoder.pep` and it completed correctly.