khmer
khmer copied to clipboard
Khmer (specifically `reverse_complement`) only intended for upper-case sequences?
Many reference FASTA/Q files have upper/lower case characters due to masking of low complexity sequences (like here or here).
I've noticed that:
>>>import khmer
>>>khmer.reverse_complement("c")
'c'
yet
>>>import khmer
>>>khmer.reverse_complement("C")
'G'
Are the functionalities of khmer only intended for upper-case characters?
Note khmer.__version__ is 2.1.1
hi @dkoslicki for 3.x, we significantly redid this - suggest installing latest conda package or latest master. See https://github.com/dib-lab/khmer/pull/1590 - it doesn't lay out the changes in detail, unfortunately, but should address the uppercase/lowercase issue.
>>> import khmer
>>> khmer.__version__
'3.0.0a3'
>>> import khmer
>>> khmer.reverse_complement("c")
'G'
>>> khmer.reverse_complement("C")
'G'
>>>