Ilya Shlyakhter

Results 66 issues of Ilya Shlyakhter

the -hm option of filter isn't in the pdf doc

Currently, a kmer that occurs twice in the same read gets a count of 2, same as a kmer that occurs once in two different reads. But the latter kmer...

Sometimes a kmer might appear in different reads, so it looks real, but all these reads are duplicates. It would be good if there was an option to drop kmers...

When a kmer in a read includes IUPAC ambiguity codes, add all possible concretizations of that kmer.

feature request

The output file format for filtering of reads is said to default to "same as input". But if the input is myfile.fasta.gz , the output is uncompressed even if named...

feature request

When filtering reads, it would help to have "just the passed read names" as one output format, for use with https://broadinstitute.github.io/picard/command-line-overview.html#FilterSamReads

enhancement

For using with makefile/pipeline tools like snakemake, it would help to return a non-zero error code in case of failure.

E.g. changing kmc1_db_writer.h does not cause kmc_tools binary to be rebuilt.

@marekkokot The filter command does not seem to handle correctly fasta files where the sequence is split across multiple lines. Also, if the file is too large, it fails with...

Add option to ignore untrustworthy kmers in reads, with "untrustworthy" defined as "having more than this many bases below a given quality".