Wei Shen

Results 235 comments of Wei Shen

Thanks for the feedback. If the file contains only one sequence, it would be easy to extract: $ seqkit head -n 1 ../tests/hairpin.fa \ | seqkit fx2tab -ni -g |...

Added. I think It's useful for checking bacterial assemblies. ``` $ seqkit stats -a ../tests/hairpin.fa file format type num_seqs sum_len min_len avg_len max_len Q1 Q2 Q3 sum_gap N50 Q20(%) Q30(%)...

Available in v2.3.0 : https://github.com/shenwei356/seqkit/releases/tag/v2.3.0

> I like your tool very much, when I wanted to use it in conda I found that it does not exist for all operating systems, even though you provide...

Great, I've never tried to build a new Conda channel. I searched across recipes of the Bioconda but found no direct solutions. Let me ask for help. I'm not sure...

> ... currently Bioconda only supports Linux and OSX. ... But someone using ubuntu in windows could also install tools from Bioconda. Does this help? https://github.com/kapsakcj/win10-linux-conda-how-to/blob/master/README.md

Sounds useful. But I'm not familiar with sub-processes handling in golang either. Someone suggested a similar feature in this or another repo, I think it's worth trying, when I have...

To my knowledge, you want to split the input file according to the sequence identities, but you also mention "non-unique headers" where the option`-i/--by-id` does not have any issue. I'd...

Except for the `XXX` which is the base name of the input file with file extension removed, there are 3 possible formats for the 3 kinds of jobs. ``` XXX.part_NNN.eee.EEE...

Added. ``` -i, --by-id split squences according to sequence ID --by-id-prefix string file prefix for --by-id -p, --by-part int split sequences into N parts --by-part-prefix string file prefix for --by-part...