Markus Schlegel

Results 17 comments of Markus Schlegel

Hi! There is an implementation of parallel processing, the documentation [can be found here](https://docs.rs/seq_io/latest/seq_io/parallel/index.html). Although there is still the description "experiments with parallel processing" in these docs, I think that...

How large are single records in your case? If you have many (possibly thousands) of sequence records per 10G FASTA file, the functions in `seq_io::parallel` should work well in distributing...

Small update: the R script in embed_krona now directly generates the XML, which is imported with ktImportXML, so the Perl script is not necessary anymore. The remaining question would be,...

`seq_io` automatically grows the internal buffer when sequences are longer, this should work out of the box. Buffer growth [can also be configured](https://docs.rs/seq_io/0.3.1/seq_io/policy/index.html), e.g. an upper limit can be set...

Thanks for the interest in this feature, unfortunately I'm quite busy right now with other work. I did a lot more refactoring since the first 0.4 alpha release, trying to...

@nh13 I'm sorry, I'm still nowhere near a second alpha release due to lots of other work, so a final 0.4.0 release will definitely take some time. Also, there will...

@nh13 I had a look at the code again, and especially for the FASTA module it's a bit more complicated because the `RecordSet`s are being reused. Also, it has to...

Thanks for your interest in this library and the pull request! To be honest, I avoided this kind of functionality up to now, because there is several possibilities to implement...

I rewrote the whole library and added support for "FASTX" with two different approaches. GZIP recognition should be easy to implement now (for files, not stdin). I'm not sure yet...

Thanks for your interest in contributing, I would be happy about such a PR. The header (ID and description) are already available as strings through the `Record::id()`, `desc()` and `id_desc()`...