tidysq icon indicating copy to clipboard operation
tidysq copied to clipboard

tidy processing of biological sequences in R

Results 51 tidysq issues
Sort by recently updated
recently updated
newest added

I would like to concatenate sequences and add a determined number of N between sequences Let's say I have these sequences >seq1 AACC >seq2 CCAA >seq3 CCCT How I could...

enhancement

I'm trying to bite() based on start (X7) and end(X8) columns but I cannot figure it out. ![image](https://user-images.githubusercontent.com/18449327/229152164-48635d0e-c0dd-40a3-9da5-907a286ad194.png)

discussion

The codon table used by translate() has 4 codons swapped, where the wrong amino acid is called (specifically TTN codons). ```{r} test_tib % mutate(tidysq_codons = sq(codons), tidysq_aa = translate(tidysq_codons)) test_tib...

bug

For functions like `read_fasta()`. Currently the error thrown for missing file is non-informative.

enhancement

Currently the order of matching is dna_bsc -> rna_bsc -> ami_bsc -> dna_ext -> rna_ext -> ami_ext -> unt. However, `[A, C, T, G, N]` is matched by ami_bsc alphabet....

bug
refactor

`sqibble` is non formalized idea, by formalization of which the package may benefit in numerous ways. We can define `sqibble` as a `tibble` containing at least one column of type...

enhancement
discussion
refactor

A simple AMP case study. 1) dataset: around 500 AMP sequences along with additional boolean columns Anti-Gram-, Anti-Gram+, Antifungal, Anticancer etc. 2) reading .fasta file and .csv file with labels...

Maybe it would be valuable to redefine `LenSq` as `unsigned long long int` and add addtional type of `UnsignedLenSq`?

C++ adventures