tidysq
tidy processing of biological sequences in R
I would like to concatenate sequences and add a determined number of N between sequences Let's say I have these sequences >seq1 AACC >seq2 CCAA >seq3 CCCT How I could...
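One possible approach to the question above, sketched in base R rather than with tidysq functions: collapse the sequences with a spacer made of repeated `N`s. The spacer length `n_spacer` and the variable names are assumptions chosen for illustration, and the sequences are the three from the question.

```r
# Base R sketch; no tidysq functions are used here.
# The three sequences from the question, as a named character vector.
seqs <- c(seq1 = "AACC", seq2 = "CCAA", seq3 = "CCCT")

# Hypothetical spacer length: number of N's to place between sequences.
n_spacer <- 5

# Build the spacer once, then use it as the separator when collapsing.
spacer <- strrep("N", n_spacer)
joined <- paste(seqs, collapse = spacer)

print(joined)
```

The spacer appears only between sequences, never at the ends, because `paste(..., collapse = )` inserts the separator between elements only.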
I'm trying to bite() based on start (X7) and end (X8) columns but I cannot figure it out.
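As a rough illustration of per-row extraction by start and end columns, the sketch below uses base R `substring()` on plain character strings. It is not `bite()` and makes no claim about how tidysq handles this case. The column names `X7` and `X8` come from the question; the `sequence` column and the example data are hypothetical.

```r
# Base R sketch on plain strings; this is not tidysq's bite().
# `sequence` and the values below are made-up example data.
hits <- data.frame(
  sequence = c("AACCGGTT", "CCAATTGG"),
  X7 = c(2, 1),  # 1-based start position
  X8 = c(5, 4),  # 1-based end position (inclusive)
  stringsAsFactors = FALSE
)

# substring() is vectorised over text, first and last,
# so each row is cut with its own start and end.
hits$fragment <- substring(hits$sequence, hits$X7, hits$X8)

print(hits)
```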
The codon table used by translate() has 4 codons swapped, where the wrong amino acid is called (specifically TTN codons). ```{r} test_tib % mutate(tidysq_codons = sq(codons), tidysq_aa = translate(tidysq_codons)) test_tib...
For functions like `read_fasta()`: currently, the error thrown for a missing file is uninformative.
Currently the order of matching is dna_bsc -> rna_bsc -> ami_bsc -> dna_ext -> rna_ext -> ami_ext -> unt. However, `[A, C, T, G, N]` is matched by the ami_bsc alphabet....
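The effect of the matching order can be sketched with a toy first-match detector in base R. The alphabets below are simplified assumptions (gap and stop symbols are omitted, and only four of the types are included), and `guess_type()` is a hypothetical helper, not tidysq's implementation.

```r
# Toy first-match type detection; alphabets are simplified assumptions
# and guess_type() is a hypothetical helper, not part of tidysq.
alphabets <- list(
  dna_bsc = c("A", "C", "G", "T"),
  rna_bsc = c("A", "C", "G", "U"),
  ami_bsc = strsplit("ACDEFGHIKLMNPQRSTVWY", "")[[1]],
  dna_ext = strsplit("ACGTWSMKRYBDHVN", "")[[1]]
)

# Return the name of the first alphabet containing every symbol,
# or "unt" (untyped) when none matches.
guess_type <- function(symbols, alphabets) {
  for (name in names(alphabets)) {
    if (all(symbols %in% alphabets[[name]])) {
      return(name)
    }
  }
  "unt"
}

symbols <- c("A", "C", "T", "G", "N")

# With the order from the issue, N (asparagine) makes ami_bsc match first.
current_guess <- guess_type(symbols, alphabets)

# Trying dna_ext before ami_bsc changes the outcome for the same input.
reordered <- alphabets[c("dna_bsc", "rna_bsc", "dna_ext", "ami_bsc")]
reordered_guess <- guess_type(symbols, reordered)

print(c(current = current_guess, reordered = reordered_guess))
```

Since every symbol in `[A, C, T, G, N]` is also a valid amino acid letter, any first-match scheme that tries ami_bsc before dna_ext will classify such input as a protein sequence.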
`sqibble` is a not-yet-formalized idea, and formalizing it may benefit the package in numerous ways. We can define `sqibble` as a `tibble` containing at least one column of type...
A simple AMP case study. 1) dataset: around 500 AMP sequences along with additional boolean columns Anti-Gram-, Anti-Gram+, Antifungal, Anticancer etc. 2) reading .fasta file and .csv file with labels...
Maybe it would be valuable to redefine `LenSq` as `unsigned long long int` and add an additional `UnsignedLenSq` type?