STing icon indicating copy to clipboard operation
STing copied to clipboard

Profile file for cgMLST

Open lskatz opened this issue 2 years ago • 3 comments

Hi, related to #11, could you show me what a profile file should look like for cg/wgMLST? I downloaded the wgMLST scheme from the ChewBBACA website and I want to make a STing database out of it. I basically ran this inside the folder of locus fasta files:

ls *.fasta| head | \
  perl -MFile::Basename=basename -lane '
    BEGIN{print "[loci]";} 
    $n=basename($F[0], ".fasta"); 
    $n=~s/INNUENDO_cgMLST-//; 
    print join("\t", $n, $F[0]); 
    END{print "[profile]"; print "profile\tprofile.txt";}
  ' > config.txt

touch profile.txt

And then I get this error

[gzu2@monolith3 Salmonella_enterica.stringMLST]$ indexer -c config.txt
Loading sequences from sequences files:

N       Loci    #Seqs.  File
1       00031717        15      ./INNUENDO_cgMLST-00031717.fasta
2       00031718        35      ./INNUENDO_cgMLST-00031718.fasta
3       00031719        14      ./INNUENDO_cgMLST-00031719.fasta
4       00031720        42      ./INNUENDO_cgMLST-00031720.fasta
5       00031721        5       ./INNUENDO_cgMLST-00031721.fasta
6       00031722        30      ./INNUENDO_cgMLST-00031722.fasta
7       00031723        17      ./INNUENDO_cgMLST-00031723.fasta
8       00031724        17      ./INNUENDO_cgMLST-00031724.fasta
9       00031725        23      ./INNUENDO_cgMLST-00031725.fasta
10      00031726        11      ./INNUENDO_cgMLST-00031726.fasta

Total sequences loaded: 209

Loading the profiles file...
ERROR: At least 11 columns (a ST column + # loci in config file) are required in the profiles file but only 0 were found.

lskatz avatar Jun 27 '22 20:06 lskatz