
How to insert personal data (DeepSpeech format)?

Open elpimous opened this issue 6 years ago • 0 comments

Hi! First, BIG thanks for your software! Very useful.

Well, I'm French, and thanks to your software I was able to download nearly 60 GB of French data! But I also have my own personal data, and I'd like to feed it into the database so that it is included in the next processing step (lm...). My data is simple, a single directory:

 axel_dev
     neo.csv
          wav_filename,wav_filesize,transcript
          ...
          record.4.wav,44204,oui d'accord
          ...
     record.4.wav
     ...
     (46 audio files)

My data is similar to the nicolas dataset!

Do I just have to rename my CSV like the nicolas one, i.e. to data.csv?


I did that, and this is all I get in the terminal:

   -------------------------------------------------

   <---> Language                      [French]
                                       [fr]

   -------------------------------------------------

   <---> Inserter
                     
   >---> found path                    [/media/nvidia/Data/material_deepspeech2/deepspeech-cleaner-master/languages/fr/datasets/neo/axel_dev]
   <---< recognize dataset             [nicolas]
   >---> csv found                     [1]
   >---> processing                    [7]
   >---> analyzing neo                 
   >---> inserting in db               

   -------------------------------------------------

Is this right? (Only 7 audio files found?)
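
One way to narrow down the 7 versus 46 mismatch is to check the CSV against the directory before inserting. The following is only a minimal sketch: `check_dataset` is a hypothetical helper, not part of deepspeech-cleaner, and it assumes the CSV sits in the same directory as the audio files and uses the `wav_filename,wav_filesize,transcript` header shown above.

```python
import csv
from pathlib import Path


def check_dataset(csv_path):
    """Compare a DeepSpeech-style CSV against the audio files beside it.

    Hypothetical helper: assumes the CSV and the WAV files share one
    directory and that the header is wav_filename,wav_filesize,transcript.
    """
    csv_path = Path(csv_path)
    root = csv_path.parent
    report = {"rows": 0, "missing": [], "size_mismatch": []}

    with csv_path.open(newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle):
            report["rows"] += 1
            wav = root / row["wav_filename"]
            if not wav.is_file():
                # Listed in the CSV but not present on disk.
                report["missing"].append(row["wav_filename"])
            elif wav.stat().st_size != int(row["wav_filesize"]):
                # Present, but the size differs from the CSV entry.
                report["size_mismatch"].append(row["wav_filename"])

    # WAV files on disk, whether or not the CSV lists them.
    report["wav_on_disk"] = len(list(root.glob("*.wav")))
    return report
```

Calling `check_dataset("axel_dev/data.csv")` returns the number of CSV rows, the files that are missing or have a different size, and the number of WAV files on disk. If `rows` and `wav_on_disk` are both 46, the input is consistent and the `[7]` in the log may be counting something other than audio files.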

How can I check that it is correct? By opening config.db?
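
If config.db turns out to be an SQLite file (an assumption, nothing above confirms it), a quick look at its tables and row counts could show whether the new rows arrived. This is a minimal sketch using only Python's standard `sqlite3` module. `table_row_counts` is a hypothetical helper and no table names are assumed.

```python
import sqlite3
from pathlib import Path


def table_row_counts(db_path):
    """Return {table_name: row_count} for every table in an SQLite file.

    Assumption: db_path points at an SQLite database. The file is opened
    read-only, so a wrong path raises an error instead of creating a file.
    """
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        names = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master "
                "WHERE type = 'table' ORDER BY name"
            )
        ]
        # Quote each table name so unusual identifiers still work.
        return {
            name: conn.execute(f'SELECT COUNT(*) FROM "{name}"').fetchone()[0]
            for name in names
        }
    finally:
        conn.close()
```

Running `table_row_counts("config.db")` before and after the insert step and comparing the two results would show which table, if any, grew and by how many rows.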

Thanks a lot for your help. Vincent (elpimous-robot)

elpimous, Feb 02 '19 16:02