TensorFlowASR icon indicating copy to clipboard operation
TensorFlowASR copied to clipboard

Preprocess Dataset

Open atanumandal0491 opened this issue 4 years ago • 4 comments

Hi, @usimarit In 'datasets/asr_dataset.py' line 141 u called line 41 of 'augmentations/augmentation.py', which is calling self.signal_augmentations = self.parse(config.pop("signal_augment", {})) self.feature_augmentations = self.parse(config.pop("feature_augment", {})) In config file no dict of signal_augment is present, whereas feature_augment is showing None, results in outputs as None in sub-class '_augment'.

Can you guide me?

atanumandal0491 avatar May 23 '21 04:05 atanumandal0491

@atanumandal0491 Since version >1.0, I removed the predefined signal augmentations (I prefer augmentations implemented in tensorflow rather than nlpaug - to avoid the use of tf.numpy_function for TPU compatibility). If you want to use signal augmentations, you'll have to code them yourself. And the feature augmentations methods currently "predefined" are time masking and frequency masking in specaugment.

Showing the error messages is better for me to know what is happening in your case.

nglehuy avatar May 23 '21 10:05 nglehuy

Hi, I was able to solve the problem. But yet on training using RnntLoss, I am getting "inf" as training loss. Unable to find the solution.

atanumandal0491 avatar May 31 '21 08:05 atanumandal0491

@atanumandal0491 Please tell me more about what rnnt loss you are using (warp rnnt, tf rnnt)? What language you are training? What type of text decoding you are using (characters, subwords, sentence piece)? What version you are using?

nglehuy avatar Jun 01 '21 06:06 nglehuy

Hi, I did a mail to you.

atanumandal0491 avatar Jun 01 '21 08:06 atanumandal0491