DeepLearningExamples Can we fine tune a fine tuned model for quartznet architecture?

Can we fine tune a fine tuned model for quartznet architecture?

Open wiamfa opened this issue 2 years ago • 1 comments

Hello , is there any pre-trained models for the french language? or Is there any possible way to fine-tune a fine-tuned model the second time?

thanks,

Mar 10 '22 08:03 wiamfa

Hi @wiamfa

there is a pre-trained model for French from the NeMo project.

You can use it in NeMo, or follow these steps to load it in DeepLearningExamples:

Change the extension from .nemo to .tar.gz.
After unpacking you'll find two files: model_weights.ckpt and model_config.yaml.
Make a copy of configs/quartznet15x5_speedp-online-1.15_speca.yaml and replace labels with those from model_config.yaml.
Rename the keys in model_weights.ckpt:

import torch
remap = lambda k: (k.replace('encoder.encoder', 'encoder.layers')
                   .replace('decoder.decoder_layers', 'decoder.layers')
                   .replace('conv.weight', 'weight'))
ckpt = torch.load('model_weights.ckpt')
torch.save({'state_dict': {remap(k): v for k, v in ckpt.items() if 'preproc' not in k}},
           'qn_15x5_french.pt')

Voilà!

Is there any possible way to fine-tune a fine-tuned model the second time?

This should work fine too.

Mar 11 '22 11:03 alancucki

DeepLearningExamples DeepLearningExamples copied to clipboard

Can we fine tune a fine tuned model for quartznet architecture?

DeepLearningExamples
DeepLearningExamples copied to clipboard