AmarnathVoicing
I have an issue with memory allocation/deallocation with this module in the Triton server: ` name: "decoder" backend: "onnxruntime" max_batch_size: 0 input [ { name: "ASR", data_type: TYPE_FP32, dims: [...
Thank you! Will try this.
Hello @jeremy110, I am trying to train the 0.6B model (en) with 900 hours of training data (English + Hindi). Is it recommended to train the encoder as well, or is training the decoder enough?
Thank you. I will use the third approach, since I am using code-switching data and can use an aggregate tokenizer. The third point is really helpful.
@UmerrAhsan Did you solve the memory allocation/deallocation issue here? If so, can you share the method? I am getting high latency of more than 1s because of this.