Masao Someki

Results 58 comments of Masao Someki

@rajeevbaalwan The node `/encoders/encoders.0/self_attn/Add` is the masking process. I think increasing the `max_seq_len` will fix this issue! ```python tag_name = 'your model' m = ASRModelExport() # Add the following export...

In the masking process, your input audio seems to have a 171 frame length, while the mask has a 127 frame length. This difference causes this issue. The frame length...

@rajeevbaalwan Yes, it does not work with batched input. If you want to run batched inference, then you need to: 1. Add the dynamic axes for batch dimension in the...

@rajeevbaalwan Sorry for the inconvenience, but currently we have no plan to support batch inference. We have investigated the speed up with batched inference in our paper by tring to...

@rajeevbaalwan I assume that the user of this library is more like an individual who wants to execute the ESPnet model on a low-resource constraint, such as Raspi. If the...

@rajeevbaalwan In the following line, ESPnet-onnx has export function for language models! https://github.com/espnet/espnet_onnx/blob/d617487a12e186f5240a74121f88af328fef2f02/espnet_onnx/export/asr/export_asr.py#L113-L126

Hi @neso613, please install `espnet_model_zoo`. You can install it via `pip install espnet_model_zoo`

Thank you for reporting your issue @shakhbozabdigapporov. I would like to confirm the following: 1. What is your ESPnet and ESPnet-ONNX version? Does the same error occur with the latest...

@shakhbozabdigapporov After some analysis, I came to think that there might be bugs around the decoder part of the Tacotron2 implementation. If it is okay, would you share the `tts_conf`?...

Hello @shakhbozabdigapporov, would you please review the following points? I've thoroughly investigated this issue but couldn't identify any bugs. - Could you attempt to reproduce this issue using the pretrained...