Fangjun Kuang comments

Results: 683 comments of Fangjun Kuang

Strange behavior using PyTorch DDP

> Thanks for the heads up about the torchaudio loss!

@snakers4 You may find https://github.com/danpovey/fast_rnnt useful.

Add acknowledgement to Mobvoi

I did not mean that you are using the name `TCN` and `max-pooling` loss. What I mean is that the network architecture and parameters, i.e., the values of - number...

segfault with large tensor for pytorch

Can you try https://github.com/csukuangfj/optimized_transducer ? It uses less memory and may fix your issue.

There is a problem about training a RNN-T model?

There is also an implementation at https://github.com/csukuangfj/optimized_transducer that uses less GPU memory.

the grad of cpu ,cuda are not same

Please try https://github.com/csukuangfj/optimized_transducer. It produces consistent gradients on CPU and CUDA.

[Feature request] C++ api support maybe with c++20 coroutine enabled

@npuichigo We have an ongoing PR, #99, almost finished, that adds C++ APIs for non-streaming ASR. The streaming part is still under development. You can find an example usage by...

[Feature request] C++ api support maybe with c++20 coroutine enabled

@npuichigo Are you interested in using https://github.com/chriskohlhoff/asio with C++20 coroutines? It may take years for the above proposal to be accepted into the C++ standard.

Do we need endpointing?

> It appears that the server gets stuck on long files such as 10 minutes of Audio.

Are you using the offline server or the streaming server? Are you using...

Do we need endpointing?

I think the streaming version should be able to handle very long utterances. I will test it locally to see whether it is reproducible.

Do we need endpointing?

> It appears that the streaming server expects to know the length of audio upfront.

What do you mean by this? What the server needs to know is the number...