Fangjun Kuang issues

Results: 152 issues of Fangjun Kuang

Print informative error messages if upload fails.

It took me more than 3 hours to figure out the real reason why the upload failed. I hope this pull request saves others from wasting their time. The code is from https://github.com/Anaconda-Platform/anaconda-client/issues/501#issuecomment-470742898...

Add acknowledgement to Mobvoi

The TCN model and the max-pooling loss are basically the same as those used inside Mobvoi. Also, one of the contributors did his internship at Mobvoi. I would recommend...

[Mirror Request] Next-gen Kaldi (k2)

### Prerequisites - [X] I am sure that this repo has NEVER been discussed in [other issues](https://github.com/tuna/issues). - [X] I have confirmed that this mirror is not already online at the mirror site I selected...

Mirror Request

Add endpoint detection for streaming greedy search

[Help is wanted] Support mace/onnx/tflite

`libtorch` is easy to use. However, its shared libraries are large; see below. ![80a](https://user-images.githubusercontent.com/5284924/177512166-780505e6-21b5-4ece-96cb-6e2553ca4ae7.png) It would be nice to support the following frameworks that are more lightweight compared...

help-is-wanted

WIP: Begin to add C++ API for streaming ASR

Will first make it work for the streaming model [ConvEmformer2](https://github.com/k2-fsa/icefall/tree/master/egs/librispeech/ASR/conv_emformer_transducer_stateless2)

WIP: Add timestamp

Here are some initial results. For the following test wav (Note: Its name should be `1089-134686-0002.wav`, not `1089-134686-0001.wav`) https://huggingface.co/csukuangfj/icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13/blob/main/test_wavs/1089-134686-0001.wav The ground-truth word timestamps from https://github.com/CorentinJ/librispeech-alignments are ``` ",AFTER,EARLY,NIGHTFALL,THE,YELLOW,LAMPS,WOULD,LIGHT,UP,HERE,AND,THERE,THE,SQUALID,QUARTER,OF,THE,BROTHELS," "0.360,0.730,1.040,1.770,1.900,2.160,2.590,2.760,3.070,3.270,3.520,3.660,4.090,4.210,4.780,5.310,5.420,5.500,6.160,6.625"...

FYI: Next-gen Kaldi QQ discussion group

The group number is 744602236. If you run into any problems or have any suggestions, feel free to raise them in the group or on GitHub. See also https://github.com/k2-fsa/icefall/issues/498

WIP: Begin to add doc for developers.

Try sherpa from within your browser without installing anything

FYI: We have created a huggingface space https://huggingface.co/spaces/k2-fsa/automatic-speech-recognition that uses pre-trained models from [icefall](https://github.com/k2-fsa/icefall) together with sherpa for automatic speech recognition. You can try it using your browser, either uploading...