TensorFlowASR
:zap: TensorFlowASR: Almost state-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can be written with characters or subwords.
Hi, I have a question about the prediction network in the code. For example: the input training label is as below, and we have a vocabulary of 261 tokens (just for example,...
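For context, the prediction (label/decoder) network of an RNN-Transducer consumes the previously emitted label tokens and produces a label-side representation that is later combined with the encoder output in the joint network. Below is a minimal sketch of such a network, not TensorFlowASR's exact implementation; the 261-token vocabulary, embedding size, and LSTM size are assumptions taken from the example in the question.

```python
import tensorflow as tf

class PredictionNetwork(tf.keras.layers.Layer):
    """Minimal transducer prediction network: embedding + unidirectional LSTM."""

    def __init__(self, vocab_size=261, embed_dim=320, units=320, **kwargs):
        super().__init__(**kwargs)
        self.embed = tf.keras.layers.Embedding(vocab_size, embed_dim)
        self.lstm = tf.keras.layers.LSTM(units, return_sequences=True, return_state=True)

    def call(self, labels, states=None):
        # labels: [batch, label_len] integer ids of previously emitted tokens
        x = self.embed(labels)                        # [batch, label_len, embed_dim]
        x, h, c = self.lstm(x, initial_state=states)  # [batch, label_len, units]
        return x, [h, c]                              # states are carried over for streaming
```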
`TensorFlowASR` makes it quite easy to train and deploy near-SOTA ASR models, but it only provides pretrained models for English. On the other hand, FAIR has recently published...
Hi, thanks for developing this great toolkit. I have two questions about the conformer model: 1. For the conformer model in ```examples/conformer```, I think almost all the...
Hi, I also have a problem related to this ticket (https://github.com/TensorSpeech/TensorFlowASR/issues/44). I trained a CTC model with DeepSpeech2. I used a language model and vocabulary downloaded from http://www.openslr.org/11/ (3-gram.arpa is converted...
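For reference, one common way to combine a CTC model with an n-gram LM such as 3-gram.arpa is shallow fusion during or after decoding (e.g. with a beam-search decoder like pyctcdecode, or by rescoring an n-best list). The sketch below shows the n-best rescoring variant using the KenLM Python bindings; the LM weight, word bonus, and example hypotheses are illustrative assumptions, not values from the issue.

```python
import kenlm  # Python bindings for KenLM

lm = kenlm.Model("3-gram.arpa")  # LM downloaded from http://www.openslr.org/11/

def rescore(nbest, lm_weight=0.5, word_bonus=1.0):
    """Rescore (text, acoustic_score) hypotheses with a KenLM n-gram LM (shallow fusion)."""
    rescored = []
    for text, am_score in nbest:
        score = (am_score
                 + lm_weight * lm.score(text, bos=True, eos=True)
                 + word_bonus * len(text.split()))
        rescored.append((text, score))
    return max(rescored, key=lambda x: x[1])

# Example with two made-up beam-search hypotheses and acoustic scores.
print(rescore([("hello world", -12.3), ("hell oh word", -11.9)]))
```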
It could be fun to support multi-GPU training. The goal is not to increase the batch size but to increase the model size, so the model would have to...
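Note that the usual TensorFlow route, `tf.distribute.MirroredStrategy`, replicates the whole model on every GPU (data parallelism), so it helps with batch size rather than model size. Growing the model beyond one GPU's memory needs the model split across devices. A naive sketch of that idea via explicit device placement is below; the layer types and sizes are hypothetical.

```python
import tensorflow as tf

class TwoDeviceEncoderDecoder(tf.keras.Model):
    """Naive model parallelism: encoder on the first GPU, output head on the second."""

    def __init__(self):
        super().__init__()
        self.encoder = tf.keras.layers.LSTM(2048, return_sequences=True)
        self.head = tf.keras.layers.Dense(512)

    def call(self, x):
        with tf.device("/GPU:0"):
            x = self.encoder(x)   # encoder weights/activations live on GPU:0
        with tf.device("/GPU:1"):
            return self.head(x)   # head weights/activations live on GPU:1

model = TwoDeviceEncoderDecoder()
out = model(tf.random.normal([2, 50, 80]))  # activations are copied between devices
```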
Hi, this is really great work. Thank you very much for the streaming transducers. Is it possible to add hints at runtime in the streaming transducers (Section 4)? (Say I have some...
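One simple way to realize runtime hints is contextual biasing: boosting the scores of hint tokens before each greedy/beam selection step. The sketch below is a very naive illustration of that idea, not a feature of TensorFlowASR; the vocabulary size, token ids, and bonus value are made up.

```python
import numpy as np

def bias_step_scores(step_log_probs, hint_token_ids, bonus=1.5):
    """Naive contextual biasing: boost hint tokens' scores at one decoding step.

    step_log_probs: [vocab] log-probabilities from the joint network at this step.
    hint_token_ids: token ids of the runtime hints (e.g. names in a contact list).
    """
    biased = step_log_probs.copy()
    biased[hint_token_ids] += bonus
    return biased

# Example: token 42 gets a boost before the greedy/beam selection at this step.
step = np.log(np.full(261, 1.0 / 261))
print(np.argmax(bias_step_scores(step, [42], bonus=2.0)))  # -> 42
```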
Is it possible to output token-level timestamps? e.g. hello 100-600, world 712-900, ...
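For a CTC model, token timestamps can be derived from the frame indices at which the greedy path emits each token, multiplied by the model's output frame stride; word spans like "hello 100-600" would then come from merging the spans of the word's characters or subwords. A rough sketch, assuming a 10 ms output stride (the real value depends on the feature hop and subsampling):

```python
import numpy as np

def greedy_ctc_with_timestamps(logits, frame_stride_s=0.01, blank=0):
    """Greedy CTC decode returning (token_id, start_s, end_s) for each emitted token.

    logits: [time, vocab] per-frame scores from the acoustic model.
    frame_stride_s: seconds per output frame (0.01 here is just an assumption).
    """
    best = np.argmax(logits, axis=-1)
    spans = []          # [token_id, first_frame, last_frame]
    prev = blank
    for t, idx in enumerate(best):
        if idx != blank and idx != prev:
            spans.append([int(idx), t, t])   # a new token starts on this frame
        elif idx != blank and spans:
            spans[-1][2] = t                 # same token repeated: extend its end frame
        prev = idx
    return [(tok, s * frame_stride_s, (e + 1) * frame_stride_s) for tok, s, e in spans]

# Example: frame path [blank, 1, 1, blank, 2] -> two tokens with their time spans.
demo = np.eye(3)[[0, 1, 1, 0, 2]]
print(greedy_ctc_with_timestamps(demo))
```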
What would be the best way to show/extract the confidence of the recognition (greedy or beam search) during inference? Sequence- or word-level confidence would be interesting to see.
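One common heuristic for greedy decoding is to average the softmax probability of the chosen class over the frames that emit a token; alternatives include the product (or geometric mean) of per-frame probabilities, the per-token minimum, or, for beam search, the score gap between the top two hypotheses. A minimal sketch of the first heuristic, with made-up inputs:

```python
import numpy as np

def greedy_confidence(logits, blank=0):
    """Mean probability of the greedy path over emitting (non-blank) frames."""
    # Softmax over the vocabulary dimension.
    exps = np.exp(logits - logits.max(axis=-1, keepdims=True))
    probs = exps / exps.sum(axis=-1, keepdims=True)
    best = probs.argmax(axis=-1)
    frame_conf = probs[np.arange(len(best)), best]
    emitting = best != blank                 # only count frames that emit a token
    return float(frame_conf[emitting].mean()) if emitting.any() else 0.0

# Example with dummy scores for a 5-frame, 4-class output.
print(greedy_confidence(np.random.randn(5, 4)))
```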
Hi, self-supervised pretraining of speech representations is a promising technique for developing ASR for resource-constrained languages with little transcribed data, and SimCLR has been applied with success for this purpose in...
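At the core of SimCLR is the NT-Xent contrastive loss: two augmented views of each utterance are embedded, and each view must identify its counterpart among all other embeddings in the batch. A self-contained sketch of that loss in TensorFlow (the temperature and embedding sizes are illustrative assumptions):

```python
import tensorflow as tf

def nt_xent_loss(z1, z2, temperature=0.1):
    """NT-Xent loss over two batches of embeddings (two augmented views per utterance)."""
    batch_size = tf.shape(z1)[0]
    z1 = tf.math.l2_normalize(z1, axis=1)
    z2 = tf.math.l2_normalize(z2, axis=1)
    z = tf.concat([z1, z2], axis=0)                         # [2N, D]
    sim = tf.matmul(z, z, transpose_b=True) / temperature   # [2N, 2N] cosine similarities
    sim += tf.eye(2 * batch_size) * -1e9                    # mask self-similarities
    # The positive for example i is its other augmented view.
    positives = tf.concat([tf.range(batch_size, 2 * batch_size),
                           tf.range(0, batch_size)], axis=0)
    loss = tf.keras.losses.sparse_categorical_crossentropy(positives, sim, from_logits=True)
    return tf.reduce_mean(loss)

# Example with dummy embeddings for two views of a batch of 8 utterances.
z1, z2 = tf.random.normal([8, 128]), tf.random.normal([8, 128])
print(nt_xent_loss(z1, z2).numpy())
```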
I have tried to use warprnnt-tensorflow to compute the loss; however, it turns out the warprnnt-tensorflow binding does not support our GPU (an A30). So I think I need to focus on...