conformer icon indicating copy to clipboard operation
conformer copied to clipboard

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Results 21 conformer issues
Sort by recently updated
recently updated
newest added

In conformer/convolution.py, line 183, the code ``` output_lengths = input_lengths >> 2 output_lengths -= 1 ``` when the result of input_lengths >>2 is xx.75, the torchaudio.transforms.RNNTLoss will raise "input length...

In the Conformer original paper, the number of parameters are However, with the implementation in this repo, the number of parameters are slightly different ``` Conformer small: 10.16 M Conformer...

Transducer inference할 때 audio encoder의 output을 time step마다 한개씩 넣는 이유가 있을까요? Real time을 대비해서 그렇게 inference를 하는 것 같은데 non-real time일때는 어떻게 inference가 되는건지 제가 이해하고 있는게 맞는건지 잘...

I'm not sure, but I think these are the right replacement kwargs: ```py model = Conformer(dim=dim, dim_head=32, depth=3).to(device) ```

I'm confused by the Class Conv2dSubampling in convolution.py.What does the second return output_lengths mean?

I want to test the test conformer in real time as a feature of my project. If anyone has any update kindly share the resources. Thanks

## My enviroment: - Python:3.10.10 - pip:23.0.1 - Kernel:6.2.12-arch1-1 ## problem When I tried to install conformer locally, I encountered error shown below. I'm not so familiar with Python and...

I am using the following training function and librispeech dataset. Every time the output of the model while training become Nan as a result the loss is also nan. What...

thanks alot for the great work .can i use the mult-scale features aggregation conformer for acoustic scene classification ?