icefall issues

ASR SEAME Recipe

1

This pull request is for the SEAME recipe and includes the following:
1. Standard Zipformer with RNNT loss
2. Zipformer with hybrid autoregressive transducer (HAT) loss
3. Proposed Zipformer with...

AmirHussein96

Add TTS for the aishell3 Chinese dataset

csukuangfj

A question about the data preparation on AMI corpus

Hello, thank you for making such a great toolkit. I am reproducing [pruned_transducer_stateless7](https://github.com/k2-fsa/icefall/tree/c45e9fecfb89bada0233a7b6cd9626fb6633a696/egs/ami/ASR/pruned_transducer_stateless7) on the AMI corpus. When I applied the GSS enhancer to the training data (prepare.sh: stage 3, local/prepare_ami_gss.sh: stage 4),...

hiranoyu0830

FYI: Slides for the Interspeech 2023 tutorial

4

Please see https://livejohnshopkins-my.sharepoint.com/:p:/g/personal/mwiesne2_jh_edu/EYqRDl8cIr5BsVDxi1MOW5EBUpdqh10WFkzqixPIFM63hg?e=u3lrmL

csukuangfj

Recipe for Multi-lingual LibriSpeech

Add a recipe for [MLS](https://arxiv.org/pdf/2012.03411). More descriptions will follow.

marcoyang1998

Support CTC/AED option for Zipformer recipe

This PR adds support for a CTC/AED system in the `zipformer` recipe.
* CTC/AED results on LibriSpeech, trained for 50 epochs (`--ctc-loss-scale=0.1`, `--attention-decoder-loss-scale=0.9`), decoding method: sample 100-best paths from the CTC lattice and rescore with...

yaozengwei

[WIP] Recipes for ICMC-ASR competition

Hi all, this PR adds a recipe for the [ICMC-ASR competition](https://icmcasr.org/). It includes processing of the IHM and SDM data, along with the baseline AEC-IVA and GSS approaches....

wd929

Troubles adapting SURT AMI model to my data

13

Hi, I am trying to use the SURT AMI recipe to adapt the pre-trained model (SURT_BASE) to my data. I have prepared the dataset similarly to the AMI & ICSI dataset preparations, CutSet contains...

kfmn

WIP: Add doc about FST-based CTC forced alignment.

2

It is based on [CTC FORCED ALIGNMENT API TUTORIAL](https://pytorch.org/audio/main/tutorials/ctc_forced_alignment_api_tutorial.html) from [torchaudio](https://github.com/pytorch/audio), but we are using an FST-based approach. I can produce identical output with [torchaudio](https://github.com/pytorch/audio) using https://github.com/k2-fsa/kaldi-decoder. I am refactoring...

csukuangfj

Errors noticed after extensively testing Zipformer model

36

I have used the Zipformer model extensively (both streaming and non-streaming variants) and I have noticed the following errors. The tests were done with greedy search as well as...

kafan1986
