Yuecong Min comments

Results 55 comments of


                                            Yuecong Min

Gloss Segment Boundary Assignment Algorithm (GSBA)

Thank you for your attention. We aim to maintain simplicity in the released code, and this specific section primarily enhances boundary accuracy. You can implement GSBA by extending the pseudo...

Torch not compiled with CUDA enabled

@ethio-artifical You just need to comment `model.cuda()` line in [main.py](https://github.com/ycmin95/VAC_CSLR/blob/main/main.py#L118), and modify the pre-trained load process in [main.py](https://github.com/ycmin95/VAC_CSLR/blob/main/main.py#L122) to `state_dict = torch.load(weight_path, map_location=torch.device('cpu'))`. The command to evaluate the pre-trained model...

unable to run successfully when I run main.py

You can check whether you successfully install ctcdecode.

unable to successfully install CTC

As mentioned in README, you can also try the pytorch version of CTC.

unable to successfully install CTC

You can also try [decoder provided in tensorflow](https://www.tensorflow.org/api_docs/python/tf/nn/ctc_beam_search_decoder), which is also adopted by [recent work](https://github.com/FangyunWei/SLRT/blob/67ddead91f66a2757378a43a59294b9e5e5d6ba1/TwoStreamNetwork/modelling/recognition.py#L16).

Visualize Epoch vs Loss

Hi, the loss with in each epoch is recorded in the [loss_value](https://github.com/ycmin95/VAC_CSLR/blob/main/seq_scripts.py#L32) during training, you can maintain a global recorder to plot epoch vs loss.

Does your PHOENIX-2014-T dataset have /1/ folder?

The released code is based on Phoenix14, you can modify the data format according to your requirements.

Visualizing the predicted alignments

Hi, I'd like to clarify that we do not currently utilize SGML files to visualize the alignment. However, you may refer to the discussion about the alignment in [this issue](https://github.com/ycmin95/VAC_CSLR/issues/4)....

ctcdecode

Hi, I have not try this in windows system, perhaps you can find some useful information in [relevant issues here](https://github.com/parlance/ctcdecode/issues?q=windows).

unable to run these repository on google colab

Hi, all experiments are conducted on local severs and I am not familiar with google colab. You can find some tutorials about google colab first, and then try to modify...