DSLP issues

OOM problem with the model nat_ctc_sd_ss

1

I trained the model "nat_ctc_sd_ss" with the command in the README.md on Tesla V100 GPU, but i got **Out of memory** problem. Is there anything to be changed? My train...

YudiZh

ninja: build stopped: subcommand failed. Traceback (most recent call last): File "/home/nihao/anaconda3/envs/DSLP/lib/python3.7/site-packages/torch/utils/cpp_extension.py", line 1673, in _run_ninja_build env=env) File "/home/nihao/anaconda3/envs/DSLP/lib/python3.7/subprocess.py", line 512, in run output=stdout, stderr=stderr) subprocess.CalledProcessError: Command '['ninja', '-v']' returned...

thunder123321

No glat_sd arch

2

Hi Chengyang, thanks for your great code! I'm trying to reproduce the GLAT+DSLP model, I checked your given training scripts, but I found there is no "--arch glat_sd" registered model...

bbo0924

The shape of probs_seq does not match with the shape of the vocabulary Segmentation fault (core dumped)

6

[/home/nihao/nihao-users2/yuhao/DSLP/env/ctcdecode/ctcdecode/src/ctc_beam_search_decoder.cpp:32] FATAL: "(probs_seq[i].size()) == (vocabulary.size())" check failed. The shape of probs_seq does not match with the shape of the vocabulary [/home/nihao/nihao-users2/yuhao/DSLP/env/ctcdecode/ctcdecode/src/ctc_beam_search_decoder.cpp:32] FATAL: "(probs_seq[i].size()) == (vocabulary.size())" check failed. The shape of...

thunder123321

hyperparameters on iwslt dataset

I use follow hyperparameters run on iwslt14, but it seems performs bad. result show only bleu 26 on iwslt14, Does anyone know the appropriate hyperparameters for the iwslt dataset? thanks...

hhh07

The problem of training and generation scripts of CMLM

3

Hi, thank you for releasing the code！ I have a question about the given bash scripts of training and inference. The training scripts of the CMLM+DSLP `python3 train.py data-bin/wmt14.en-de_kd --source-lang...

JasmineChen123

imputer error while running train.py for GLAT with DSLP

The train command i used: python3 train.py data-bin/wmt14.en-de_kd --source-lang en --target-lang de --save-dir checkpoints --eval-tokenized-bleu \ --keep-interval-updates 5 --save-interval-updates 500 --validate-interval-updates 500 --maximize-best-checkpoint-metric \ --eval-bleu-remove-bpe --eval-bleu-print-samples --best-checkpoint-metric bleu --log-format simple...

SylasreeKS

about inference

2

hi，thankyou for release code！ I have a question about the different pipline between train and inference 。the paper says that in inference stage the predict out of every decoder layer...

piaohe20221128

DSLP
DSLP copied to clipboard

Metadata

KD

OOM problem with the model nat_ctc_sd_ss

ctcdecode install error

No glat_sd arch

The shape of probs_seq does not match with the shape of the vocabulary Segmentation fault (core dumped)

hyperparameters on iwslt dataset

The problem of training and generation scripts of CMLM

imputer error while running train.py for GLAT with DSLP

about inference

← Metadata

Owner

Metadata

DSLP DSLP copied to clipboard

Metadata

← Metadata

Owner

Metadata

DSLP
DSLP copied to clipboard