Chuan Meng comments

Repositories
Issues
Comments

Results 5 comments of


                                            Chuan Meng

并行版本的解码函数有错误

我仔细看了一下，没错误吧，gammar_r_l是[tagset_size, tagset_size]，trainsitions也是[tagset_size, tagset_size]

position_ids with left padding

I have the same concern.

Why did you set tokenizer.pad_token_id=0 (

@lywinged Hi, do you keep the pad_token_id 0 for both training and for batch-based inference?

Loss drop in cero

What is the difference between optim="paged_adamw_8bit" and optim="paged_adamw_32bit"?

Interactive mode

I am sorry for the fact that we did not specially design an interactive fashion in our released code. If you want to modify, I suggest you could transfer the...