xyliu

Results 3 issues of xyliu

Hello, I found you add a placeholder token to the tokenizer using the follow code: tokenizer._add_tokens([""], special_tokens=True) tokenizer.placeholder_token = "" And the placeholder token is used in the follow code:...

When I train the model, if some modules (parameters) in the model are not involved in the current forward propagation calculation, then the parameters in these modules are not gradient...

stale