Vladimir Korshunov

Results 3 issues of Vladimir Korshunov

Hello. Large model trained from scratch has wrong config, resulting in errors below: `RuntimeError: Error(s) in loading state_dict for GPT2LMHeadModel: Missing key(s) in state_dict: "transformer.h.36.ln_1.weight", "transformer.h.36.ln_1.bias", ... , "transformer.h.47.mlp.c_proj.weight", "transformer.h.47.mlp.c_proj.bias"....

## Descriptiom When starting WH3-Mod-Manager, the disk is used for quite a long time (~30s on nvme ssd), and the `config.json` and `config_backup_v2.2.0.json` files are overwritten several times. Each files...

I have remote jupyter notebook server. I've setup on it docker with jupyter as were in earlier versions shown, then installed latest available library from github. At first it wouldn't...