JunZhan2000
I am trying to train a Chinese Conformer model. When I train on four 2080 Ti GPUs, an error occurs partway through an epoch: CUDA_ERROR_ILLEGAL_ADDRESS: an...
Hello, thank you very much for your code and videos! I'm using this code repository to train on the flowers dataset with a batch size of 32 for 200 epochs,...
Traceback (most recent call last): | 0/1 [00:00
When running the inference code, loading the model fails because torch.load() returned a dict: > File "vall-e/vall_e/__main__.py", line 30, in main ar = torch.load(args.ar_ckpt).to(args.device) AttributeError: 'dict' object has no attribute...
I'm not familiar with the music domain. Are there any open-source datasets available for use?
Great job! When will you open-source the MosIT data?
Hello, thank you very much for your work. Could you provide code for multi-GPU or multi-node training?
> NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs: query : shape=(8, 1024, 1, 64) (torch.float32) key : shape=(8, 1024, 1, 64) (torch.float32) value : shape=(8, 1024, 1, 64) (torch.float32)...
### Describe the issue Can InternEvo load pretrained llama2 parameters and then continue pretraining? Should the HF format or the original format be used?