JunZhan2000

Results 11 issues of JunZhan2000

I am trying to train a Chinese model of a conformer. When I train with 4 2080ti, there will be an error in the middle of the epoch: CUDA_ERROR_ILLEGAL_ADDRESS: an...

bug
more info needed

Hello, thank you very much for your code and videos! I'm using this code repository to train on the flowers dataset with a batch size of 32 for 200 epochs,...

Traceback (most recent call last): | 0/1 [00:00

when run inference code, loading the model, but torch.load() returned a dict get error: > File "vall-e/vall_e/__main__.py", line 30, in main ar = torch.load(args.ar_ckpt).to(args.device) AttributeError: 'dict' object has no attribute...

I'm not familiar with the music domain, are there any open-source datasets available for use?

Great job! When will you open source MosIT data?

Hello, thank you very much for your work. Can you give a code for multi-GPU or multi-node training?

> NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs: query : shape=(8, 1024, 1, 64) (torch.float32) key : shape=(8, 1024, 1, 64) (torch.float32) value : shape=(8, 1024, 1, 64) (torch.float32)...

### 描述问题 InternEvo能否load预训练llama2的参数,再继续预训练,用hf的格式还是原始的格式

question