DreamSound questions about the hyper parameter of the DB setting

questions about the hyper parameter of the DB setting

Open Everglow-ZJU opened this issue 1 year ago • 0 comments

Dear authors, You claimed in the paper that "using a single NVIDIA RTX-3090 GPU with a training batch size of 4, employing learning rates of 4 × 10−6 for DB", is that correct? I found it slow on my V100GPU under this hyper parameter setting(two and an half hour for training a single concept),my gradient accumulation steps is set to 1, and max_train_steps is 1500, I would appreciate it if you could help me

Jan 04 '24 14:01 Everglow-ZJU

DreamSound DreamSound copied to clipboard

questions about the hyper parameter of the DB setting

DreamSound
DreamSound copied to clipboard