vocal-remover icon indicating copy to clipboard operation
vocal-remover copied to clipboard

Num Samples

Open ripnonfegames opened this issue 2 years ago • 8 comments

Whenever I run a training perimeter, I keep getting an error even though I'm following the exact instructions from the readme: python train.py --dataset path/to/dataset --reduction_rate 0.5 --mixup_rate 0.5 --gpu 0 path/to/dataset in my case is: C:\Users\hinds\Downloads\Compressed\vocal-remover\practice\, Yet I keep getting this specific error: num_samples should be a positive integer value, but got num_samples=0

Heres my Log: train_20220319195133.log

ripnonfegames avatar Mar 20 '22 00:03 ripnonfegames

If the number of instruments is different from that of mixtures, the error occurs. Please check your dataset directory and make sure that the numbers are the same.

tsurumeso avatar Mar 20 '22 16:03 tsurumeso

Number of instruments as in?

ripnonfegames avatar Mar 20 '22 16:03 ripnonfegames

dataset

tsurumeso avatar Mar 31 '22 13:03 tsurumeso

@tsurumeso I have the same error, while the number of instruments and mixes files matches

[](https://i.imgur.com/JgwnJaK.png)

Nekitt1 avatar Apr 09 '22 22:04 Nekitt1

I get the same error. I have the same number of each, and the output even shows that it paired them up. But it still says that the number of samples is 0.

AronYstad avatar Apr 30 '22 16:04 AronYstad

If you use the default parameters, the size of the dataset should be at least 5 pairs.

tsurumeso avatar May 01 '22 12:05 tsurumeso

That fixed the problem, but now it seems like it's only using one pair for the training. At least in the output, it only lists one pair. Will it switch to another one after a while or do I need to do something? I had to change the batch size to not run out of memory, if that is part of the problem.

AronYstad avatar May 01 '22 18:05 AronYstad

At least in the output, it only lists one pair. Will it switch to another one after a while or do I need to do something?

The pair is a validation dataset. With default parameters, 20% of the dataset is used for validation, and 80% for training. You don't need to do anything.

tsurumeso avatar May 05 '22 14:05 tsurumeso