GIGA icon indicating copy to clipboard operation
GIGA copied to clipboard

The time to generate the training set

Open TAO-TAO-TAO-TAO-TAO opened this issue 2 years ago • 1 comments

Hello, may I ask some questions about datasets? Does the dataset used to train the model contain one million positive samples and one million negative samples? How much time does it take to generate these datasets in total?

TAO-TAO-TAO-TAO-TAO avatar Dec 13 '23 07:12 TAO-TAO-TAO-TAO-TAO

Hi, I honestly do not remember the exact time. We run multiprocessing on multiple CPU machines to generate the data. In my rough memory, it takes less than 3 days.

Steve-Tod avatar Feb 28 '24 16:02 Steve-Tod