Thanks for your brilliant work. Could you provide more training details? In your paper, you mention that to fully train a model, you first train from scratch and then use data...
I have a custom dataset that contains a large number of images. During training, data loading seems to be the bottleneck. However, the dataset does not have classification labels,...
How can I get the txt file train_list?
Thanks for the great work. Could you also provide the model and demo code for the person detection task?
Is there a way to get the recorded audio instead of only the recognized text?