Kun Zhou

Results 3 issues of Kun Zhou

Thanks for sharing the Audiocap dataset. However, I note that the audiocap provided only contains training files. Can you also share dev and test files? Thanks a lot!

enhancement

Thanks for your implementation! I am curious if you could share data preprocessing scripts to get "audio_ann_sum.txt" and "audio_sum.hdf5" on mixed of Libritts and aishell?

I reproduced TTA results accordingly, however, they are far from the performance that reported in the original AUDIT paper. I wonder if there is any pre-trained models or demos from...