TTSentdex9000
TTSentdex9000 copied to clipboard
Thoughts on post-processing
I think it would be helpful if you could also provide some original audio samples without the robot sound effect (ideally, reading the same text) so that we can have a good estimation of the frequency range those effects are in. Then we can filter out those ranges in the frequency domain by using a fourier transform package like numpy.fft.fft()