nguyenlm
Results
23
comments of
nguyenlm
Hi @Alexey322 I think the author used padding for doing `stft` (aka fast fourier transform) on all frames of the input audio segment. ```python spec = torch.stft(y, n_fft, hop_length=hop_size, win_length=win_size,...
@yl4579 I have the same question as @yihuitang. What is the reasonable `prod_embd`? And How to use it in training?
For anyone who is facing this issue, try to replace `is_primitive_type` with `is_primitive_type_annotation`