Kaizhi Qian

Results 196 comments of Kaizhi Qian
trafficstars

Can you print the shape of it?

Just let me know the shape.

I mean the shape of the 3rd section

"I can't understand what is the **third section** and how to generate it? What is array that's highlight in picture?" This was your original question. What is the shape of...

There are definitely more than 3 elements in your highlighted area

These are the spectrograms

Again, the shape please. Also, where did you get that metadata?

Those are the speaker embeddings. In that case, you already had the code to generate this. If not, you can write your own very easily. I don't keep the code,...

You probably need to fine-tune your bottleneck dimensions.

There's detailed information in the paper on how to tune the bottleneck.