jeremy110

Results 20 comments of jeremy110

You can use the code to save the picture. ![1589448253049](https://user-images.githubusercontent.com/52516810/81917551-e9393780-9607-11ea-88cf-cf5c99f408de.jpg)

I revise the code.It works for me. ![1589447687509](https://user-images.githubusercontent.com/52516810/81916532-84311200-9606-11ea-809c-18e7636c5fd6.jpg)

I can reply your second question.When you run the speakerDiarization.py,you can change both of them in here. ![1589447948103](https://user-images.githubusercontent.com/52516810/81917137-631cf100-9607-11ea-90f7-175e0fa8da1f.jpg)

I don't have a great solution because each feature of audio isn't the same. I test 10 min news of Taiwan and then embedding_per_second=0.9 and overlap_rate=0.3 are absolutely great for...

In your first href.It's to produce weight.h5. ![1595398941926](https://user-images.githubusercontent.com/52516810/88141772-d4d87300-cc26-11ea-948b-640ce7d2b36c.jpg)

You can directly reduce your batch size and increase the epochs.

If you know that the speaker number correspond to your speaker name,you can add your speaker name here in advance.

Let me make sure I have it right.So you want to revise the predict function and it can return your speaker name?

Maybe you can add your speaker name which cluster_id correspond to in BeamState in uisrnn.py and use it when prediction function return the results.

you must assure the shape of feats is right.For example:(256,512),the 512 is your observation_dim