picheny-nyu
picheny-nyu
Using the output to identify sections of speech in parent-toddler conversations to transcribe as input for unsupervised speech recognition fine-tuning. Figure better to miss questionable segments than train on false...
Thanks. I do need diarization, though - I want to process the adult and toddler speech separately. Would you suggest I just use a downleveled version of the diarization pipeline...
If I understand this correctly (and I may not) Diarization pipeline 3.0 seems to use the WeChat embeddings; older versions of the pipeline seem to use the Speechbrain version. I...
I am using the methodology described in https://github.com/DanBerrebbi/AISHELL-4.git which I thought was based on your original work, but perhaps not?