speech-datasets
Transcript issues for 4363614 in earnings-21
https://github.com/revdotcom/speech-datasets/blob/1852d8e8f79745415e17ed294f1de0f884513465/earnings21/transcripts/nlp_references/4363614.nlp#L2-L44
The transcript at the lines linked above seems to have some issues. For example, <unk> appears in place of the company's name and <inaudible> in place of a person's name.
This can be checked against here
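To get a rough idea of how widespread this is in a given file, here is a minimal sketch that counts the placeholder tokens. It assumes the .nlp file is pipe-delimited with a header row containing a column named token; that layout, the function name count_placeholders, and the sample rows are all assumptions made for illustration and are not taken from the actual 4363614.nlp file.

```python
import csv
import io
from collections import Counter

# Placeholder tokens mentioned in this issue.
PLACEHOLDERS = ("<unk>", "<inaudible>")


def count_placeholders(nlp_text, token_column="token"):
    """Count placeholder tokens in pipe-delimited transcript text.

    Assumes the first line is a header and that one column is named
    `token_column` (an assumption about the .nlp layout).
    """
    reader = csv.DictReader(
        io.StringIO(nlp_text), delimiter="|", quoting=csv.QUOTE_NONE
    )
    counts = Counter()
    for row in reader:
        token = (row.get(token_column) or "").strip().lower()
        if token in PLACEHOLDERS:
            counts[token] += 1
    return counts


# Made-up sample rows for illustration only, not real transcript content.
sample = """token|speaker|ts|endTs|punctuation|case|tags
Welcome|1||||UC|[]
to|1||||LC|[]
<unk>|1||||LC|[]
earnings|1||||LC|[]
call|1|||.|LC|[]
<inaudible>|2||||LC|[]
"""

print(count_placeholders(sample))
```

To run it on a real file, read the file contents into a string and pass that to count_placeholders. If the header uses a different column name, pass it as token_column.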