speech-datasets icon indicating copy to clipboard operation
speech-datasets copied to clipboard

Normalization files for earnings22 dataset

Open elchilinga opened this issue 2 years ago • 6 comments

Hi there,

Are there any normalization files for the earnings-22 dataset? If yes, could you please share it with me?

Thanks in advance.

elchilinga avatar Jun 16 '22 11:06 elchilinga

Hi there,

I want kindly to know, is there any update??

elchilinga avatar Jun 23 '22 06:06 elchilinga

Hi there, Sorry about the delayed response -- when we released earnings-22 we didn't have plans on releasing the normalized files. If possible, could you share what your use case is, there might be a way we could still help.

pique0822 avatar Jun 24 '22 21:06 pique0822

Hi Pique,

Thanks for the response.

I wanted to change the digits into letters, clear punctuation marks, and also change uppercase into lowercase. If you have a script, specified especially for the earnings-22 dataset, it would be good.

elchilinga avatar Jun 25 '22 14:06 elchilinga

Hi elchilinga,

Thanks for sharing what you would use the files for, we really appreciate it. From our perspective, it seems like the best path to help you would certainly be sharing the normalization files. It will take us some time however given that we're nearing the end of a quarter and starting up a new one -- we'll give you an update once we have those files ready.

pique0822 avatar Jun 27 '22 15:06 pique0822

Hi Pique,

Thank you, I will look forward to your response.

elchilinga avatar Jun 29 '22 09:06 elchilinga

Hi there, Are there any updates on this?

AniChilingaryan avatar Dec 23 '22 12:12 AniChilingaryan

Sorry for not getting to this earlier! Added in #45

qmac avatar Aug 26 '24 18:08 qmac