SLAM-LLM Data Form of the MaLa-ASR

Data Form of the MaLa-ASR

Open zsLin177 opened this issue 5 months ago • 2 comments

System Info

torch 2.1

Information

[X] The official example scripts
[ ] My own modified scripts

🐛 Describe the bug

bash decode_MaLa-ASR_withkeywords_L95.sh

Hi, I'm currently working on reproducing the results of MaLa-ASR and have downloaded the slidespeech dataset from https://www.openslr.org/144/. While running the provided decoding script, I noticed that it requires the file located at /nfs/yangguanrou.ygr/slidespeech/${split}_oracle_v1/. Could you please clarify what the format of this file is? Do I need to preprocess the downloaded data in any specific way, such as splitting the audio based on timestamps?

Error logs

no file named test_oracle_v1

Expected behavior

Could you please provide the steps for data processing and explain the format of the data? Thanks, looking forward to your reply.

Aug 28 '24 02:08 zsLin177

SLAM-LLM SLAM-LLM copied to clipboard

Data Form of the MaLa-ASR

System Info

Information

🐛 Describe the bug

Error logs

Expected behavior

SLAM-LLM
SLAM-LLM copied to clipboard