FunASR icon indicating copy to clipboard operation
FunASR copied to clipboard

whisper-larger-v3模型,识别结果没有时间戳,10min音频

Open ExpressGit opened this issue 1 year ago • 1 comments

10分钟的音频 依然没有时间戳 模型;whisper-large-v3

❓ Questions and Help

from funasr import AutoModel

model = AutoModel( model="iic/Whisper-large-v3", vad_model="iic/speech_fsmn_vad_zh-cn-16k-common-pytorch", vad_kwargs={"max_single_segment_time": 30000}, )

DecodingOptions = { "task": "transcribe", "language": None, "without_timestamps": False, }

res = model.generate( DecodingOptions=DecodingOptions, batch_size_s=0, input="data/ch_multi.wav", )

print(res)

  • OS (e.g., Linux): centos7
  • FunASR Version (e.g., 1.0.0):最新
  • ModelScope Version (e.g., 1.11.0):1.14.0
  • PyTorch Version (e.g., 2.0.0):2.1.3
  • How you installed funasr (pip, source):pip
  • Python version:3.10
  • GPU (e.g., V100M32) 3080
  • CUDA/cuDNN version (e.g., cuda11.7):cuda11.8
  • Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1)
  • Any other relevant information:

ExpressGit avatar Jul 06 '24 02:07 ExpressGit

Please update funasr by source code.

LauraGPT avatar Aug 01 '24 17:08 LauraGPT

同问

MJ666-K avatar Dec 03 '24 06:12 MJ666-K

同问,没有时间戳

shidingz avatar Dec 03 '24 12:12 shidingz

有结果了@一下我

MJ666-K avatar Dec 04 '24 12:12 MJ666-K