FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Results 555 FunASR issues

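## 📚 Documentation The documentation lists Whisper-large-v3 (⭐🍀) as "speech recognition, with timestamps, non-streaming", which says whisper-large-v3 provides timestamps, but the output I get contains only text. Do timestamps require additional configuration?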

documentation

run_server.sh refers to /websocket/build/bin, but the project doesn't have a build folder

question

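The model is initialized once and then transcribes two audio files in a row. The first runs on the GPU, but the second does not. Calling torch.set_num_threads(4) did not work for me.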

question

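As far as I can see, the README only covers GPU offline speech recognition, not real-time recognition. Alternatively, if the real-time speech recognition server is deployed with Docker, how do I use GPU acceleration?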

question

With the same dataset, single-GPU training works fine, but multi-GPU training fails with an error. I have already tried gradually reducing batch_size, which did not solve the problem. [rank0]:[E ProcessGroupNCCL.cpp:563] [Rank 0] Watchdog caught collective operation timeout: WorkNCCL(SeqNum=3437, OpType=ALLREDUCE, NumelIn=4311252, NumelOut=4311252, Timeout(ms)=600000) ran for 600057 milliseconds before timing out. [rank1]:[E ProcessGroupNCCL.cpp:563] [Rank 1] Watchdog caught collective...

question