FireRedASR issues

What is the best practice for extra long audio?

3

For long audio (e.g.,> 20 min), what is the best practice? What do you think of the following method? - Split into 30s chunks, with 10s overlap. - Get a...

dingkwang

限定torch版本

5

在使用torch==2.6.0的时候，运行inference_fireredasr_aed.sh 会报错，需要稍微降低一下版本 ``` Traceback (most recent call last): File "/home/orange/FireRedASR/examples/fireredasr/speech2text.py", line 105, in main(args) File "/home/orange/FireRedASR/examples/fireredasr/speech2text.py", line 43, in main model = FireRedAsr.from_pretrained(args.asr_type, args.model_dir) File "/home/orange/FireRedASR/examples/fireredasr/models/fireredasr.py", line 25, in from_pretrained...

learningpro

使用hf-mirror下载的FireRedASR-LLM-L发生路径错误

1

huggingface_hub.errors.HFValidationError: Repo id must be in the form 'repo_name' or 'namespace/repo_name'

TES-VC

fix torch error

1

fixes error: ```shell torch/serialization.py", line 1470, in load raise pickle.UnpicklingError(_get_wo_message(str(e))) from None _pickle.UnpicklingError: Weights only load failed. This file can still be loaded, to do so you have two options,...

eschmidbauer

fix sample rate error in asr_feat.py

1

在 `asr_feat.py` 中， 1. `frame_length`, `frame_shift` 被传入了 `KaldifeatFbank` 但并未被使用 2. `wav` 中的 `sample_rate` 也没有传入 knf，导致所有音频都按照 16k 采样率提取特征 ```python class KaldifeatFbank: def __init__(self, num_mel_bins=80, frame_length=25, frame_shift=10, dither=1.0): self.dither = dither opts...

wujian752

May I ask for the the knowledge of hyperparameters (eg. lr, scheduler, optimizer) during training.

may ask if a sketch of hyperparameters can be shown.

ZihanLiao

请教下参考示例代码为什么一直报错

2

def speech_to_text(audio_path): logger.debug("开始语音识别") # 初始化 FireRedASR 模型 model = FireRedAsr.from_pretrained("aed", "pretrained_models/FireRedASR-AED-L") 执行报错 File "D:\AI\FireRedASR\app.py", line 83, in main result = speech_to_text(audio_path) File "D:\AI\FireRedASR\app.py", line 17, in speech_to_text model = FireRedAsr.from_pretrained("aed",...

barood

VLLM 部署

3

请问，如何使用 VLLM 部署 FireRedASR？

AhYi8

请问对粤语的支持怎么样？

5

zhangchongcool

Benchmark with gpt-4o-transcribe

1

It is a great work. But in my experience, it is not as good as gpt-4o-transcribe even for Chinese.

dingkwang

FireRedASR
FireRedASR copied to clipboard

Metadata

What is the best practice for extra long audio?

限定torch版本

使用hf-mirror下载的FireRedASR-LLM-L发生路径错误

fix torch error

fix sample rate error in asr_feat.py

May I ask for the the knowledge of hyperparameters (eg. lr, scheduler, optimizer) during training.

请教下参考示例代码为什么一直报错

VLLM 部署

请问对粤语的支持怎么样？

Benchmark with gpt-4o-transcribe

← Metadata

Owner

Metadata

FireRedASR FireRedASR copied to clipboard

Metadata

← Metadata

Owner

Metadata

FireRedASR
FireRedASR copied to clipboard