FireRedASR issues

ONNX export script available to try

4

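Many thanks to the Xiaohongshu team for open-sourcing the FireRedASR-AED model. We adapted the model internally and fine-tuned it with wenet, with good results. You are welcome to try the ONNX export script: [FireRedASR-AED-ONNX](https://github.com/coolhuhu/FireRedASR-AED-ONNX)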

coolhuhu

Achieve Over 20% Speedup with PyTorch SDPA

1

The attention computation is the most time-consuming part during inference. The attention implementation in this project is

```python
class DecoderScaledDotProductAttention(nn.Module):
    def __init__(self, temperature):
        super().__init__()
        self.temperature = temperature
        self.INF = float("inf")
        ...
```

wxwmd
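The idea behind the SDPA issue above can be illustrated with a minimal sketch. It compares a conventional hand-written scaled dot-product attention against `torch.nn.functional.scaled_dot_product_attention` (available in PyTorch 2.0 and later). The function names, tensor shapes, and mask convention below are assumptions made for illustration and are not taken from the FireRedASR code; the sketch only checks numerical equivalence and does not measure speed.

```python
import math

import torch
import torch.nn.functional as F


def manual_attention(q, k, v, mask=None):
    """Hand-written attention. q, k, v: (batch, heads, time, d_k).

    mask is boolean, True where a query may attend to a key (assumed convention).
    """
    scores = torch.matmul(q, k.transpose(-2, -1)) / math.sqrt(q.size(-1))
    if mask is not None:
        scores = scores.masked_fill(~mask, float("-inf"))
    return torch.matmul(torch.softmax(scores, dim=-1), v)


def sdpa_attention(q, k, v, mask=None):
    """Same computation through PyTorch's fused SDPA entry point."""
    # A boolean attn_mask uses the same convention: True means "may attend".
    return F.scaled_dot_product_attention(q, k, v, attn_mask=mask)


torch.manual_seed(0)
q = torch.randn(2, 4, 5, 16)   # 5 decoder positions
k = torch.randn(2, 4, 7, 16)   # 7 encoder frames
v = torch.randn(2, 4, 7, 16)

# Padding mask: the second utterance has only 5 valid encoder frames.
mask = torch.ones(2, 1, 5, 7, dtype=torch.bool)
mask[1, :, :, 5:] = False

max_diff = (manual_attention(q, k, v, mask) - sdpa_attention(q, k, v, mask)).abs().max().item()
print(f"max abs difference: {max_diff:.2e}")
```

Both paths use the default scale of `1 / sqrt(d_k)`. A module that divides by a different `temperature` would need to rescale the query first or pass an explicit scale. Any speedup depends on hardware, dtype, and which SDPA backend PyTorch selects.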

Turn into a `pip`-installable package

1

fakerybakery

[ROCm] Add Torch SDPA and xFormers optimization for FireRedASR

1

Hi FireRedTeam, thanks for your great work! This PR adds FireRedASR optimizations for ROCm, targeting AMD Instinct MI300+ GPUs. - Add `docker/Dockerfile.rocm` to quickly set up ROCm7...

sammysun0711

cache enc kv proj for cross-attention

1

The kv-projection in cross-attention is calculated in every decoding step, which is redundant since encoder_outputs doesn't change during the whole decoding phase. This PR adds a simple caching mechanism in cross-attn...

tingqli
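The caching idea in the PR above can be sketched as follows. This is a hypothetical single-head cross-attention layer, not the PR's actual code: the class name `CachedCrossAttention`, the `reset_cache` method, and the `kv_projections` counter are all invented for illustration. The key and value projections of the encoder outputs are computed on the first decoding step and reused afterwards.

```python
import torch
import torch.nn as nn


class CachedCrossAttention(nn.Module):
    """Hypothetical cross-attention that projects encoder outputs only once."""

    def __init__(self, d_model):
        super().__init__()
        self.w_q = nn.Linear(d_model, d_model)
        self.w_k = nn.Linear(d_model, d_model)
        self.w_v = nn.Linear(d_model, d_model)
        self.scale = d_model ** -0.5
        self.cache = None          # holds (k, v) after the first step
        self.kv_projections = 0    # illustration only: counts K/V computations

    def reset_cache(self):
        """Call before decoding a new batch of utterances."""
        self.cache = None

    def forward(self, dec_state, encoder_outputs):
        if self.cache is None:
            # encoder_outputs is fixed during decoding, so project it once.
            self.cache = (self.w_k(encoder_outputs), self.w_v(encoder_outputs))
            self.kv_projections += 1
        k, v = self.cache
        q = self.w_q(dec_state)
        attn = torch.softmax(q @ k.transpose(-2, -1) * self.scale, dim=-1)
        return attn @ v


torch.manual_seed(0)
layer = CachedCrossAttention(d_model=8).eval()
encoder_outputs = torch.randn(2, 10, 8)   # (batch, frames, d_model)

with torch.no_grad():
    for _ in range(5):                     # five decoding steps
        dec_state = torch.randn(2, 1, 8)   # current decoder position
        out = layer(dec_state, encoder_outputs)
    # Reference result computed without the cache for the last step.
    k_ref, v_ref = layer.w_k(encoder_outputs), layer.w_v(encoder_outputs)
    ref = torch.softmax(layer.w_q(dec_state) @ k_ref.transpose(-2, -1) * layer.scale, dim=-1) @ v_ref

print(layer.kv_projections)
```

In a beam-search decoder the cached tensors would also need to follow any reordering or pruning of the batch dimension; the sketch leaves that out.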

Optimize beam search & add flash attention+xformers support

3

SDPA improves performance by approximately 50% and flash attention by nearly 100%, depending on the data and the batch size. The greater the difference in audio length, the better the optimization effect....

xsank

Request: hotword support

9

As the title says.

itzhoujun

Will ONNX export be considered?

24

I tested the model, and its Chinese recognition quality is really impressive. Will ONNX export of the model be considered in future work?

DrewdropLife

Inference optimization

1

Besides the fp16 and flash attention already in the code, are there other ways to speed up LLM-based ASR inference? Thanks!

Sedrick-Song


I got the following error when trying to run it
