Vladimir Bataev comments


Dgalvez/cuda graphs greedy rnnt inference squash

@galv as far as I can see, the issue can be fixed by passing the appropriate device to the CUDA stream initializers and getters: ``` def with_conditional_node(while_loop_kernel, while_loop_args, while_loop_conditional_handle, device): ... body_stream = torch.cuda.Stream(device=device) previous_stream...

Dgalvez/cuda graphs greedy rnnt inference squash

@galv I tried some changes, and it seems I can get it to work. But I'm wondering why these changes are required and why everything works when creating a graph for...

Dgalvez/cuda graphs greedy rnnt inference squash

@galv I manually restarted Jenkins, but it is still waiting for an executor

Dgalvez/cuda graphs greedy rnnt inference squash

@galv please fix the test failing on Jenkins (the guard is needed) > FAILED tests/collections/asr/decoding/test_cuda_graph_rnnt_greedy_decoding.py::test_change_devices - ImportError: Found cuda-python 12.3.0rc4+8.gcb4e395, but at least version 12.3.0 is needed.
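The error above reads oddly at first, since `12.3.0rc4` looks like a 12.3.0 build. Under PEP 440 ordering a release candidate sorts before the final release with the same number, so a "at least 12.3.0" check rejects it. The sketch below illustrates only that ordering rule; `parse_version` and `meets_minimum` are hypothetical helpers written for this example, not the check NeMo actually runs, and the parser is deliberately simplified (a full implementation would normally come from a library such as `packaging`).

```python
import re


def parse_version(text):
    """Turn a version such as '12.3.0rc4+8.gcb4e395' into a sortable key.

    Simplified on purpose: handles 'N.N.N', an optional 'rcN' suffix and an
    optional '+local' tag (ignored here). This is not a full PEP 440 parser,
    and it assumes both versions have the same number of release components.
    """
    match = re.fullmatch(r"(\d+(?:\.\d+)*)(?:rc(\d+))?(?:\+.*)?", text)
    if match is None:
        raise ValueError(f"unsupported version string: {text!r}")
    release = tuple(int(part) for part in match.group(1).split("."))
    rc = match.group(2)
    # A release candidate sorts before the final release with the same number.
    pre_key = (0, int(rc)) if rc is not None else (1, 0)
    return release, pre_key


def meets_minimum(found, minimum):
    """Return True if `found` is at least `minimum` under the ordering above."""
    return parse_version(found) >= parse_version(minimum)


print(meets_minimum("12.3.0rc4+8.gcb4e395", "12.3.0"))  # rc build vs. final
print(meets_minimum("12.3.0", "12.3.0"))                # final vs. final
```

With this ordering the release-candidate build from the Jenkins log fails the minimum-version check, which is consistent with the test needing a guard on machines that have a pre-release cuda-python installed.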

Dgalvez/cuda graphs greedy rnnt inference squash

jenkins

clarification : gram ctc - alphabet_size ? "a","b" or "ab" single logits output?

I'm sorry, Gram-CTC is not yet implemented, but it is the first-priority future task: [https://github.com/artbataev/end2end#future-plans](https://github.com/artbataev/end2end#future-plans), and I'm working on it. For now only CTC-Loss and CTC Beam Search Decoder with...


Riva and k2 ASR WFST decoding (2)

Riva and k2 ASR WFST decoding (2)

CTC Greedy Decoding with NGPU-LM (N-Gram LM on GPU)

[NeMo 2.6] RNNT ASR inference fails on A100 (CUDA 12.8, PyTorch 2.9) with CUDA Graphs error CUDA failure! 35