Translation example with ctranslate2's Translator.

Open uahmed93 opened this issue 1 year ago • 6 comments

Now that HF model translation is supported via CrossFit, we are working on improving performance with ctranslate2. This depends on adding ctranslate2 support to CrossFit, after which we will need to create a pipeline for it in NDC. (Draft PR)

With a workaround for ctranslate2 in CrossFit, we saw a large performance improvement. On a single GPU, the results are as follows:

| Experiment | Standalone PyTorch inference | Standalone + ctranslate2 | CrossFit + ctranslate2 |
| --- | --- | --- | --- |
| Inference time | ~1hr 50mins | 23min 54sec | 6min 29sec (including extra processing for the workaround: 3sec) |
| BLEU score | - | 0.9585 | 0.9586 |

The BLEU score was calculated against the standalone PyTorch inference output on 74058 sentences.
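To put the timings above in relative terms, here is a small sketch that converts them to seconds and computes the speedup over the standalone PyTorch run. The helper name `to_seconds` and the dictionary keys are just illustrative, and the PyTorch figure is approximate (reported as "~1hr 50mins"), so the ratios are rough.

```python
def to_seconds(hours=0, minutes=0, seconds=0):
    """Convert an h/m/s duration to total seconds."""
    return hours * 3600 + minutes * 60 + seconds


# Timings as reported above (single GPU). The PyTorch number is approximate.
timings = {
    "Standalone PyTorch": to_seconds(hours=1, minutes=50),
    "Standalone + ctranslate2": to_seconds(minutes=23, seconds=54),
    "CrossFit + ctranslate2": to_seconds(minutes=6, seconds=29),
}

baseline = timings["Standalone PyTorch"]

# Speedup of each configuration relative to the PyTorch baseline.
speedups = {name: baseline / t for name, t in timings.items()}

for name, factor in speedups.items():
    print(f"{name}: {timings[name]} s, {factor:.1f}x vs. PyTorch baseline")
```

By this estimate, standalone ctranslate2 is roughly 4.6x faster than the PyTorch baseline, and CrossFit + ctranslate2 is roughly 17x faster.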

uahmed93 avatar Sep 16 '24 15:09 uahmed93

CC: @arhamm1 for awareness of the work here

VibhuJawa avatar Sep 17 '24 06:09 VibhuJawa

Added an example notebook here

uahmed93 avatar Sep 25 '24 19:09 uahmed93

Moving to next sprint per Arham's approval.

Christina-Young-NVIDIA avatar Oct 21 '24 20:10 Christina-Young-NVIDIA

Blocked by HF issue?

Christina-Young-NVIDIA avatar Nov 25 '24 21:11 Christina-Young-NVIDIA

No longer blocked: Vibhu is making CrossFit changes, and once these are complete this can proceed. Pushing to the December sprint.

Christina-Young-NVIDIA avatar Dec 02 '24 16:12 Christina-Young-NVIDIA

@uahmed93 What is the latest on this?

sithape2025 avatar Jan 06 '25 21:01 sithape2025

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

github-actions[bot] avatar Jul 27 '25 02:07 github-actions[bot]

Closed by https://github.com/NVIDIA-NeMo/Curator/pull/336.

sarahyurick avatar Jul 28 '25 16:07 sarahyurick