Pedro Valois
Pedro Valois
I also need batch inference. Any update on this?
LMK if you need help and we could open a PR
Thank for the positive response. @dmnunez1993 can you open the PR?
Any plan for adding it this year?
Thank you for the info. Do we have an ETA for next release?
I am not sure about the cost, but on my 4090, I am getting lifetime avg 522 tks/s (although it fluctuates around 400tks/s - 600tks/s).
Is there any update on this? I am looking into pre tokenizing 100Tb of data for NeMo / Megatron and was trying to look for alternatives to HF tokenizers.