Ao Tang comments

Results: 25 comments by Ao Tang

Add Apex import Guard for MM collection

> Please capture the error and include it in the ImportError so users will know exactly what's not being imported properly.

ImportError is raised at the model level, not the layer level...
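One way to read the request above is the import-guard pattern sketched below: the original exception is stored when the optional import fails, then surfaced later in an `ImportError` raised when the model is constructed. This is only an illustrative sketch, not the code from the PR. The module name `some_optional_dependency_xyz`, the flags `HAVE_DEP` and `IMPORT_ERROR`, and the class `ToyModel` are all hypothetical stand-ins.

```python
# Hypothetical optional dependency (a stand-in for a package such as Apex).
# The name is deliberately one that does not exist, so the guard triggers.
try:
    import some_optional_dependency_xyz  # noqa: F401

    HAVE_DEP = True
    IMPORT_ERROR = None
except ImportError as exc:
    # Keep a reference: `exc` itself is unbound once the except block ends.
    HAVE_DEP = False
    IMPORT_ERROR = exc


class ToyModel:
    """Hypothetical model that needs the optional dependency."""

    def __init__(self):
        # Raise once, at the model level, and carry the original failure
        # both in the message and as the chained cause.
        if not HAVE_DEP:
            raise ImportError(
                "ToyModel requires an optional dependency that failed to "
                f"import: {IMPORT_ERROR}"
            ) from IMPORT_ERROR
```

Chaining with `from IMPORT_ERROR` keeps the original traceback attached, so a user sees which import failed rather than a generic message.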

Add Nemotron HF Support

Hi @ArthurZucker, I updated the PR with the changes @suhara and @gheinrich had before. Let me know how you want to proceed. Thanks!

Add Nemotron HF Support

@ArthurZucker I have a question about the `Copied from` convention: if the code is not exactly the same but adapted from it, should I still have the `Copied from`...

Add Nemotron HF Support

@ArthurZucker All the issues have been resolved. Please take another look! Thanks a lot for the review! It looks like some CI checks are still failing. I'll address them...

Add Nemotron HF Support

> IMO the best thing is to use `PreTokenizerFast`

But `PreTrainedTokenizerFast` does not support directly loading a SentencePiece (`tokenizer.model`) file.

Add Nemotron HF Support

> > But PreTrainedTokenizerFast does not support directly loading a SentencePiece (tokenizer.model) file.
>
> It does not, but we can simply convert it in the conversion script! It's really up...

hugging face saved model inference

Hi @riyajatar37003, if you convert the NeMo checkpoint to an HF checkpoint using `export_ckpt`, you should be able to run inference on the HF side using a script similar to the one below:...

Add Gemma3 VL model

Thanks @xiangxu-google for the PR. I will try to verify this on our end as well.

Add Gemma3 VL model

Hi @xiangxu-google, have you tested inference and compared the results between NeMo and HF? I observed that, for the 1B model at least, the NeMo output is...

Add Gemma3 VL model

Thanks @xiangxu-google, could you summarize what you changed? It is hard to see your changes with the force merge. I also managed to review your changes and...