Vedapani

Results 1 comments of Vedapani

I am also interested in faster inference for gemma-3 27b 4 bit quantized model. Would like to know if gemma 3 is supported by TensorRT LLM Engine in python. Thank...