CoinCheung
CoinCheung
Hi, thanks for paying attention to this !! This repo is currently designed for full parameter finetuning, but lora freezes most of the parameters. Since they contradict with each other,...
Hi, Thanks for being interested in this repo !!! Is there any experiments post that adding tensor parallelism or zero would improve training performance ?
Maybe the car itself is background category. The visualization code will not show background class, since it is not trained.
@moraxu Hi, does this mean that, after tensorrt 10, the recommended method is: train-pytorch-model -> int8-with-model-optimizer -> export-onnx -> compile-with-tensorrt ?
@moraxu Hi, can we use explicit int8 quantization now ? I mean we use ModelOpt to quantize the pytorch model into int8, and then export it to onnx from pytorch,...
@moraxu Hi, I packed up the code and the associated onnx file, it is accessable here: https://github.com/CoinCheung/eewee/releases/download/0.0.0/code.zip There is a readme file in the zip file, it describes the step...
Hi, Please use tensorrt-10. The code is updated to tensorrt-10.