laurenlong
laurenlong
Excuse me, I am looking to fine-tune the video-llava model, and I noticed that the finetune.sh file mentions three JSON files, but the downloaded dataset only contains videos. Could you...
My error message says: File "/home/Moonlit/Compresso/train.py", line 235, in main() File "/home/Moonlit/Compresso/train.py", line 204, in main from transformers.integrations import AzureMLCallback, ProgressCallback ImportError: cannot import name 'ProgressCallback' from 'transformers.integrations' (/home/longyongliu/.local/lib/python3.10/site-packages/transformers/integrations/__init__.py) It...
Hi, As the output of the model in each token's position represents the possibilities of next token, should the calculation of log_probs be misaligned. I mean "diff_logits[range(diff_logits.shape[0]-1), continue_ids[1:]].sum().item()" instead of...
## Description Same question as https://github.com/NVIDIA/TensorRT/issues/3133 and https://github.com/NVIDIA/TensorRT/issues/2828 . As shown in the figure below, my model has three separate and similar branches starting from a certain point and converging...
What is the relationship between whether or not sparse implementation is picked for TRT inference?
## Description I used apex's ASP to accomplish the N:M sparsity. I get the following result when I use `./trtexec --onnx=sparse.onnx --saveEngine=sparse.trt --sparsity=enable --fp16 --verbose` to generate the engine: ```...