Branch: main, commit id: 66ef1df492f7bc9c8eeb01d7e14db01838e3f0bd

```
model=/data/vicuna-13b/vicuna-13b-v1.5/
tp=2

python convert_checkpoint.py --model_dir ${model} \
    --output_dir ./tllm_checkpoint_2gpu_fp16 \
    --dtype float16 --tp_size ${tp}

trtllm-build --checkpoint_dir ./tllm_checkpoint_2gpu_fp16 \
    --output_dir ./tmp/llama/13B/trt_engines/fp16/2-gpu \
    --gemm_plugin float16 \
    --use_fused_mlp \
    ...
```
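For completeness, a minimal sketch of how the resulting 2-GPU engine could be loaded and queried from Python. This assumes the 0.9-era `tensorrt_llm.runtime.ModelRunner` API and a HF tokenizer at the model path; with `tp=2` the script has to be launched under MPI (e.g. `mpirun -n 2 python run_sketch.py`):

```python
# Sketch only: assumes the TensorRT-LLM 0.9-era ModelRunner API.
import torch
from transformers import AutoTokenizer

import tensorrt_llm
from tensorrt_llm.runtime import ModelRunner

engine_dir = "./tmp/llama/13B/trt_engines/fp16/2-gpu"
tokenizer = AutoTokenizer.from_pretrained("/data/vicuna-13b/vicuna-13b-v1.5/")

# Each MPI rank loads its own shard of the tp=2 engine.
runner = ModelRunner.from_dir(engine_dir=engine_dir, rank=tensorrt_llm.mpi_rank())

batch_input_ids = [tokenizer("Hello, my name is", return_tensors="pt").input_ids[0].int()]
with torch.no_grad():
    output_ids = runner.generate(
        batch_input_ids,
        max_new_tokens=32,
        end_id=tokenizer.eos_token_id,
        pad_id=tokenizer.eos_token_id,
    )

# output_ids is [batch, num_beams, seq_len]; print from rank 0 only.
if tensorrt_llm.mpi_rank() == 0:
    print(tokenizer.decode(output_ids[0][0], skip_special_tokens=True))
```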
> You can check the example version [here](https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/version.py) and compare it with the version of the installed `tensorrt_llm` package.

Hi, I checked it; it's `0.9.0.dev2024030500`.
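For anyone else comparing versions: the linked `version.py` defines `__version__`, so the installed package's version can be read directly:

```python
import tensorrt_llm

print(tensorrt_llm.__version__)  # prints e.g. 0.9.0.dev2024030500
```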
> > And I also want to know what the purpose of `materialize_type` is. Are there any keywords to learn about this programming paradigm?
>
> Basically, `materialize_type` will create...
> Wow, thank you! I suggest that you put this message in the README. It's a very good feature.
> Do you have more profile data from Nsight Compute? That can be a good guide for perf debugging.
>
> BTW, have you done any autotuning of your layer norm Triton...
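Not the original poster, but for reference, this is roughly what autotuning a layer-norm kernel with `triton.autotune` looks like. A sketch, not the kernel discussed here: the configs are illustrative, and only `num_warps` is tuned since the block size is fixed to the padded row width.

```python
import torch
import triton
import triton.language as tl

@triton.autotune(
    configs=[
        triton.Config({}, num_warps=4),
        triton.Config({}, num_warps=8),
        triton.Config({}, num_warps=16),
    ],
    key=["N"],  # re-benchmark whenever the row width changes
)
@triton.jit
def _layer_norm_fwd(X, Y, W, B, stride, N, eps, BLOCK_SIZE: tl.constexpr):
    # One program normalizes one row of X.
    row = tl.program_id(0)
    cols = tl.arange(0, BLOCK_SIZE)
    mask = cols < N
    x = tl.load(X + row * stride + cols, mask=mask, other=0.0).to(tl.float32)
    mean = tl.sum(x, axis=0) / N
    xc = tl.where(mask, x - mean, 0.0)
    var = tl.sum(xc * xc, axis=0) / N
    rstd = 1.0 / tl.sqrt(var + eps)
    w = tl.load(W + cols, mask=mask, other=1.0)
    b = tl.load(B + cols, mask=mask, other=0.0)
    tl.store(Y + row * stride + cols, xc * rstd * w + b, mask=mask)

def layer_norm(x, weight, bias, eps=1e-5):
    M, N = x.shape
    y = torch.empty_like(x)
    # One program per row; BLOCK_SIZE covers the whole (padded) row.
    _layer_norm_fwd[(M,)](x, y, weight, bias, x.stride(0), N, eps,
                          BLOCK_SIZE=triton.next_power_of_2(N))
    return y
```

With `key=["N"]`, Triton benchmarks the configs once per distinct row width and caches the winner, so the tuning overhead is paid only on the first call for each shape.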
Hey, I can try to answer the question; it seems it's here: