1096125073
1096125073
### System Info x85-64 4 A10 0.9.0 ### Who can help? _No response_ ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks...
### System Info trt-llm v0.9.0 ### Who can help? @byshiue ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [X] An...
Two minor changes: 1. Add support for the Telechat model (VLLM now supports the Telechat model). 2. When executing the forward function, it is necessary to place the tensor in...
ADD MODULES_TO_NOT_CONVERT ATTRIBUTE TO GPTQ SERIES when using GPTQ, it may be necessary to exclude specific layers, which is a very useful feature.