X_Bee

1 comment by X_Bee

My solution in `llama/convert_checkpoint.py`:

```python
...
# about line 666
def get_tllm_linear_weight(weight,
                           prefix,
                           bias=None,
                           use_weight_only=False,
                           plugin_weight_only_quant_type=torch.int8,
                           dtype='float32',
                           use_gemm_woq_plugin=True,
                           postfix='weight'):
    results = {}
    print(f"{weight.shape=}")
    if use_weight_only:
        if len(weight.shape) == 3:
            v = weight.permute(0,...
```
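The snippet above is cut off right after `permute(0,`, so the exact axis order it uses is not known. As a rough illustration of the idea of branching on the weight's rank, here is a minimal sketch that assumes the 3-D case is a stack of matrices whose last two axes should be swapped, mirroring a plain transpose in the 2-D case. The helper name `transpose_for_quant` and the `(0, 2, 1)` order are assumptions for illustration, not part of the original code.

```python
import torch


def transpose_for_quant(weight: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: swap the last two axes of a 2-D or 3-D weight."""
    if weight.dim() == 3:
        # Assumed axis order; the original snippet is truncated after `permute(0,`.
        # Axis 0 (the stack/batch axis) stays in place, axes 1 and 2 are swapped.
        return weight.permute(0, 2, 1).contiguous()
    if weight.dim() == 2:
        # Ordinary matrix transpose for a single weight matrix.
        return weight.t().contiguous()
    raise ValueError(
        f"expected a 2-D or 3-D weight, got shape {tuple(weight.shape)}")


# A stack of two 3x4 matrices becomes a stack of two 4x3 matrices.
stacked = torch.arange(24, dtype=torch.float32).reshape(2, 3, 4)
print(tuple(transpose_for_quant(stacked).shape))

# A single 3x4 matrix becomes 4x3.
single = torch.zeros(3, 4)
print(tuple(transpose_for_quant(single).shape))
```

`permute` only changes strides, so the `.contiguous()` call is what actually lays the data out in the new order; whether the downstream quantization step needs that is something to check against the real `convert_checkpoint.py`.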