Ștefan-Gabriel Muscalu

Results: 21 comments by Ștefan-Gabriel Muscalu

I just quantized `Mistral-Nemo-Instruct`, and when I try to run it I get the following error:

```
llm_load_tensors: ggml ctx size = 0.17 MiB
llama_model_load: error loading model: check_tensor_dims: tensor 'blk.0.attn_q.weight' has wrong...
```