David-AU-github
> Thanks for sharing your results here! > > DARE-TIES does have a randomized element, yeah - it's part of the algorithm by design. If you want more reproducible merges...
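To illustrate the reproducibility point above: DARE drops each task-vector parameter at random, so pinning the RNG seed makes the drop mask, and hence the merge, repeatable. This is a toy stdlib sketch of that idea, not mergekit's actual implementation; `dare_drop_mask` and its arguments are made up for illustration.

```python
import random

def dare_drop_mask(n, drop_rate, seed=None):
    """Toy sketch (not mergekit's code): DARE keeps each delta
    parameter with probability 1 - drop_rate. Fixing `seed` makes
    the random mask identical across runs."""
    rng = random.Random(seed)
    return [rng.random() >= drop_rate for _ in range(n)]

# Same seed -> identical mask -> reproducible merge result
assert dare_drop_mask(8, 0.5, seed=42) == dare_drop_mask(8, 0.5, seed=42)
```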
thank you !!!
Quick question: does the same format work with DARE-TIES? What about other merge types?
> it appears they slightly changed the names of some things there, not an expert but will try to see if i can find any differences

Seems the issue may...
@Jiahao-Yuan @cg123 Built a Llama 3.2 - 8X3B (18.4B parameters) this way. Found that if you patch `mergekit/_data/architectures/mistral.json` as follows, it works:

```
"post_weights": [
    {
        "name": "model.norm.weight",
        "input_space": "h_${num_layers}"
    },...
```
Added issue: it seems that even when using the mergekit workaround, mergekit is not creating "tokenizer.model" for Gemma models (previously it did). RESULT: can't quant models from source without...
Might be in the "pydantic" Python package (?); I had to revert to an older mergekit version.
Confirming the exact same error; mergekit cannot find the "base_model", even when the path is local (absolute) on Windows. Funny thing is, some merges work fine - no...
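For reference, a minimal mergekit-style YAML sketch with local Windows paths; the model names and paths here are hypothetical, and forward slashes sidestep YAML backslash-escape surprises in quoted Windows paths:

```yaml
# Hypothetical config sketch -- paths and model names are made up.
# Forward slashes work on Windows and avoid YAML escape issues.
merge_method: dare_ties
base_model: C:/models/base-model
models:
  - model: C:/models/finetune-a
    parameters:
      weight: 0.5
      density: 0.5
dtype: bfloat16
```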
@cg123 Thank you so much!
> @ubergarm, just finished uploading [Qwen3-30B-A3B-GGUF](https://huggingface.co/eaddario/Qwen3-30B-A3B-GGUF). Summary of scores in the model card, and actual results in the _scores_ folder. > > A few things to consider: > > *...