compilade
@mofosyne I agree with @Galunid regarding the overhead (both CPU-wise and memory-wise). This also has the exact same problems as the UUID autogeneration, because the hash for an `f32` model...
@mofosyne

> That's a bit strange, does llama-gguf-hash also show difference?

No, the difference is only in the metadata, because of the hash introduced in this PR, which differs, because...
Some progress report: I have a local branch (not yet public) on top of #8526 in which I've started implementing the graph for Mamba-2. The conv step is very similar...
Okay, the fully recurrent mode works for `Mamba-2`! (for the curious, see this branch: ) I'll open a PR soon (in the next few days; I still need to clean up some...
Heads up that #15625 fixes a problem in the implementation of `SSM_SCAN`, which makes this model (Mamba-Codestral-7B-v0.1) better than it was when initially implemented here. So if you had some...
> @compilade I recall you had an observation about potential issues with autogenerating uuids

@mofosyne Yes, there are possible problems.

- Should the UUID of a model be the same...
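The equivalence concern above can be illustrated with a small sketch: the same value serialized at different precisions yields different raw bytes, so a byte-level hash (and any UUID derived from it) differs even when the model is semantically the same after quantization. This is a standalone illustration, not code from the PR.

```python
import hashlib
import struct

# Illustrative only: the same value serialized as f32 vs f16 produces
# different raw bytes, so a byte-level hash differs even though the
# number (and, after quantization, the model) is semantically the same.
value = 1.0
f32_bytes = struct.pack("<f", value)  # 4 bytes, IEEE single precision
f16_bytes = struct.pack("<e", value)  # 2 bytes, IEEE half precision

f32_hash = hashlib.sha256(f32_bytes).hexdigest()
f16_hash = hashlib.sha256(f16_bytes).hexdigest()
print(f32_hash == f16_hash)  # prints False
```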
@mofosyne Hashing the *source tensors* could work without making the memory usage too high (because they are `mmap`-ed), and would also solve the other equivalence problems, since the semantics of...
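A minimal sketch of the idea: stream the source tensor bytes through a hash and derive a deterministic UUID from the digest. The function name matches `generate_source_tensors_uuid()` mentioned later in the thread, but the body here is an assumption, not the actual implementation; because the input is streamed in chunks, `mmap`-ed tensor data can be hashed without loading everything into memory at once.

```python
import hashlib
import uuid

def generate_source_tensors_uuid(tensor_chunks):
    # Hypothetical sketch (not the actual implementation): derive a
    # deterministic UUID from the raw bytes of the source tensors.
    # tensor_chunks is any iterable of bytes (e.g. mmap-ed slices),
    # so memory usage stays low.
    h = hashlib.sha256()
    for chunk in tensor_chunks:
        h.update(chunk)
    # Fold the first 16 bytes of the digest into a UUID.
    return uuid.UUID(bytes=h.digest()[:16])
```

The UUID is stable for identical tensor contents and changes whenever any tensor byte changes.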
> you mean like `generate_source_tensors_uuid()` in this?

@mofosyne Yes, pretty much. This reads all of the source tensors twice (so it's slow), but I don't really see a way around that...
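One plausible reason for the double read: a content-derived id belongs in the file's metadata, which is written before the tensor data, so the data must be scanned once to compute the hash and again to write it out. A minimal sketch, assuming a simplified layout (a 16-byte content id followed by raw data; the real GGUF layout is more involved):

```python
import hashlib
import io

def write_with_content_uuid(tensor_chunks, out):
    # Hypothetical two-pass writer: the content id lives in the header,
    # which precedes the tensor data, so the chunks are traversed twice.
    # tensor_chunks must be re-iterable (e.g. a list, not a generator).
    h = hashlib.sha256()
    for chunk in tensor_chunks:   # pass 1: hash the data
        h.update(chunk)
    out.write(h.digest()[:16])    # simplified "header": 16-byte content id
    for chunk in tensor_chunks:   # pass 2: write the data
        out.write(chunk)

buf = io.BytesIO()
write_with_content_uuid([b"tensor-data"], buf)
```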
Author of #8526 here.

> Why can this happen?

Basically, it should not happen if it worked before. It's possible the internal changes in batch splits caused some external changes...
> Thank you for the answer! Changing default idx to `-1` helps, the error no longer occurs.

That is very good to know!

> It looks like the feature was...