Georgi Gerganov

Results: 1015 comments by Georgi Gerganov

Yes that's correct. In some places the necessary checks / asserts are missing.

I'm afraid it will be difficult for me to help here, because I don't have a multi-GPU system to test with and I am not very familiar with this code....

Hi Sara, thanks for your interest! I guess just a link to the repo would be good enough.

@FSSRepo would you like to review this PR? I think you are planning to add `ggml_pad` - see if the 2 things play along together

Great - very useful! This is exactly what we need to get us started. This will stay a bit in the background for some time as there are more pressing...

Thanks! Looks interesting - will give it a try tomorrow and share it around

Maybe try to make the MPT example auto-detect. I guess in the long term we should just add MPT support to `llama.cpp`. For example, here is ongoing work to add...

> As is evident, methods in ggml-alloc.c, e.g. `ggml_gallocr_reserve_n`, use `stderr` directly. So that begs the question of whether the internal logging in llama.cpp should not be made commonly available, e.g. in...

On AMD Ryzen 9 5950X and M2 Ultra, `SOFT_MAX` is about 1.5x faster than `master`. Using the following command to benchmark:

```bash
make -j tests && ./tests/test-backend-ops -o SOFT_MAX -b...
```

@LostRuins Which instruction set do you observe to fail (ARM, AVX, ..)?