Steward Garcia comments

Results: 92 comments of Steward Garcia

Is there anyone who can't generate images correctly?

@leejet It seems to me that the CUDA backend has synchronization issues, starting as early as the CLIP model; they occur intermittently. ```bash build\bin\Release\sd -m models/kotosmix_v10-f16.gguf -p...

Is there anyone who can't generate images correctly?

@Cyberhan123 Could you send me the CLI commands to perform this test? Your link is not allowing me to access Colab.

Is there anyone who can't generate images correctly?

@leejet To fix the softmax race condition in CUDA, comment out line 6499; this may resolve the artifacts that appear when using VAE tiling: ```c++ while (nth < ncols_x &&...

slow ggml_vec_dot_f16 operator on Android

I think it would be better to support a Vulkan backend for acceleration on Android devices, as ggml currently lacks good support for OpenCL (it is even considered obsolete). Unfortunately, I...

taesd model does not work after reloading the sd_ctx

@Jonathhhan I don't understand this case you're raising. Are you referring to reloading the model? Or deleting an already created context and creating a new one? Or that when you...

Flash attention abort trap

@leejet Fixing this requires the following modification at line 12401 of ggml.c:

```c
if (dst->op_params[0] == 1) { // masked
    GGML_ASSERT(P >= 0);
}
```
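The intent of that change can be illustrated outside ggml with a minimal sketch. Everything here is hypothetical: `flash_attn_shape_ok` is not a ggml function, and treating `P` as the key/value length minus the query length is an assumption about what `P` means at that line. The sketch only shows the idea of enforcing `P >= 0` when the masked flag is set, instead of unconditionally.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the guarded assertion (not ggml code).
 * Assumption: P is the key/value length minus the query length. */
static bool flash_attn_shape_ok(int64_t n_kv, int64_t n_q, bool masked) {
    assert(n_kv >= 0 && n_q >= 0);   /* lengths are never negative */
    const int64_t P = n_kv - n_q;
    if (masked) {
        return P >= 0;               /* the check only applies to the masked case */
    }
    return true;                     /* unmasked: no constraint on P */
}
```

With this shape, an unmasked call where the query is longer than the key/value sequence passes, while the same shapes with masking enabled are still rejected.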

Flash attention abort trap

@leejet GGUF file support is broken; you need to set GGML_MAX_NAME to 128 to prevent it from crashing when loading the model.
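As a rough illustration of why the limit matters: ggml stores each tensor name in a fixed-size buffer of `GGML_MAX_NAME` bytes, so a longer name cannot be held intact. The sketch below is not ggml code. `tensor_name_fits` is a hypothetical helper, and the `#ifndef` guard assumes the macro can be overridden at build time, which may not hold for every ggml revision (in that case the value would have to be edited in the header itself).

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Assumption: an overridable default; 128 is the value suggested in the comment above. */
#ifndef GGML_MAX_NAME
#define GGML_MAX_NAME 128
#endif

/* Hypothetical helper (not part of ggml): returns 1 when `name` plus its
 * terminating NUL fits in a buffer of `max_name` bytes, 0 otherwise. */
static int tensor_name_fits(const char *name, size_t max_name) {
    assert(name != NULL);
    return strlen(name) + 1 <= max_name;
}
```

A 100-character tensor name, for example, does not fit in a 64-byte buffer but does fit once the limit is raised to 128.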

service deployment

I have been thinking about implementing a REST API for image generation, but I have not had the time. Additionally, this project changes a lot, and it's not worth it for me...

Created a Server example

Hello everyone, I have a llama.cpp with cpp-httplib [here](https://github.com/FSSRepo/llama.cpp). It doesn't require external dependencies.

## Limitations:
* Only tested on Windows and Linux.
* Only CMake build.
* Only one...

Created a Server example

@x4080 Could you please describe the error in the Issues tab on my fork?