Evgeny Igumnov issues

Results 5 issues of


                                            Evgeny Igumnov

Error: DriverError(CUDA_ERROR_OUT_OF_MEMORY, "out of memory") with multiple GPU

I have: 4x RTX 3080 = 40GB total memory (each GPU by 10 GB memory) I try to load model Mistral 7 about 15Gb file. But I take error: ```...

Mistral quantized example: "error: library kind `framework` is only supported on Apple targets"

``` C:\Users\igumn\candle\candle-examples\examples\mistral>cargo run --example mistral --features accelerate --release -- --prompt "Here is a sample quick sort implementation in rust " --quantized -n 400 Compiling cc v1.0.90 Compiling serde v1.0.197 Compiling...

gemma2:2b example

Hello Sir and Madam, Do you plan to add the gemma2:2b example? This model is very small and smart. Best regards, Evgeny

Example quantized with custom GGUF model error: cannot find llama.attention.head_count in metadata

``` C:\Users\igumn\candle\candle-examples\examples\quantized>cargo run --features=cuda --example quantized --release -- --model=gemma-2-2b-it.q4_k_m.gguf --prompt "def fibonacci(n): " Finished `release` profile [optimized] target(s) in 0.48s Running `C:\Users\igumn\candle\target\release\examples\quantized.exe --model=gemma-2-2b-it.q4_k_m.gguf --prompt "def fibonacci(n): "` avx: true, neon:...

Add Gemini API Support

**Description:** We need to integrate the Gemini API into our application. Google Gemini API offers free usage with rate limits, making it an attractive option for enhancing our service's capabilities....

good first issue