NEWbie0709
### What happened? I’m experiencing an issue when using litellm proxy to communicate with the qwen-vl-plus model for multimodal interactions. When I send an image URL directly to qwen-vl-plus, it...
### What happened? I've encountered an issue while using LiteLLM with the GROQ/Llama 3.2 Vision model and Qwen. The problem arises specifically when providing an image input. --GROQ/Llama 3.2 Vision...
Hello, I would like to train a model specifically on MITRE ATT&CK knowledge and help it familiarize itself with MITRE ATT&CK concepts using the Zainabsa99/mitre_attack dataset. However, the documentation mainly...
Does AirLLM currently support running 4-bit quantized models like unsloth/Qwen2-72B-bnb-4bit? I’m trying to load and run this model using AirLLM, but I’m encountering the following error during generation: > RuntimeError:...
I am currently fine-tuning the Llama 3.1 8B Instruct model using CURLoRA adapters on a single RTX 4090 GPU. Problem: - It takes ~170 seconds per step (batch) during...
### What is the issue? After upgrading to CUDA 12.8 (from a previously working CUDA setup), Ollama now runs models using the CPU only, whereas it previously utilized the GPU...