André Pankraz
**Describe the bug** I am seeing out-of-memory errors with 35 GB processes; Stanza could be tracked down as the cause. **To Reproduce** Steps to reproduce the behavior: 1. Take e.g....
Currently I cannot really use the "Sparse mode" of BGE-M3. Even with 8 GB of VRAM and small batch sizes I get CUDA out-of-memory errors. Why does this mode need...
### Feature request Currently, BGE rerankers have limited multilingual support, covering mainly English and Chinese. Many XLM-RoBERTa cross-encoders are also English-focused. Could you please add support for other...
Hello, this is more of a question than a feature request (or maybe a feature request for a clear roadmap). I have seen that the Prompt Playground has now completely...
See title. When I update from 1.0.200 to 300, the Playground can no longer find template formats.
### Your current environment Docker on 4 x A100 SXM. BTW: vLLM 0.8.4 was stable with the same setup. 0.9.0.1 was already unstable (it restarted a few times a day), now even more....