SmallShark
Results
2
issues of
SmallShark
# What does this PR do? System Info transformers 4.42.3 Now gemma2 model generates long text that exceeds the window size (>4096), it will report a CUDA error, which seems...
### 🚀 The feature, motivation and pitch Now vLLM gemma2 does not support ROPE scaling, and I sincerely hope that support for it will be added in the future.
feature request