transformerlab-app
transformerlab-app copied to clipboard
Gemma2 inference issues on fastchat and mlx server
Using Gemma 2 9B IT:
- on Fastchat it returns "Hello Hello Hello Hello Hello..."
- on MLX it returns a correct answer but ends with "<end_of_turn>"