Junrui@Intel_SH

Results 2 issues of Junrui@Intel_SH

I try to run the TTS (English and Multi Language Text-to-Speech) in my PC. https://github.com/intel/intel-extension-for-transformers/blob/main/intel_extension_for_transformers/neural_chat/pipeline/plugins/audio/README.md It occured the `cannot import name 'WeightOnlyQuantizedLinear'` error info as below. ```shell ~/WorkSpace/TTS$ python eng-tts.py...

When ollama create `glm-4-9b-chat` model and inference, it always gives a **random and incorrect response from second round**, like below: The complete log of curl command is folded at here...

user issue