Tuanshu
Results
2
comments of
Tuanshu
Just encoutered similar issue. I can confirm downgrade to v0.105.0 works for me.
I have just tried the "INT4 Inference (CPU only)" example. It seems that: if it is the first run (no runtime_outs/ne_mistral_q_nf4_jblas_cfp32_g32.bin generated). the model name ("Intel/neural-chat-7b-v3-1") wont works, I need...