Liang Lv
Hi @Akshaysharma29, Could you please try to convert your SavedModel to a pb file and then use a session run to check whether your FP32 graph is good? You didn't evaluate the...
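For reference, a minimal sketch of that check is below. It assumes a TF1-style SavedModel under `./saved_model` and uses hypothetical tensor names (`input_ids`, `logits`); substitute the node names from your own graph.

```python
import numpy as np
import tensorflow as tf

tf.compat.v1.disable_eager_execution()

# Step 1: load the SavedModel and freeze it into a .pb graph.
with tf.compat.v1.Session(graph=tf.Graph()) as sess:
    tf.compat.v1.saved_model.loader.load(sess, ["serve"], "./saved_model")
    frozen_graph_def = tf.compat.v1.graph_util.convert_variables_to_constants(
        sess, sess.graph_def, ["logits"])  # hypothetical output node name
    with tf.io.gfile.GFile("frozen_model.pb", "wb") as f:
        f.write(frozen_graph_def.SerializeToString())

# Step 2: reload the frozen pb and run a session to sanity-check the FP32 graph.
with tf.compat.v1.Session(graph=tf.Graph()) as sess:
    graph_def = tf.compat.v1.GraphDef()
    with tf.io.gfile.GFile("frozen_model.pb", "rb") as f:
        graph_def.ParseFromString(f.read())
    tf.import_graph_def(graph_def, name="")
    inp = sess.graph.get_tensor_by_name("input_ids:0")  # hypothetical input name
    out = sess.graph.get_tensor_by_name("logits:0")     # hypothetical output name
    print(sess.run(out, feed_dict={inp: np.zeros((1, 128), dtype=np.int32)}))
```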
Hi @Akshaysharma29, There are built-in datasets 'bert' and 'mzbert', which are intended for BERT-type models. They use the tf_record format. The 'dummy' dataset is not suitable for bert...
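As a quick sanity check of the data format, the sketch below parses one record from a BERT-style tf_record file. The path and the feature spec (`input_ids`, `input_mask`, `segment_ids`, `label_ids` with sequence length 128) are assumptions; adjust them to match how your eval file was generated.

```python
import tensorflow as tf

record_path = "./eval.tf_record"  # hypothetical path to your tf_record eval file

# Assumed feature layout for a typical BERT tf_record example.
feature_spec = {
    "input_ids": tf.io.FixedLenFeature([128], tf.int64),
    "input_mask": tf.io.FixedLenFeature([128], tf.int64),
    "segment_ids": tf.io.FixedLenFeature([128], tf.int64),
    "label_ids": tf.io.FixedLenFeature([], tf.int64),
}

dataset = tf.data.TFRecordDataset(record_path)
for raw in dataset.take(1):
    example = tf.io.parse_single_example(raw, feature_spec)
    print({name: tensor.shape for name, tensor in example.items()})
```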
@matthew-olson-intel The PR https://github.com/tensorflow/tensorflow/pull/53480 is related to Pad + Conv3D, not Pad + Conv2D. The good news is that this issue has already been fixed by TF SPR-Base and...
@tbykowsk, Apologies for the delayed response regarding this issue. I hadn't realized it was assigned to me. You can refer to the [quick start example](https://github.com/intel/intel-extension-for-transformers/blob/main/intel_extension_for_transformers/neural_chat/examples/quick_start/chatbot/README.md) for setting up the chatbot service....
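For a quick smoke test, the quick start essentially boils down to the sketch below. It uses `build_chatbot` with its default configuration as shown in the neural_chat documentation; the prompt is only an example.

```python
from intel_extension_for_transformers.neural_chat import build_chatbot

# Build a chatbot with the default configuration; pass a PipelineConfig
# to build_chatbot if you need to customize the model or device.
chatbot = build_chatbot()
response = chatbot.predict("Tell me about Intel Xeon Scalable Processors.")
print(response)
```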
@bbelky, Please use Python 3.10.
> Let's take a look how vLLM and TGI supports. we can leverage them and add what we need if missing. They are serving frameworks, so they don't support user...
Hi @olegmikul, To resolve the Chatbot issue, you'll need to install the additional requirements file located at intel_extension_for_transformers/neural_chat/requirements_cpu.txt before running the chatbot. For the INT4 Inference issue, please execute `pip...
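Once the requirements are installed, INT4 weight-only inference can be exercised with a sketch like the one below. It uses the `AutoModelForCausalLM` wrapper from intel_extension_for_transformers with `load_in_4bit=True`; the model name is only a placeholder, not necessarily the model from the original issue.

```python
from transformers import AutoTokenizer
from intel_extension_for_transformers.transformers import AutoModelForCausalLM

model_name = "Intel/neural-chat-7b-v3-1"  # placeholder; use your own model id
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)

# load_in_4bit=True triggers INT4 weight-only quantization on load.
model = AutoModelForCausalLM.from_pretrained(model_name, load_in_4bit=True)

inputs = tokenizer("Once upon a time", return_tensors="pt").input_ids
outputs = model.generate(inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0]))
```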