kevinintel

Results 62 comments of kevinintel

Please update the supported data types in https://github.com/intel/neural-speed/blob/main/docs/advanced_usage.md

It's not a high-priority task. Oracle Cloud is on the roadmap, but ARM-based machines are not.

I prefer the entire text. Btw, OPEA is microservice-based, so please think about how to contribute to OPEA.

Please try to create a PR first. Late Chunk may not be better than the current embedding, but you are welcome to expand the functionality.

We can add a note in the docs to remind users to disable bf16 on specific machines.

Will assign to Zhihao.

TGI-Gaudi and vLLM support Falcon 40B and Falcon 7B. We will validate Falcon-11B.

TGI-Gaudi does not support this model: https://huggingface.co/tiiuae/falcon-11B-vlm. We need to wait for TGI-Gaudi.