Chen Xi
Hi, could you share the pretrained checkpoint or pb file so that we can check the accuracy? That would be nice.
I found that the LICENSE file is missing from this repo, so it's hard to know how this work may be used. Could you please add a LICENSE?
Hi Lei Mao: Thanks for your blog; it has helped me a lot as I'm enabling a style transfer model from tf-hub. I came across some problems, simply using...
## Type of Change gaudi modeling used in itrex for int4 kv-cache support
Hi gnuvince, this demo works well for me, but I have a question: how can I extend the grammar to support defining a function?
oneCCL commit: 5e7c7b7e33f5f679cb82547c4f7e49623ff0ab09 build: cmake .. -DCMAKE_C_COMPILER=icx -DCMAKE_CXX_COMPILER=icpx -DCOMPUTE_BACKEND=dpcpp run command: examples/sycl$ mpirun -n 2 ./sycl_allreduce_test gpu Log: preferred platform: Intel(R) Level-Zero, found: 8 GPU device(s) preferred platform: Intel(R) Level-Zero,...
- [x] I have read the [contributing guidelines](https://github.com/ggerganov/llama.cpp/blob/master/CONTRIBUTING.md) - Self-reported review complexity: - [ ] Low - [ ] Medium - [x] High Fix multi-GPU crash issue by filtering the...
### System Info ```shell Optimum-habana: main dd3cc08937a85175eafd3b511484c86fed889b79 Docker images: vault.habana.ai/gaudi-docker/1.15.1/ubuntu22.04/habanalabs/pytorch-installer-2.2.0:latest DeepSpeed: main 7e475fbdcf45d763f83d47dd6c9c181be731d92e ``` ### Information - [X] The official example scripts - [ ] My own modified scripts ###...
### What happened? The deepseek-v2 model runs into an out-of-memory issue: the KV buffer allocation is about 43G with the 160K context length from the model. But when you...