Wang, Zhe issues

Results: 10 issues of Wang, Zhe

Enabling ollama to run on Intel GPUs with SYCL backend

Hi, I am submitting this PR to enable ollama to run on Intel GPUs with SYCL as the backend. This PR was [originally](https://github.com/ollama/ollama/pull/2458) started by @felipeagc, who is currently unable...

[RFC] options about low-bit GEMM kernels contribution on x86 CPUs

Hi, this is Zhe from the Intel AI software engineering team. Thank you for creating this amazing project, AutoGPTQ. # Motivation My colleagues have done some pretty good work on low-bit...

remove matB & matB_acc block_size_x constrain for better simd_lane utilization

## Type of Change feature or bug fix or documentation or others API changed or not ## Description detail description Issues: xxx ## Expected Behavior & Potential Risk the expected...

[RFC] options about low-bit GEMM kernels contribution on x86 CPUs

Hi, this is Zhe from the Intel AI software engineering team. Thank you for creating this amazing project, AutoAWQ. # Motivation My colleagues have done some pretty good work on low-bit...

How to detect iGPU free memory

Hi, I'd like to ask how to use level-zero/sycl to detect available memory of Intel iGPUs (e.g., Intel® Iris® Xe Graphics, PCI ID: 46A0, codename: Alder Lake-P). I found relevant...

question

Support intel igpus

Hi, I’m submitting this PR to enable Intel iGPU support through the `OLLAMA_INTEL_IGPU` environment variable. Due to the limitations of Intel’s foundation software (details can be seen in this [issue](https://github.com/intel/compute-runtime/issues/742)), it...

qbits deprecate clip postfix

Int4 dequantize kernel

Add zp no degrad dequant

How to detect iGPU free memory