Wang, Zhe
Wang, Zhe
Hi, I am submitting this pr to enable ollama to run on Intel GPUs with SYCL as the backend. This pr was [originally](https://github.com/ollama/ollama/pull/2458) started by @felipeagc who is currently unable...
Hi, here is Zhe from Intel AI software engineering team. Thank you for creating this amazing project AutoGPTQ. # Motivation My colleagues have done some pretty good work on low-bit...
## Type of Change feature or bug fix or documentation or others API changed or not ## Description detail description Issues: xxx ## Expected Behavior & Potential Risk the expected...
Hi, here is Zhe from Intel AI software engineering team. Thank you for creating this amazing project AutoAWQ. # Motivation My colleagues have done some pretty good work on low-bit...
Hi, I'd like to ask how to use level-zero/sycl to detect available memory of Intel iGPUs (e.g., Intel® Iris® Xe Graphics, PCI ID: 46A0, codename: Alder Lake-P). I found relevant...
Hi, I’m submitted this PR to enable Intel iGPU through the `OLLAMA_INTEL_IGPU` environment variable. Due to the limitations of Intel’s foundation software (details can be seen in this [issue](https://github.com/intel/compute-runtime/issues/742)), it...
## Type of Change feature or bug fix or documentation or others API changed or not ## Description detail description JIRA ticket: xxx ## Expected Behavior & Potential Risk the...
## Type of Change feature or bug fix or documentation or others: feature API changed or not: add a new kernel ## Description int4 dequantize kernel with very high bandwidth...
## Type of Change feature or bug fix or documentation or others API changed or not ## Description detail description Issues: xxx ## Expected Behavior & Potential Risk the expected...
Hi, I'd like to ask how to use level-zero/sycl to detect available memory of Intel iGPUs (e.g., Intel® Iris® Xe Graphics, PCI ID: 46A0, codename: Alder Lake-P). I found relevant...