Ma Mingfei comments

Results 93 comments of


                                            Ma Mingfei

Enable Intel GPU

@dbyoung18 does this one support int4 woq ?

[Feature] RFC for adding CPU support for SGLang

@zhyncs need to reopen this one. We are currently working on an internal branch to make sure everything is ready and then we will start upstream to sglang main branch....

[Feature] RFC for adding CPU support for SGLang

comment to keep this thread active, optimization work pretty much done internally.

[Feature] RFC for adding CPU support for SGLang

upstream the C++ kernels on https://github.com/sgl-project/sglang/pull/5150

[Feature] RFC for adding CPU support for SGLang

update using CMakeLists.txt: https://github.com/sgl-project/sglang/pull/6115

[Feature] RFC for adding CPU support for SGLang

fp8 gemm: https://github.com/sgl-project/sglang/pull/6216

[Feature] RFC for adding CPU support for SGLang

enable intel amx attention backend: replace https://github.com/sgl-project/sglang/pull/6143 with https://github.com/sgl-project/sglang/pull/6405 https://github.com/sgl-project/sglang/pull/6408

[Feature] RFC for adding CPU support for SGLang

add fp8 shared moe kernels https://github.com/sgl-project/sglang/pull/6339 the shared moe kernels is an innovation that we have done on cpu backend, brings pretty good performance speedup for decoding when concurrency is...

[Feature] RFC for adding CPU support for SGLang

add fp8 support for existing fused moe kernels: https://github.com/sgl-project/sglang/pull/6404

[Feature] RFC for adding CPU support for SGLang

Add docker build: https://github.com/sgl-project/sglang/pull/6458