awesome-cuda-and-hpc
awesome-cuda-and-hpc copied to clipboard
recommadation to a new light-weight llm serving framework--SFLLLM
trafficstars
https://github.com/wejoncy/sfllm Hi This can be worked on Windows/Linux/Macos or any torch compatiable OS. For Learning purpose but with impressive performance.