[Feature] GPU Support for MoE models
Feature Request
Are there any plans to support GPU inference for MoE models in the future? Models like Granite 3.1-3B and Mixtral-8x7B are popular MoE models. Their main appeal is high inference speed at low resource usage, which makes them a good fit for on-device use. Currently only CPU inference is supported, and it is already giving promising results; adding GPU support through the Vulkan backend should improve inference speed drastically. A sketch of the expected usage is below.
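For context, a minimal sketch of how GPU execution would be requested through the gpt4all Python bindings once MoE architectures are supported on the GPU backend. The model filename is illustrative, not an actual catalog entry; per the current behavior described above, an MoE model loaded this way still ends up running on the CPU.

```python
from gpt4all import GPT4All

# Hypothetical MoE GGUF file; substitute whatever MoE model you have locally.
MODEL_FILE = "mixtral-8x7b-instruct.Q4_0.gguf"

# device="gpu" asks GPT4All to run the model on the GPU backend rather than the CPU.
# With MoE architectures this is the call that would benefit from the requested feature.
model = GPT4All(MODEL_FILE, device="gpu")

with model.chat_session():
    print(model.generate("Explain what a mixture-of-experts model is.", max_tokens=128))
```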