gpt4all icon indicating copy to clipboard operation
gpt4all copied to clipboard

[Feature] GPU Support for MoE models

Open KyberNull opened this issue 8 months ago • 0 comments
trafficstars

Feature Request

Are there any plans of supporting MoE models in the future? Models like the Granite3.1-3B and Mixtral-8x7B are popular MoE models. Their main appeal being high inference speeds and low resource usage, making them a good choice for in device usage. Currently only CPU inference is supported and is giving promising results, further GPU support using Vulkan should drastically improve inference speeds

KyberNull avatar Mar 14 '25 07:03 KyberNull