ctransformers
ctransformers copied to clipboard
Multiple GPU support
I have 3 GPTQ models to consume and I have 4 GPUs available. How can I mention which model to load in which GPU? If I do not mention it is trying to load all models in cuda:0 and it is crashing.