Hub should autodetect users' RAM or VRAM to Recommend Models
Objective
- As part of a larger epic, we need to autodetect the user's hardware and recommend suitable models
- Our long-term goal is to help the user "run best inference quality for their given hardware".
- [Stretch Goal] Is there a way for us to also autoconfigure inference engine params (e.g. GPU layer offloading)?
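For the stretch goal, one possible shape is a pure heuristic that estimates how many transformer layers fit in VRAM, in the style of llama.cpp's `n_gpu_layers`. Everything here is an illustrative assumption (even layer sizes, a fixed headroom for KV cache and scratch buffers), not a tuned or measured implementation:

```python
def gpu_layers_to_offload(vram_gb: float, n_layers: int, model_gb: float) -> int:
    """Estimate how many layers to offload to the GPU, llama.cpp-style.

    Assumptions (hypothetical, for illustration): model weights are split
    roughly evenly across layers, and ~1.5 GB of VRAM is reserved as
    headroom for the KV cache and scratch buffers.
    """
    if n_layers <= 0 or model_gb <= 0:
        return 0
    per_layer_gb = model_gb / n_layers
    usable_gb = max(vram_gb - 1.5, 0.0)  # keep headroom for KV cache etc.
    return min(n_layers, int(usable_gb // per_layer_gb))
```

For example, a 4 GB card loading a 38 GB model split over 40 layers would offload only 2 layers, while a card with room to spare offloads all of them.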
Questions
- What happens if a user has both a GPU (VRAM) and ample system RAM — which do we prioritize?
- Are there edge cases (e.g. a user with a 4090 but somehow only 8 GB of RAM - perhaps Windows eGPU users?)
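One way to frame both questions is a budget function that prefers VRAM when a discrete GPU is present but also caps by system RAM, so the 4090-with-8-GB-RAM case does not get offered a model it cannot load. The model catalogue and all thresholds below are hypothetical placeholders, not Jan's actual recommendation logic:

```python
# Hypothetical catalogue: (name, weight file size in GB), largest first.
MODELS = [
    ("llama-2-70b.Q4", 38.0),
    ("llama-2-13b.Q4", 7.4),
    ("mistral-7b.Q4", 4.1),
    ("tinyllama-1.1b.Q4", 0.7),
]

def memory_budget_gb(ram_gb: float, vram_gb: float) -> float:
    """Usable budget for model weights, leaving headroom for the OS.

    If a discrete GPU is present, prefer VRAM but also cap by system RAM:
    weights pass through RAM during load, so an eGPU machine with little
    RAM should stay conservative. Percentages are illustrative guesses.
    """
    if vram_gb >= 4:
        return min(vram_gb * 0.9, ram_gb * 0.9)
    return ram_gb * 0.6  # CPU-only: leave RAM for the OS and other apps

def recommend_model(ram_gb: float, vram_gb: float = 0.0):
    """Return the largest catalogue model that fits the budget, or None."""
    budget = memory_budget_gb(ram_gb, vram_gb)
    for name, size_gb in MODELS:
        if size_gb <= budget:
            return name
    return None
```

With these numbers, a 64 GB-RAM / 24 GB-VRAM desktop gets the 13B model, while the 24 GB-VRAM / 8 GB-RAM edge case is held back to the 7B model by the RAM cap.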
Notes on some principles to follow:
- Should avoid requiring elevated system permissions (infer hardware through generic, unprivileged interfaces)
- Windows, Linux, and macOS each need a different detection approach
- Should run automatically for first-time users
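A sketch of how these principles could be satisfied: each branch below uses an unprivileged interface (`/proc/meminfo` on Linux, `sysctl` on macOS, `GlobalMemoryStatusEx` on Windows, and `nvidia-smi` for NVIDIA VRAM), so no elevated permissions are needed. This is an assumption-laden sketch — NVIDIA-only VRAM detection, no AMD/Apple-silicon handling — not a complete solution:

```python
import ctypes
import platform
import subprocess

def total_ram_bytes() -> int:
    """Total physical RAM via unprivileged, per-OS interfaces."""
    system = platform.system()
    if system == "Linux":
        with open("/proc/meminfo") as f:  # world-readable pseudo-file
            for line in f:
                if line.startswith("MemTotal:"):
                    return int(line.split()[1]) * 1024  # value is in kB
    if system == "Darwin":
        out = subprocess.check_output(["sysctl", "-n", "hw.memsize"])
        return int(out)
    if system == "Windows":
        class MEMORYSTATUSEX(ctypes.Structure):
            _fields_ = [
                ("dwLength", ctypes.c_ulong),
                ("dwMemoryLoad", ctypes.c_ulong),
                ("ullTotalPhys", ctypes.c_ulonglong),
                ("ullAvailPhys", ctypes.c_ulonglong),
                ("ullTotalPageFile", ctypes.c_ulonglong),
                ("ullAvailPageFile", ctypes.c_ulonglong),
                ("ullTotalVirtual", ctypes.c_ulonglong),
                ("ullAvailVirtual", ctypes.c_ulonglong),
                ("ullAvailExtendedVirtual", ctypes.c_ulonglong),
            ]
        stat = MEMORYSTATUSEX()
        stat.dwLength = ctypes.sizeof(stat)
        ctypes.windll.kernel32.GlobalMemoryStatusEx(ctypes.byref(stat))
        return stat.ullTotalPhys
    raise RuntimeError(f"unsupported platform: {system}")

def total_vram_mb() -> int:
    """Total NVIDIA VRAM in MB via nvidia-smi; 0 if no NVIDIA GPU/driver."""
    try:
        out = subprocess.check_output(
            ["nvidia-smi", "--query-gpu=memory.total",
             "--format=csv,noheader,nounits"],
            text=True,
        )
        return sum(int(x) for x in out.split())  # sum across GPUs
    except (OSError, subprocess.CalledProcessError, ValueError):
        return 0  # treat "no nvidia-smi" as "no NVIDIA VRAM"
```

Because both functions only read public system information, this can run silently on first launch without any permission prompt.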
Deprecated. We have new designs.