LocalAI icon indicating copy to clipboard operation
LocalAI copied to clipboard

feat: WIP: Adjust GPU Layers

Open siddimore opened this issue 1 year ago • 4 comments

Description

  1. Add GGUF Parser

TODO

  1. Install nvidia-smi driver
  2. Fetch GPU Device information
  3. Offload layers to GPU baed on GGUF Parser metadata

This PR fixes #3541

Notes for Reviewers

Signed commits

  • [ ] Yes, I signed my commits.

siddimore avatar Oct 06 '24 05:10 siddimore

Deploy Preview for localai ready!

Name Link
Latest commit cd1dc5dac4591bdf61dad25443f7f9bde40a2e4f
Latest deploy log https://app.netlify.com/sites/localai/deploys/6705d246290228000812ee36
Deploy Preview https://deploy-preview-3737--localai.netlify.app
Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

netlify[bot] avatar Oct 06 '24 05:10 netlify[bot]

@mudler can u kindly check the PR approach and give some high level feedback when possible?

Next step that i will add is some sort of GPU_Layer estimator based on:

  1. VRAM from GGUF Parsing
  2. Noof GPU's on the device

siddimore avatar Oct 08 '24 05:10 siddimore

@siddimore thanks for taking a stab at this, direction looks good here - just few minor nits here and there but definitely not blockers

mudler avatar Oct 09 '24 10:10 mudler

@siddimore thanks for taking a stab at this, direction looks good here - just few minor nits here and there but definitely not blockers

thanks much @mudler you are welcome!! i will improve the code and add some more testing. Appreciate the feedback and will fix the comments

siddimore avatar Oct 10 '24 03:10 siddimore

This PR is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 10 days.

github-actions[bot] avatar Jul 09 '25 02:07 github-actions[bot]

This PR was closed because it has been stalled for 10 days with no activity.

github-actions[bot] avatar Jul 20 '25 02:07 github-actions[bot]