h2o-llmstudio
h2o-llmstudio copied to clipboard
[CODE IMPROVEMENT] HF Push improvements
Explore the following things:
- Is it possible to specify the device of the tensors when pushing?
- Is it always the case that CPU loading has float32 and double the size?
- Explore if possible to load models sharded, and merge weights, and then push - for larger models