Add -local flag to llms/hf_llm/convert.py for reading source HF models from filesystem.
This PR is to add a local flag (-l, --local) to convert.py to enable the reading of HF Models already on the filesystem during a conversion operation. Currently the script has to download the model weights from HF directly before beginning the conversion to MLX format.
Setting the --local flag disables uploads to huggingface during conversion as the hf_path argument would not specify the original model in the modelcard.
I consolidated the fetch_from_local and fech_from_hub functions into one. Though I kept the fetch_from_hub function name as-is. Let me know if you'd like to change that to just load_model or something more neutral as it is not technically fetching from the hub if local=True is passed.
I also removed all the code related to non-safetensors.
@awni , I think I've incorporated everything from your initial review. Please let me know if there is anything else for this PR.
Thanks a ton! I will check it shortly!