inference icon indicating copy to clipboard operation
inference copied to clipboard

REF: Remove some builtin old models and `ggmlv3` model format

Open ChengjieLi28 opened this issue 6 months ago • 0 comments

  1. Remove some builtin old models
  • Baichuan baichuan-chat
  • Starcoder starcoderplus starchat
  • glaive-coder
  • wizardlm-v1.0
  • vicuna-v1.3 vicuna-v1.5
  • OpenBuddy
  • orca
  • falcon
  • Chatglm chatglm-2
  • Tiny-llama (TODO)
  • opt (TODO)
  1. Change llama-2 / llama-2-chat ggmlv3 format to ggufv2 format.
  2. Remove support for ggmlv3 format.
  3. Remove register model s3 schema support.
  4. Remove support for self-hosted models.
  5. Remove some unused codes.
  6. Remove opencv for a required dependency. Place it in all dependency.
  7. Rename pytorch dir to transformers and rename ggml dir to llama_cpp.

ChengjieLi28 avatar Aug 14 '24 07:08 ChengjieLi28