inference
inference copied to clipboard

Published 20 hours ago •

Reame
Issues

REF: Remove some builtin old models and `ggmlv3` model format

Open ChengjieLi28 opened this issue 6 months ago • 0 comments

Remove some builtin old models

Baichuan baichuan-chat
Starcoder starcoderplus starchat
glaive-coder
wizardlm-v1.0
vicuna-v1.3 vicuna-v1.5
OpenBuddy
orca
falcon
Chatglm chatglm-2
Tiny-llama (TODO)
opt (TODO)

Change llama-2 / llama-2-chat ggmlv3 format to ggufv2 format.
Remove support for ggmlv3 format.
Remove register model s3 schema support.
Remove support for self-hosted models.
Remove some unused codes.
Remove opencv for a required dependency. Place it in all dependency.
Rename pytorch dir to transformers and rename ggml dir to llama_cpp.

Aug 14 '24 07:08 ChengjieLi28