katsu560
katsu560
Old CPU has only AVX instructions, no AVX2 and no FMA instructions. So, I added AVX support code to ggml.c, ggml.h and whisper.cpp. I also added 'inline' to ggml_fp16_to_fp32, ggml_fp16_t...
Thanks @ggerganov for your sharing. I want to use GPT on my local pc. rinna Co.,Ltd is Japanese AI company. rinna provides some GPT models on huggingface. https://huggingface.co/rinna/japanese-gpt-1b https://huggingface.co/rinna/japanese-gpt2-xsmall https://huggingface.co/rinna/japanese-gpt2-small...
I revised src/CMakeLists.txt for fixing indents and commands for Haiku and adding OpenBLAS detection. OpenBLAS detection codes in whisper CMakeLists.txt isn't enough. That detects only OpenBLAS library. Thus, I added...
As for current implementation, gpt_tokenize() shows each byte of multi bytes character if unknown token is existed, like below ``` test_gpt_tokenizer : 0 tests failed out of 0 tests. gpt_tokenize:...
Some app like yolov3-tiny needs additional files to execute such as label(coco.names) and alphabet labels(100_0.png, ...) files. If these files are embedded to a model(gguf) file and the app read...
- [x] I have read the [contributing guidelines](https://github.com/ggerganov/llama.cpp/blob/master/CONTRIBUTING.md) - Self-reported review complexity: - [ ] Low - [x] Medium - [ ] High sync ggml's PR: Embed yolo files `#831`...