katsu560 issues

Results: 7 issues of katsu560

Add AVX support, inline to ggml_fp16_to_fp32, ggml_fp16_t ggml_fp32_to_fp16 and some Makefile changes

Older CPUs support only AVX instructions, not AVX2 or FMA. So I added AVX support code to ggml.c, ggml.h and whisper.cpp. I also added 'inline' to ggml_fp16_to_fp32, ggml_fp16_t...

Add ggml performance test

[Feature Request] rinna's Japanese GPT model support

Thanks @ggerganov for sharing this. I want to run GPT on my local PC. rinna Co., Ltd. is a Japanese AI company that provides several GPT models on Hugging Face. https://huggingface.co/rinna/japanese-gpt-1b https://huggingface.co/rinna/japanese-gpt2-xsmall https://huggingface.co/rinna/japanese-gpt2-small...

enhancement

good first issue

add OpenBLAS detection and modify test code

I revised src/CMakeLists.txt to fix indentation and commands for Haiku and to add OpenBLAS detection. The OpenBLAS detection code in whisper's CMakeLists.txt isn't sufficient: it detects only the OpenBLAS library. Thus, I added...

fix showing unknown token at gpt_tokenize

In the current implementation, gpt_tokenize() prints each byte of a multi-byte character when an unknown token is encountered, as shown below ``` test_gpt_tokenizer : 0 tests failed out of 0 tests. gpt_tokenize:...

Embed yolo files

Some apps, such as yolov3-tiny, need additional files to run, for example a label file (coco.names) and alphabet label images (100_0.png, ...). If these files are embedded in the model (gguf) file and the app read...

Embed files

- [x] I have read the [contributing guidelines](https://github.com/ggerganov/llama.cpp/blob/master/CONTRIBUTING.md)
- Self-reported review complexity:
  - [ ] Low
  - [x] Medium
  - [ ] High

sync ggml's PR: Embed yolo files `#831`...

Review Complexity : Low

python