Austin Huang comments

Results 39 comments of


                                            Austin Huang

GPU Support

Closing for now but will update here when there's more info on this.

[Feature request] Add simple HTTP API server like in llama.cpp with api like OpenAI

Great suggestion, if there's others who interested please +emoji above and we'll prioritize this :)

Generate compressed weights file from finetune

Hi @sanjay920, really cool that you're trying a fine tune already. We're working on releasing a conversion script soon (hopefully within the next few days), but would be useful to...

[Feature request] Add quantization methods

Understood, the -sfp models are 8 bit weights, but I understand people are interested in more aggressive quantization. BTW for just decreasing the memory footprint there was a commit that...

Allow building on Windows using `clang-cl` toolchain

ouch sorry about that, will take a closer look a bit later this evening.

WSL support: "Killed", once executed on WSL

Hi @Code-keys , can you try with 2b-it-sfp.sbs? SFP uses compressed 8bit weights. also make sure to use the right build (if you use 2b-it vs. 2b-it-sfp there's a different...

WSL support: "Killed", once executed on WSL

Re memory issues: I'm going to adjust some defaults + make kSeqLen configurable that should improve the situation a bit. In configs.h, a key parameter kSeqLen which preallocates a kv...

make error on orangepi 5 (arm)

I think this is related to the namespace issue, which should be resolved in this commit on the dev branch: https://github.com/google/gemma.cpp/commit/4a0d23f47ee36370e8db429648f58af6fdb9f953 Might try with the dev branch or alternatively I'll...

request to remove `cmake`, `highway`, and `gtest`

I agree cmake can be painful but the landscape of build systems + package management for C++ ... leaves something to be desired and for all its faults cmake seems...

request to remove `cmake`, `highway`, and `gtest`

If @jart is fine with it, what if we adapt that Makefile + submodules for local builds? If anyone wants to submit a PR I'm happy to review. Has anyone...