Nikhil Gupta
Hello, I am trying to build bazel/example:main with the NDK r25 toolchain for Android API level 31. I am using Bazel version 6.0.0. When I try to compile...
Hello @alankelly @wei-v-wang, how can we fix this issue if we are sticking to Ubuntu 16 and GCC 5.4.0? I have tried #define _POSIX_C_SOURCE 199309L as suggested by...
Hello, I am sorry if the question is very basic, but I need a little help over here. Can't we just skip the attention processing and continue from here for the input prompt...
Hello, I am trying to run an LLM on an S24 Ultra device with 12 GB of RAM. My LLM has a large embedding size of 160984 × 2048. The fp32 file...
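The memory pressure described above can be estimated with quick arithmetic. This is a rough sketch, assuming a single dense, unsharded fp32 embedding table with the 160984 × 2048 shape quoted in the post:

```python
# Rough memory estimate for the embedding table alone
# (assumption: dense fp32 table, no quantization or sharding).
vocab_size = 160984   # embedding rows, from the post
hidden_dim = 2048     # embedding columns, from the post
bytes_per_fp32 = 4

table_bytes = vocab_size * hidden_dim * bytes_per_fp32
print(f"{table_bytes / 2**30:.2f} GiB")  # prints "1.23 GiB"
```

So the embedding weights alone consume roughly 1.23 GiB before any other layers, activations, or KV cache are accounted for, which explains why a 12 GB device is tight for this model at fp32.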
Hello @wangzhaode, I would like to report an issue which I recently discovered. I can see that if a SentencePiece model is used for the tokenizer (like Llama 2),...
Hello, I am trying to add support for models with GQA, e.g. TinyLlama. The indicator for grouped-query attention is num_key_value_heads < num_attention_heads in the config.json file. For TinyLlama...
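The detection rule above can be sketched in a few lines. The field names follow Hugging Face-style config.json conventions as quoted in the post; the head counts in the example fragment are illustrative TinyLlama-like values, not read from any real file:

```python
import json

def uses_gqa(cfg: dict) -> bool:
    """Grouped-query attention: fewer key/value heads than query heads."""
    n_heads = cfg["num_attention_heads"]
    # If the field is absent, the model is plain multi-head attention,
    # i.e. every query head has its own KV head.
    n_kv = cfg.get("num_key_value_heads", n_heads)
    return n_kv < n_heads

# Illustrative config fragment (values assumed for demonstration).
config = json.loads('{"num_attention_heads": 32, "num_key_value_heads": 4}')
print(uses_gqa(config))  # True: the KV heads are shared across query heads
```

A config with num_key_value_heads equal to num_attention_heads (or missing entirely) would return False, so the same check also distinguishes standard multi-head attention.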