Karan Singh comments

Results 19 comments of


                                            Karan Singh

Support for s4cmd --continue get source dest

Waiting for this feature to be added to S4cmd, FYI S3cmd has it

OpenWrt release

@rapiz1 I am a FRP user , have been using it in prod since an year. But now i am stuck with FRP because of its large footprint ~10MB. For...

Could you please include `mipsle` in releases?

We have a use case to install Grafana Agent on hundreds of OpenWRT based routers, these devices are based on MIPS architecture. Currently we are building our own, It would...

Could you please include `mipsle` in releases?

Hi @rfratto 👋 Would really appreciate your attention on this.

Could you please include `mipsle` in releases?

@missingcharacter did you manage to find a workaround on this topic.

i tried setting `CUDA_VISIBLE_DEVICES=` but it failed ``` docker run --shm-size 1g --net=host -p 8080:80 -v $PWD/Llama-2-7b-hf:/data -e HUGGING_FACE_HUB_TOKEN=$token -e HF_HUB_ENABLE_HF_TRANSFER=0 -e CUDA_VISIBLE_DEVICES= ghcr.io/huggingface/text-generation-inference:latest --model-id NousResearch/Llama-2-7b-hf ``` - logs ```...

Option to use CPU instead

@OlivierDehaene Thanks for the suggestion. I was doing this on AWS with g5.xlarge instance type and getting OOM , i bumped up the instance type to g5.2xlarge and able to...

The HF_TRANSFER is not working for the model CalderaAI/30B-Lazarus

This works for me ``` docker run --shm-size 1g --net=host -p 8080:80 -v $PWD/data:/data -e HUGGING_FACE_HUB_TOKEN=$token -e HF_HUB_ENABLE_HF_TRANSFER=0 ghcr.io/huggingface/text-generation-inference:latest --model-id TheBloke/Llama-2-13B-chat-GGML --quantize bitsandbytes ```

Server and client dashboard

Adding a UI or at least API to rathole, would kick the ass of FRP