Karan Singh
Waiting for this feature to be added to S4cmd. FYI, S3cmd has it.
@rapiz1 I am an FRP user and have been using it in prod for a year. But now I am stuck with FRP because of its large footprint (~10MB). For...
+1 to UI +100 to API
We have a use case to install Grafana Agent on hundreds of OpenWRT-based routers; these devices are based on the MIPS architecture. Currently we are building our own. It would...
Hi @rfratto 👋 Would really appreciate your attention on this.
@missingcharacter did you manage to find a workaround for this?
I tried setting `CUDA_VISIBLE_DEVICES=`, but it failed:
```
docker run --shm-size 1g --net=host -p 8080:80 -v $PWD/Llama-2-7b-hf:/data -e HUGGING_FACE_HUB_TOKEN=$token -e HF_HUB_ENABLE_HF_TRANSFER=0 -e CUDA_VISIBLE_DEVICES= ghcr.io/huggingface/text-generation-inference:latest --model-id NousResearch/Llama-2-7b-hf
```
- logs ```...
@OlivierDehaene Thanks for the suggestion. I was doing this on AWS with a g5.xlarge instance and getting OOM; I bumped the instance type up to g5.2xlarge and was able to...
This works for me:
```
docker run --shm-size 1g --net=host -p 8080:80 -v $PWD/data:/data -e HUGGING_FACE_HUB_TOKEN=$token -e HF_HUB_ENABLE_HF_TRANSFER=0 ghcr.io/huggingface/text-generation-inference:latest --model-id TheBloke/Llama-2-13B-chat-GGML --quantize bitsandbytes
```
Adding a UI, or at least an API, to rathole would kick the ass of FRP.