blog
Public repo for HF blog posts
Fixed two broken links that were leading to 404 error pages.
history: [['You are a world-renowned expert on quantum mechanics and the Bell inequality. Do you understand?', '']]

```
Exception in thread Thread-10 (generate_and_signal_complete):
Traceback (most recent call last):
  File "/home/developer/mambaforge/envs/Guanaco/lib/python3.10/threading.py", ...
```
Just curious: when will QLoRA support quantization of new models?
Following the instructions in the blog post on assisted generation, I run into some issues. (FYI, both the longform_model and the assistant_model are fine-tuned versions of OPT, which is the exact...
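For reference, a minimal sketch of how assisted generation is typically invoked in transformers (>= 4.29), assuming both models share the same tokenizer; the OPT checkpoint names below are placeholders, not the fine-tuned models from the question:

```
# Sketch of assisted generation; checkpoint names are placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-1.3b")
longform_model = AutoModelForCausalLM.from_pretrained("facebook/opt-1.3b")
assistant_model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")

inputs = tokenizer("The Bell inequality states that", return_tensors="pt")

# The small assistant drafts candidate tokens; the main model verifies them
# in a single forward pass, so (with greedy decoding) the output matches what
# the main model would have produced on its own.
outputs = longform_model.generate(
    **inputs,
    assistant_model=assistant_model,
    max_new_tokens=50,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Assisted generation requires the assistant to use the same tokenizer as the main model, so mismatched fine-tunes are a common source of errors here.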
How can I reduce the generation time for more than 100 tokens? The model takes 1 minute to generate 100 tokens in 4-bit quantization.
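A hedged sketch of one way to measure this, assuming a CUDA GPU, bitsandbytes, and a recent transformers; the checkpoint name is a placeholder and throughput depends heavily on hardware:

```
# Sketch: time 100 new tokens under 4-bit quantization; model id is a placeholder.
import time
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "facebook/opt-1.3b"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    device_map="auto",  # place layers on the GPU rather than the CPU
)

inputs = tokenizer("Hello, my name is", return_tensors="pt").to(model.device)

start = time.time()
model.generate(**inputs, max_new_tokens=100, use_cache=True)
print(f"{100 / (time.time() - start):.1f} tokens/s")
```

If generation is this slow, a usual first check is whether any layers were offloaded to CPU for lack of GPU memory, since quantized inference on offloaded layers is drastically slower.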
```
accelerate 0.19.0
gym 0.21.0
huggingface-hub 0.14.1
numpy 1.24.3
packaging 23.1
pandas 2.0.1
transformers 4.29.2
```

Platform: I have tried on both Linux and Windows. Python version: 3.8.10. I am trying to execute...
Hi, in the `bitsandbytes` [integration blog](https://github.com/huggingface/blog/blob/main/hf-bitsandbytes-integration.md), it says one could retrieve the FP16 weights via

```
(int8_model[0].weight.CB * int8_model[0].weight.SCB) / 127
```

However, this is incorrect. In the case of...
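For context, that formula corresponds to the per-row absmax scheme the blog post describes, where SCB holds each row's absolute maximum. Below is a standalone sketch of that round trip; the names `cb`/`scb` mirror `weight.CB`/`weight.SCB`, but this illustrates only the math, not bitsandbytes' internal weight layout, which is what the issue is about:

```
# Standalone sketch of per-row absmax int8 (de)quantization; cb/scb mirror
# weight.CB / weight.SCB but do not reproduce bitsandbytes internals.
import torch

weight = torch.randn(4, 8)

scb = weight.abs().max(dim=1).values                             # per-row absmax, shape [4]
cb = torch.round(weight * 127.0 / scb[:, None]).to(torch.int8)   # int8 codes, shape [4, 8]

dequant = (cb.float() * scb[:, None]) / 127.0                    # inverse scaling
print((weight - dequant).abs().max())                            # small round-off error
```

Note the explicit `[:, None]`: here `cb` is a matrix while `scb` is a per-row vector, so the scale must be broadcast along rows for the elementwise product to be well defined.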
Hello! While trying to run the model on my own dataset, I tried to run the source code on Google Colab as well as my local machine and in both...
Can we use OpenLLaMA weights as a base for StackLLaMA? Are they directly compatible, do they require conversion, or are they incompatible?
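A quick way to probe this is to load the published OpenLLaMA weights with the standard LLaMA classes that StackLLaMA builds on; a sketch, assuming the openlm-research/open_llama_7b checkpoint on the Hub:

```
# Sketch: check whether OpenLLaMA weights load as a standard LLaMA model.
from transformers import LlamaForCausalLM, LlamaTokenizer

model_id = "openlm-research/open_llama_7b"
model = LlamaForCausalLM.from_pretrained(model_id)
tokenizer = LlamaTokenizer.from_pretrained(model_id)

# Loading without shape mismatches means the architecture is compatible;
# the OpenLLaMA tokenizer was trained separately from Meta's LLaMA one,
# so tokenization behavior is the main thing to verify downstream.
print(model.config)
```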