
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

163 BELLE issues

Line 172 of BELLE/1.5M/zh_seed_tasks.json reads "给定一个主题,基于这个主题写一篇作为,要求立意清晰,思想积极向上。。" — it should be "作文" (essay), not "作为".
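
A one-off patch along these lines would fix the typo in place (a sketch; it assumes the file is UTF-8 text at the path given in the issue):

```
# Hypothetical one-off fix for the typo reported above; path taken from the issue.
path = "BELLE/1.5M/zh_seed_tasks.json"
with open(path, encoding="utf-8") as f:
    text = f.read()
with open(path, "w", encoding="utf-8") as f:
    f.write(text.replace("写一篇作为", "写一篇作文"))
```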

```
deepspeed --num_gpus=1 finetune.py --model_config_file run_config/Llama_config.json --deepspeed run_config/deepspeed_config.json
```
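
For reference, a config of the kind the `--deepspeed` flag expects might look like the following. The field names are standard DeepSpeed options; the values are illustrative, not BELLE's shipped run_config/deepspeed_config.json:

```
# Minimal DeepSpeed config sketch: micro-batching, fp16, and ZeRO stage 2.
import json

deepspeed_config = {
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 16,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
}

with open("run_config/deepspeed_config.json", "w") as f:
    json.dump(deepspeed_config, f, indent=2)
```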

Hello authors — my GPU memory is too small, so I would like to fine-tune a quantized model. I downloaded BELLE-7B-gptq; how should I configure Bloom_config.json? Looking forward to your reply, thanks.

Hi all. I ran the Bloom model with fp16 changed to False, and it failed with the following error:

```
Traceback (most recent call last):
  File "finetune.py", line 236, in <module>
    train(args)
  File "finetune.py", line 214, in train
    trainer.train(resume_from_checkpoint = args.resume_from_checkpoint)
  File "/root/anaconda3/envs/Belle/lib/python3.8/site-packages/transformers/trainer.py", line 1662, in train
    return inner_training_loop(...
```
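
For context, the fp16 switch ultimately feeds transformers' `TrainingArguments`; a minimal sketch of the flag in isolation (the BELLE-specific config plumbing is elided):

```
# fp16=False trains in full precision, roughly doubling activation and
# optimizer memory; the flag is a standard transformers TrainingArguments field.
from transformers import TrainingArguments

args = TrainingArguments(output_dir="out", fp16=False)
```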

I was running gptq, using the inference script from the md docs:

```
CUDA_VISIBLE_DEVICES=0 python bloom_inference.py bloom --wbits 4 --groupsize 128 --load bloom/bloom7b-0.2m-8bit-128g.pt --text "hello"
```

After quantization, BELLE-7B (Bloom) inference is significantly slower. BELLE-7B (LLaMA) inference also slows down somewhat after quantization. Code:

```
import time
import torch
import torch.nn as nn
from gptq import *
from modelutils import *
from quant import *
from transformers import AutoTokenizer
from random...
```
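
To quantify the slowdown, a rough tokens-per-second check like the one below can be run once against the fp16 checkpoint and once against the quantized one (a sketch, not BELLE's benchmark; the model name is a placeholder for the local checkpoint):

```
# Time a single generate() call and report tokens/sec.
import time
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "BelleGroup/BELLE-7B-2M"  # placeholder: substitute the local checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16).cuda()

inputs = tokenizer("hello", return_tensors="pt").to("cuda")
torch.cuda.synchronize()
start = time.time()
out = model.generate(**inputs, max_new_tokens=64)
torch.cuda.synchronize()
elapsed = time.time() - start
new_tokens = out.shape[1] - inputs["input_ids"].shape[1]
print(f"{new_tokens / elapsed:.1f} tokens/sec")
```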

Reproduced several recently released large models; after int8 quantization each needs roughly 8 GB of GPU memory and can run on a single card: [belle](https://github.com/Tongjilibo/bert4torch/blob/master/examples/basic/basic_language_model_belle.py), [chatglm](https://github.com/Tongjilibo/bert4torch/blob/master/examples/basic/basic_language_model_chatglm.py), [llama](https://github.com/Tongjilibo/bert4torch/blob/master/examples/basic/basic_language_model_llama.py)
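
For comparison, int8 loading with plain transformers/bitsandbytes (rather than the bert4torch examples linked above) follows this pattern — a sketch, assuming bitsandbytes is installed; the checkpoint name is illustrative:

```
# load_in_8bit quantizes the linear layers at load time, bringing a 7B model
# down to roughly 8 GB so it fits on a single consumer GPU.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "BelleGroup/BELLE-7B-2M"  # illustrative
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",
    load_in_8bit=True,
)
```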

Is continued (incremental) pre-training on the original llama supported?
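
For reference, continued pre-training is just causal-LM training resumed from the original weights; a minimal sketch with transformers' Trainer (paths and dataset are placeholders, and this is not BELLE's finetune.py):

```
# Continued pre-training sketch: plain next-token objective on raw text,
# starting from the original LLaMA weights. All names are placeholders.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base = "path/to/llama-7b-hf"  # placeholder: converted original LLaMA weights
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

dataset = load_dataset("text", data_files={"train": "corpus.txt"})["train"]
dataset = dataset.map(
    lambda x: tokenizer(x["text"], truncation=True, max_length=512),
    remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", per_device_train_batch_size=1,
                           gradient_accumulation_steps=16, fp16=True),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```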

As the title says: my environment is Python 3.9 + PyTorch 1.9.0 + CUDA 11.2, loading the 4-bit model, but the quant_cuda extension built via setup_cuda.py apparently cannot be imported properly. Should that file be placed in the root of the gptq folder?
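
A quick way to check whether (and from where) the compiled extension is visible — a sketch, assuming it was built with `python setup_cuda.py install` from the GPTQ directory, which installs a compiled module named quant_cuda rather than leaving a .cpp file to import:

```
# Verify the compiled GPTQ CUDA extension is importable and locate it on disk.
import sys

try:
    import quant_cuda  # the compiled extension built by setup_cuda.py
    print("quant_cuda imported from:", quant_cuda.__file__)
except ImportError as err:
    print("import failed:", err)
    print("searched paths:", *sys.path, sep="\n  ")
```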