JaonLiu
> No, I didn't encounter that error. Can you give me more context?

Just use:

```
instructions = [
    "模仿鲁迅的风格, 吐槽一下最近食堂饭菜涨价",  # "In the style of Lu Xun, grumble about the recent rise in canteen food prices"
]
```
After deploying Qwen1.5-0.5B-Chat-GPTQ-Int4 and Qwen1.5-0.5B-Chat-GPTQ-Int8 with vLLM, every request fails. Only Qwen1.5-0.5B-Chat-AWQ and Qwen1.5-0.5B-Chat return results normally. The error is the same in each case:

```
ransformers/tokenization_utils_fast.py", line 612, in convert_tokens_to_string
    return self.backend_tokenizer.decoder.decode(tokens)
TypeError: argument 'tokens': 'NoneType' object cannot be converted to 'PyString'
```
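For reference, a minimal repro sketch assuming the model is loaded through vLLM's offline `LLM` API; the report above does not show the exact deployment command, so the model path and sampling settings here are placeholders:

```
from vllm import LLM, SamplingParams

# Assumed repro: load the GPTQ checkpoint with vLLM and run one request.
# The TypeError above is raised while detokenizing the generated token ids.
llm = LLM(model="Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int4", quantization="gptq")
params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["模仿鲁迅的风格, 吐槽一下最近食堂饭菜涨价"], params)
print(outputs[0].outputs[0].text)
```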
@JianxinMa Please help.
Same problem here, +1.
same question
> > same OOM question
>
> Setting placement_policy='cpu' can alleviate this issue.

How long would it take to run train_sft.py using only the CPU?
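For context, a minimal sketch of how a CPU placement policy is typically passed to ColossalAI's Gemini plugin; the exact class and argument names used by train_sft.py depend on the pinned ColossalAI version, so treat everything below as an assumption rather than the project's actual code:

```
import colossalai
from colossalai.booster import Booster
from colossalai.booster.plugin import GeminiPlugin

# Assumed setup: placement_policy="cpu" offloads parameters and optimizer states
# to host RAM and streams them to the GPU as needed, so compute still runs on the
# GPU while VRAM pressure drops sharply (at the cost of slower steps).
colossalai.launch_from_torch(config={})
plugin = GeminiPlugin(placement_policy="cpu")
booster = Booster(plugin=plugin)
# model, optimizer, criterion, dataloader, lr_scheduler = booster.boost(...)
```

Note that this only offloads states to the CPU; the forward and backward passes still run on the GPU, so it is not a pure-CPU run.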
Heal our children!
> We ran our LLaMA 7B on 4 * A100 80G. If you want to run it on a 40G A100, you can use a smaller batch size and increase accumulation_steps...
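To illustrate the trade-off, a small sketch with made-up numbers; the actual flag names and defaults in train_sft.py may differ:

```
# Effective batch size = per-device batch * accumulation steps * number of GPUs.
# Shrinking the per-device batch to fit a 40G card and raising accumulation_steps
# by the same factor keeps the effective batch size (and optimizer behaviour) unchanged.
num_gpus = 4

per_device_batch_80g = 4        # hypothetical value that fits on an 80G A100
accumulation_steps_80g = 8
effective_80g = per_device_batch_80g * accumulation_steps_80g * num_gpus    # 128

per_device_batch_40g = 2        # halved to fit a 40G A100
accumulation_steps_40g = 16     # doubled to compensate
effective_40g = per_device_batch_40g * accumulation_steps_40g * num_gpus    # 128

assert effective_80g == effective_40g
```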
@yyoon Could you help to solve it? Thanks a lot!
I can't find where the function load_msra_ner_without_dev is defined.