Zhang Chen
> install tensorflow 1.14 solved my problem, thank you
It seems davinci hasn't generated a sentence with `"` as an EOS; when I add the `--keep_outputs_without_eos` flag, everything works.
I ran into the same error and solved it just by setting `OPENAI_PROXY` in config.yaml. It seems the `http_proxy`/`https_proxy` environment variables don't take effect.
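For reference, a minimal sketch of what that config.yaml entry might look like — the key name comes from the comment above, but the proxy address and the surrounding file layout are assumptions:

```yaml
# config.yaml (sketch; proxy address is a placeholder assumption)
OPENAI_PROXY: "http://127.0.0.1:7890"
```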
> This is a good idea. Would you mind shooting us an email at [[email protected]](mailto:[email protected]). I'd like to chat a bit over email about your use case so we can...
> which finetune.py were you using? did you mix the use of Qwen and Qwen2 model and code? I'm using [Qwen/finetune.py](https://github.com/QwenLM/Qwen/blob/main/finetune.py). I was finetuning Qwen 1.5 with model_max_length 32768, and...
> we are also in the process of deprecating the finetune.py in this repo and we advise you to use training frameworks, including [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl), [Llama-Factory](https://github.com/hiyouga/LLaMA-Factory), [Swift](https://github.com/modelscope/swift), etc., to finetune your...
@yangjianxin1 where can I find the `src/train.py` mentioned in https://qwen.readthedocs.io/en/latest/training/SFT/llama_factory.html?
How should I set the `--flash_attn` param? There is an error message: "**train.py: error: argument --flash_attn: expected one argument**"
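That error is what argparse prints when an option is defined as taking a value but is passed as a bare flag. A minimal sketch of the likely cause — the option definition and the value `"auto"` here are assumptions, not taken from the actual train.py:

```python
import argparse

# Sketch: --flash_attn defined as an option that expects one value.
# Passing the bare flag (e.g. just "--flash_attn") would trigger
# "error: argument --flash_attn: expected one argument".
parser = argparse.ArgumentParser(prog="train.py")
parser.add_argument("--flash_attn", type=str)  # expects a value

# Supplying a value parses cleanly ("auto" is a placeholder assumption):
args = parser.parse_args(["--flash_attn", "auto"])
print(args.flash_attn)  # auto
```

So the fix is to pass a value after the flag rather than the flag alone; the accepted values depend on how the script defines the argument.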
According to the [README](https://github.com/QwenLM/Qwen/blob/main/README.md), Qwen-7B does support a 32K max length.
I noticed input_ids are padded to the max_length in the batch, but labels are not. The traceback points to `site-packages\transformers\tokenization_utils_base.py`, line 3450: ``` Traceback (most recent call last): File...
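A minimal sketch of one way to bring labels in line with the padded input_ids: pad each label sequence to the batch max_length with `-100`, which PyTorch's `CrossEntropyLoss` ignores by default. The helper name and the choice of `-100` are my assumptions, not taken from the Qwen code:

```python
# Sketch: pad label sequences to max_length so they match padded
# input_ids. -100 is assumed as the pad value because it is the
# default ignore_index of torch.nn.CrossEntropyLoss; the helper
# itself is hypothetical.
def pad_labels(labels, max_length, ignore_id=-100):
    return [seq + [ignore_id] * (max_length - len(seq)) for seq in labels]

labels = [[101, 102, 103], [201, 202]]
padded = pad_labels(labels, max_length=4)
# padded == [[101, 102, 103, -100], [201, 202, -100, -100]]
```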